Latest nc news

High-Dimensional Inference for Cluster-Based Graphical Models

By
Published On :: 2020

Motivated by modern applications in which one constructs graphical models based on a very large number of features, this paper introduces a new class of cluster-based graphical models, in which variable clustering is applied as an initial step for reducing the dimension of the feature space. We employ model assisted clustering, in which the clusters contain features that are similar to the same unobserved latent variable. Two different cluster-based Gaussian graphical models are considered: the latent variable graph, corresponding to the graphical model associated with the unobserved latent variables, and the cluster-average graph, corresponding to the vector of features averaged over clusters. Our study reveals that likelihood based inference for the latent graph, not analyzed previously, is analytically intractable. Our main contribution is the development and analysis of alternative estimation and inference strategies, for the precision matrix of an unobservable latent vector Z. We replace the likelihood of the data by an appropriate class of empirical risk functions, that can be specialized to the latent graphical model and to the simpler, but under-analyzed, cluster-average graphical model. The estimators thus derived can be used for inference on the graph structure, for instance on edge strength or pattern recovery. Inference is based on the asymptotic limits of the entry-wise estimates of the precision matrices associated with the conditional independence graphs under consideration. While taking the uncertainty induced by the clustering step into account, we establish Berry-Esseen central limit theorems for the proposed estimators. It is noteworthy that, although the clusters are estimated adaptively from the data, the central limit theorems regarding the entries of the estimated graphs are proved under the same conditions one would use if the clusters were known in advance. As an illustration of the usage of these newly developed inferential tools, we show that they can be reliably used for recovery of the sparsity pattern of the graphs we study, under FDR control, which is verified via simulation studies and an fMRI data analysis. These experimental results confirm the theoretically established difference between the two graph structures. Furthermore, the data analysis suggests that the latent variable graph, corresponding to the unobserved cluster centers, can help provide more insight into the understanding of the brain connectivity networks relative to the simpler, average-based, graph.

High-Dimensional Inference for Cluster-Based Graphical Models

Fast Rates for General Unbounded Loss Functions: From ERM to Generalized Bayes

Robust Asynchronous Stochastic Gradient-Push: Asymptotically Optimal and Network-Independent Performance for Strongly Convex Functions

Exact Guarantees on the Absence of Spurious Local Minima for Non-negative Rank-1 Robust Principal Component Analysis

Multiparameter Persistence Landscapes

Generalized Optimal Matching Methods for Causal Inference

Smoothed Nonparametric Derivative Estimation using Weighted Difference Quotients

The weight function in the subtree kernel is decisive

(1 + epsilon)-class Classification: an Anomaly Detection Method for Highly Imbalanced or Incomplete Data Sets

Identifiability of Additive Noise Models Using Conditional Variances

TIGER: using artificial intelligence to discover our collections

Q&A with Tara June Winch

Town launches new Community Support Hotline

Branching random walks with uncountably many extinction probability vectors

Recent developments in complex and spatially correlated functional data

On estimating the location parameter of the selected exponential population under the LINEX loss function

A primer on the characterization of the exchangeable Marshall–Olkin copula via monotone sequences

Nonparametric discrimination of areal functional data

Bootstrap-based testing inference in beta regressions

Bayesian inference on power Lindley distribution based on different loss functions

Keeping the balance—Bridge sampling for marginal likelihood estimation in finite mixture, mixture of experts and Markov mixture models

Influence measures for the Waring regression model

A temporal perspective on the rate of convergence in first-passage percolation under a moment condition

Hierarchical modelling of power law processes for the analysis of repairable systems with different truncation times: An empirical Bayes approach

Necessary and sufficient conditions for the convergence of the consistent maximal displacement of the branching random walk

The equivalence of dynamic and static asset allocations under the uncertainty caused by Poisson processes

Odysseus asleep : uncollected sequences, 1994-2019

Reclaiming indigenous governance : reflections and insights from Australia, Canada, New Zealand, and the United States

Flexible, boundary adapted, nonparametric methods for the estimation of univariate piecewise-smooth functions

Scalar-on-function regression for predicting distal outcomes from intensively gathered longitudinal data: Interpretability for applied scientists

Pitfalls of significance testing and &#36;p&#36;-value variability: An econometrics perspective

Statistical inference for dynamical systems: A review

$M$-functionals of multivariate scatter

Semi-parametric estimation for conditional independence multivariate finite mixture models

Log-concavity and strong log-concavity: A review

Adaptive clinical trial designs for phase I cancer studies

Analyzing complex functional brain networks: Fusing statistics and network science to understand the brain

Statistical inference for disordered sphere packings

Curse of dimensionality and related issues in nonparametric functional regression

Identifying the consequences of dynamic treatment strategies: A decision-theoretic overview

Discrete variations of the fractional Brownian motion in the presence of outliers and an additive noise

Was one of your ancestors a whaler?

Was your ancestor a doctor?

Statistical errors in Monte Carlo-based inference for random elements. (arXiv:2005.02532v2 [math.ST] UPDATED)

Interpreting Rate-Distortion of Variational Autoencoder and Using Model Uncertainty for Anomaly Detection. (arXiv:2005.01889v2 [cs.LG] UPDATED)

Data-Space Inversion Using a Recurrent Autoencoder for Time-Series Parameterization. (arXiv:2005.00061v2 [stat.ML] UPDATED)

A Global Benchmark of Algorithms for Segmenting Late Gadolinium-Enhanced Cardiac Magnetic Resonance Imaging. (arXiv:2004.12314v3 [cs.CV] UPDATED)

Strong Converse for Testing Against Independence over a Noisy channel. (arXiv:2004.00775v2 [cs.IT] UPDATED)

Mnemonics Training: Multi-Class Incremental Learning without Forgetting. (arXiv:2002.10211v3 [cs.CV] UPDATED)

On the impact of selected modern deep-learning techniques to the performance and celerity of classification models in an experimental high-energy physics use case. (arXiv:2002.01427v3 [physics.data-an] UPDATED)

The Finish Line: Drainage Efficiency

Building Product Transparency— Be Careful What You Ask For

New Gadget Analyzes Everything Including Building Industry

Cost-Effective, Energy Efficient Concrete Sandwich Panels

NCS Trust ‘sad and disappointed’ at government plans to shut it down

Companies' 'Green' Efforts Include Products’ Material Content

Incomplete information can fuel misjudgment: study

Incident involving highwall collapse spurs MSHA safety alert

Conagra Brands Announces Sustainable Development Award Winners

Flexco launches Natural Elements in wood, stone looks

The First Sealer to Give a Beautiful, Luxurious Appearance

Metallika blends beauty and function

Redi Trench Blends Design and Function in Shower Applications

Reclaimé Collection by Quick-Step includes new White Washed Oak look

Top 2024 Advances in Alternative Protein

Subscribe To Our Newsletter

Pitfalls of significance testing and $p$-value variability: An econometrics perspective