Latest an news

A Bayesian approach to disease clustering using restricted Chinese restaurant processes

By projecteuclid.org
Published On :: Wed, 08 Apr 2020 22:01 EDT

Claudia Wehrhahn, Samuel Leonard, Abel Rodriguez, Tatiana Xifara.

Source: Electronic Journal of Statistics, Volume 14, Number 1, 1449--1478.

Abstract:
Identifying disease clusters (areas with an unusually high incidence of a particular disease) is a common problem in epidemiology and public health. We describe a Bayesian nonparametric mixture model for disease clustering that constrains clusters to be made of adjacent areal units. This is achieved by modifying the exchangeable partition probability function associated with the Ewen’s sampling distribution. We call the resulting prior the Restricted Chinese Restaurant Process, as the associated full conditional distributions resemble those associated with the standard Chinese Restaurant Process. The model is illustrated using synthetic data sets and in an application to oral cancer mortality in Germany.

A Bayesian approach to disease clustering using restricted Chinese restaurant processes

A fast and consistent variable selection method for high-dimensional multivariate linear regression with a large number of explanatory variables

Computing the degrees of freedom of rank-regularized estimators and cousins

Rate optimal Chernoff bound and application to community detection in the stochastic block models

Consistency and asymptotic normality of Latent Block Model estimators

&#36;k&#36;-means clustering of extremes

Sparsely observed functional time series: estimation and prediction

Testing goodness of fit for point processes via topological data analysis

On the distribution, model selection properties and uniqueness of the Lasso estimator in low and high dimensions

Reduction problems and deformation approaches to nonstationary covariance functions over spheres

On a Metropolis–Hastings importance sampling estimator

Modal clustering asymptotics with applications to bandwidth selection

Estimation of a semiparametric transformation model: A novel approach based on least squares minimization

The bias and skewness of M -estimators in regression

A Low Complexity Algorithm with O(√T) Regret and O(1) Constraint Violations for Online Convex Optimization with Long Term Constraints

A Model of Fake Data in Data-driven Analysis

Lower Bounds for Parallel and Randomized Convex Optimization

Path-Based Spectral Clustering: Guarantees, Robustness to Outliers, and Fast Algorithms

On Mahalanobis Distance in Functional Settings

Weighted Message Passing and Minimum Energy Flow for Heterogeneous Stochastic Block Models with Side Information

Neyman-Pearson classification: parametrics and sample size requirement

Generalized probabilistic principal component analysis of correlated data

On lp-Support Vector Machines and Multidimensional Kernels

Perturbation Bounds for Procrustes, Classical Scaling, and Trilateration, with Applications to Manifold Learning

Expectation Propagation as a Way of Life: A Framework for Bayesian Inference on Partitioned Data

Connecting Spectral Clustering to Maximum Margins and Level Sets

High-Dimensional Interactions Detection with Sparse Principal Hessian Matrix

Convergences of Regularized Algorithms and Stochastic Gradient Methods with Random Projections

Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

GluonCV and GluonNLP: Deep Learning in Computer Vision and Natural Language Processing

Lower Bounds for Testing Graphical Models: Colorings and Antiferromagnetic Ising Models

Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

On the consistency of graph-based Bayesian semi-supervised learning and the scalability of sampling algorithms

On the Complexity Analysis of the Primal Solutions for the Accelerated Randomized Dual Coordinate Ascent

Noise Accumulation in High Dimensional Classification and Total Signal Index

Latent Simplex Position Model: High Dimensional Multi-view Clustering with Uncertainty Quantification

Learning Linear Non-Gaussian Causal Models in the Presence of Latent Variables

Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables

Branch and Bound for Piecewise Linear Neural Network Verification

Greedy Attack and Gumbel Attack: Generating Adversarial Examples for Discrete Data

Ancestral Gumbel-Top-k Sampling for Sampling Without Replacement

Skill Rating for Multiplayer Games. Introducing Hypernode Graphs and their Spectral Theory

Sparse and low-rank multivariate Hawkes processes

Robust Asynchronous Stochastic Gradient-Push: Asymptotically Optimal and Network-Independent Performance for Strongly Convex Functions

Exact Guarantees on the Absence of Spurious Local Minima for Non-negative Rank-1 Robust Principal Component Analysis

Kymatio: Scattering Transforms in Python

Multiparameter Persistence Landscapes

On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics

Union of Low-Rank Tensor Spaces: Clustering and Completion

Estimation of a Low-rank Topic-Based Model for Information Cascades

The Finish Line: Cast Stone and EIFS

The Finish Line: Changing Stucco to EIFS

The Finish Line: Cleaning EIFS

The Finish Line: Earthquakes and EIFS

The Finish Line: Adhesives vs. Mechanical Fasteners

The Finish Line: EPS Vs. Polyisocyanurate Insulation

The Finish Line: Sealants

The Finish Line: Building Walls in the Land Down Under

EPDs, HPDs and Red Lists (Oh My)!

Building Product Transparency— Be Careful What You Ask For

Anti-LEED Legislation

An Energy Label for Buildings

Benefits of the Variable Refrigerant Flow

New Gadget Analyzes Everything Including Building Industry

ANSI Green Globes 2015

Subscribe To Our Newsletter

$k$-means clustering of extremes