Latest l news

Incorporating conditional dependence in latent class models for probabilistic record linkage: Does it matter?

By projecteuclid.org
Published On :: Wed, 16 Oct 2019 22:03 EDT

Huiping Xu, Xiaochun Li, Changyu Shen, Siu L. Hui, Shaun Grannis.

Source: The Annals of Applied Statistics, Volume 13, Number 3, 1753--1790.

Abstract:
The conditional independence assumption of the Felligi and Sunter (FS) model in probabilistic record linkage is often violated when matching real-world data. Ignoring conditional dependence has been shown to seriously bias parameter estimates. However, in record linkage, the ultimate goal is to inform the match status of record pairs and therefore, record linkage algorithms should be evaluated in terms of matching accuracy. In the literature, more flexible models have been proposed to relax the conditional independence assumption, but few studies have assessed whether such accommodations improve matching accuracy. In this paper, we show that incorporating the conditional dependence appropriately yields comparable or improved matching accuracy than the FS model using three real-world data linkage examples. Through a simulation study, we further investigate when conditional dependence models provide improved matching accuracy. Our study shows that the FS model is generally robust to the conditional independence assumption and provides comparable matching accuracy as the more complex conditional dependence models. However, when the match prevalence approaches 0% or 100% and conditional dependence exists in the dominating class, it is necessary to address conditional dependence as the FS model produces suboptimal matching accuracy. The need to address conditional dependence becomes less important when highly discriminating fields are used. Our simulation study also shows that conditional dependence models with misspecified dependence structure could produce less accurate record matching than the FS model and therefore we caution against the blind use of conditional dependence models.

Incorporating conditional dependence in latent class models for probabilistic record linkage: Does it matter?

A hierarchical Bayesian model for single-cell clustering using RNA-sequencing data

A Bayesian mark interaction model for analysis of tumor pathology images

Sequential decision model for inference and prediction on nonuniform hypergraphs with application to knot matching from computational forestry

Network classification with applications to brain connectomics

RCRnorm: An integrated system of random-coefficient hierarchical regression models for normalizing NanoString nCounter data

Modeling seasonality and serial dependence of electricity price curves with warping functional autoregressive dynamics

Distributional regression forests for probabilistic precipitation forecasting in complex terrain

Fast dynamic nonparametric distribution tracking in electron microscopic data

Network modelling of topological domains using Hi-C data

Spatio-temporal short-term wind forecast: A calibrated regime-switching method

The classification permutation test: A flexible approach to testing for covariate imbalance in observational studies

Identifying multiple changes for a functional data sequence with application to freeway traffic segmentation

A hidden Markov model approach to characterizing the photo-switching behavior of fluorophores

Imputation and post-selection inference in models with missing data: An application to colorectal cancer surveillance guidelines

Introduction to papers on the modeling and analysis of network data—II

Stratonovich type integration with respect to fractional Brownian motion with Hurst parameter less than &#36;1/2&#36;

Local law and Tracy–Widom limit for sparse stochastic block models

Frequency domain theory for functional time series: Variance decomposition and an invariance principle

Bayesian linear regression for multivariate responses under group sparsity

A refined Cramér-type moderate deviation for sums of local statistics

Weighted Lépingle inequality

Convergence of persistence diagrams for topological crackle

Concentration of the spectral norm of Erdős–Rényi random graphs

On Sobolev tests of uniformity on the circle with an extension to the sphere

Exponential integrability and exit times of diffusions on sub-Riemannian and metric measure spaces

Scaling limits for super-replication with transient price impact

Directional differentiability for supremum-type functionals: Statistical applications

Noncommutative Lebesgue decomposition and contiguity with applications in quantum statistics

Perfect sampling for Gibbs point processes using partial rejection sampling

First-order covariance inequalities via Stein’s method

On estimation of nonsmooth functionals of sparse normal means

On sampling from a log-concave density using kinetic Langevin diffusions

Busemann functions and semi-infinite O’Connell–Yor polymers

On the best constant in the martingale version of Fefferman’s inequality

Functional weak limit theorem for a local empirical process of non-stationary time series and its application

Logarithmic Sobolev inequalities for finite spin systems and applications

Kernel and wavelet density estimators on manifolds and more general metric spaces

Optimal functional supervised classification with separation condition

A fast algorithm with minimax optimal guarantees for topic models with an unknown number of topics

Local differential privacy: Elbow effect in optimal density estimation and adaptation over Besov ellipsoids

On the eigenproblem for Gaussian bridges

Influence of the seed in affine preferential attachment trees

Estimating the number of connected components in a graph via subgraph sampling

Sojourn time dimensions of fractional Brownian motion

Efficient estimation in single index models through smoothing splines

Random orthogonal matrices and the Cayley transform

Reliable clustering of Bernoulli mixture models

On the probability distribution of the local times of diagonally operator-self-similar Gaussian fields with stationary increments

Limit theorems for long-memory flows on Wiener chaos

The finish line: Attachment of Signs

The Finish Line: Katrina One Year After

The Finish Line: Cast Stone and EIFS

The Finish Line: Changing Stucco to EIFS

The Finish Line: A Case Study: What is Causing This?

The Finish Line: All About Rust

The Finish Line: Backwrapping vs. Edgewrapping

The Finish Line: Cleaning EIFS

The Finish Line: Floor Line Joints

The Finish Line: FAQ's About EIFS Part 1

The Finish Line: Drainage Efficiency

The Finish Line: Earthquakes and EIFS

The Finish Line: Types of EIFS

The Finish Line: Eco-Friendliness of EIFS

The Finish Line: Foam Shapes Revisited

Subscribe To Our Newsletter

Stratonovich type integration with respect to fractional Brownian motion with Hurst parameter less than $1/2$