Modifying the Chi-square and the CMH test for population genetic inference: Adapting to overdispersion
Kerstin Spitzer, Marta Pelizzola, Andreas Futschik. Source: The Annals of Applied Statistics, Volume 14, Number 1, 202--220. Published 15 Apr 2020 (projecteuclid.org).
Abstract: Evolve and resequence studies provide a popular approach to simulating evolution in the lab and exploring its genetic basis. In this context, Pearson’s chi-square test, Fisher’s exact test and the Cochran–Mantel–Haenszel test are commonly used to infer genomic positions affected by selection from temporal changes in allele frequency. However, the null model associated with these tests does not match the null hypothesis of actual interest. Due to genetic drift and possibly additional noise components such as pool sequencing, the null variance in the data can be substantially larger than these common test statistics account for. This leads to $p$-values that are systematically too small and, therefore, a large number of false positives. Even if the ranking rather than the actual $p$-values is of interest, a naive application of these tests will give misleading results, because the amount of overdispersion varies from locus to locus. We therefore propose adjusted statistics that take the overdispersion into account while keeping the formulas simple. This is particularly useful in genome-wide applications, where millions of SNPs can be handled with little computational effort. We then apply the adapted test statistics to real data from Drosophila and investigate how information from intermediate generations can be included when available. We also discuss further applications such as genome-wide association studies based on pool sequencing data and tests for local adaptation.
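The central idea of the adjusted statistics, deflating the ordinary chi-square statistic by a locus-specific overdispersion factor so that the null distribution is restored, can be illustrated with a minimal sketch. The factor here is a placeholder supplied by the caller; the paper derives it from drift and pool-sequencing variance, which this toy calculation does not attempt.

```python
from scipy.stats import chi2

def adjusted_chisq_pvalue(n11, n12, n21, n22, overdispersion):
    """2x2 allele-count table at one locus, compared across two time points.

    `overdispersion` >= 1 is an assumed, externally estimated variance
    inflation factor (e.g. due to genetic drift); the adjusted statistic
    is the ordinary Pearson chi-square divided by this factor.
    """
    n = n11 + n12 + n21 + n22
    row1, row2 = n11 + n12, n21 + n22
    col1, col2 = n11 + n21, n12 + n22
    expected = [row1 * col1 / n, row1 * col2 / n,
                row2 * col1 / n, row2 * col2 / n]
    observed = [n11, n12, n21, n22]
    stat = sum((o - e) ** 2 / e for o, e in zip(observed, expected))
    adj_stat = stat / overdispersion  # key adjustment: deflate the statistic
    return adj_stat, chi2.sf(adj_stat, df=1)
```

With `overdispersion = 1` this reduces to the usual Pearson test; larger factors shrink the statistic and enlarge the $p$-value, which is the direction of the correction the paper argues for.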
Surface temperature monitoring in liver procurement via functional variance change-point analysis
Zhenguo Gao, Pang Du, Ran Jin, John L. Robertson. Source: The Annals of Applied Statistics, Volume 14, Number 1, 143--159. Published 15 Apr 2020 (projecteuclid.org).
Abstract: Liver procurement experiments with surface-temperature monitoring motivated Gao et al. (J. Amer. Statist. Assoc. 114 (2019) 773–781) to develop a variance change-point detection method under a smoothly changing mean trend. However, the spotwise change points yielded by their method do not offer immediate information to surgeons, since an organ is often transplanted as a whole or in part. We develop a new practical method that can analyze a defined portion of the organ surface at a time. It also provides a novel addition to the developing field of functional data monitoring. Furthermore, a numerical challenge emerges in simultaneously modeling the variance functions of 2D locations and the mean function of location and time. The respective sample sizes, on the order of 10,000 and 1,000,000 for these two modeling tasks, make standard spline estimation too costly to be useful. We introduce a multistage subsampling strategy with steps guided by quickly computable preliminary statistical measures. Extensive simulations show that the new method can efficiently reduce the computational cost and provide reasonable parameter estimates. Application of the new method to our liver surface temperature monitoring data shows its effectiveness in providing accurate status change information for a selected portion of the organ in the experiment.
A statistical analysis of noisy crowdsourced weather data
Arnab Chakraborty, Soumendra Nath Lahiri, Alyson Wilson. Source: The Annals of Applied Statistics, Volume 14, Number 1, 116--142. Published 15 Apr 2020 (projecteuclid.org).
Abstract: Spatial prediction of weather elements like temperature, precipitation, and barometric pressure is generally based on satellite imagery or data collected at ground stations. None of these data sources provides information at a more granular or “hyperlocal” resolution. On the other hand, crowdsourced weather data, which are captured by sensors installed on mobile devices and gathered by weather-related mobile apps like WeatherSignal and AccuWeather, can serve as potential data sources for analyzing environmental processes at a hyperlocal resolution. However, due to the low quality of the sensors and the nonlaboratory environment, the quality of the observations in crowdsourced data is compromised. This paper describes methods to improve hyperlocal spatial prediction using this varying-quality, noisy crowdsourced information. We introduce a reliability metric, the Veracity Score (VS), to assess the quality of the crowdsourced observations using coarser but high-quality reference data. A VS-based methodology to analyze noisy spatial data is proposed and evaluated through extensive simulations. The merits of the proposed approach are illustrated through case studies analyzing crowdsourced daily average ambient temperature readings for one day in the contiguous United States.
Modeling microbial abundances and dysbiosis with beta-binomial regression
Bryan D. Martin, Daniela Witten, Amy D. Willis. Source: The Annals of Applied Statistics, Volume 14, Number 1, 94--115. Published 15 Apr 2020 (projecteuclid.org).
Abstract: Using a sample from a population to estimate the proportion of the population with a certain category label is a broadly important problem. In the context of microbiome studies, this problem arises when researchers wish to use a sample from a population of microbes to estimate the population proportion of a particular taxon, known as the taxon’s relative abundance. In this paper, we propose a beta-binomial model for this task. Like existing models, our model allows for a taxon’s relative abundance to be associated with covariates of interest. However, unlike existing models, our proposal also allows for the overdispersion in the taxon’s counts to be associated with covariates of interest. We exploit this model in order to propose tests not only for differential relative abundance, but also for differential variability. The latter is particularly valuable in light of speculation that dysbiosis, the perturbation from a normal microbiome that can occur in certain disease conditions, may manifest as a loss of stability, or increase in variability, of the counts associated with each taxon. We demonstrate the performance of our proposed model using a simulation study and an application to soil microbial data.
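The distinctive feature of the model, letting both the mean relative abundance and the overdispersion depend on covariates, can be sketched as a beta-binomial negative log-likelihood. The link functions (logit for the mean, log for the overdispersion scale) and the coefficient names are illustrative assumptions, not the paper's exact parameterization.

```python
import math

def betabinom_logpmf(k, n, alpha, beta):
    """Log-density of the beta-binomial distribution via log-gamma:
    C(n,k) * B(k+alpha, n-k+beta) / B(alpha, beta)."""
    return (math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)
            + math.lgamma(k + alpha) + math.lgamma(n - k + beta)
            - math.lgamma(n + alpha + beta)
            + math.lgamma(alpha + beta)
            - math.lgamma(alpha) - math.lgamma(beta))

def neg_loglik(params, counts, totals, x):
    """Negative log-likelihood where both the mean mu and the overdispersion
    phi depend on a scalar covariate x (assumed links: logit for mu, log for
    phi; b0, b1, g0, g1 are illustrative coefficient names)."""
    b0, b1, g0, g1 = params
    nll = 0.0
    for k, n, xi in zip(counts, totals, x):
        mu = 1.0 / (1.0 + math.exp(-(b0 + b1 * xi)))   # mean relative abundance
        phi = math.exp(g0 + g1 * xi)                   # overdispersion scale
        alpha, beta = mu * phi, (1.0 - mu) * phi       # standard reparameterization
        nll -= betabinom_logpmf(k, n, alpha, beta)
    return nll
```

Minimizing `neg_loglik` over `params` (e.g. with `scipy.optimize.minimize`) would fit the sketch; a test of differential variability then amounts to testing `g1 = 0`.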
Integrative survival analysis with uncertain event times in application to a suicide risk study
Wenjie Wang, Robert Aseltine, Kun Chen, Jun Yan. Source: The Annals of Applied Statistics, Volume 14, Number 1, 51--73. Published 15 Apr 2020 (projecteuclid.org).
Abstract: The concept of integrating data from disparate sources to accelerate scientific discovery has generated tremendous excitement in many fields. The potential benefits of data integration, however, may be compromised by the uncertainty due to incomplete or imperfect record linkage. Motivated by a suicide risk study, we propose an approach for analyzing survival data with uncertain event times arising from data integration. Specifically, in our problem, deaths identified from hospital discharge records together with reported suicidal deaths determined by the Office of Medical Examiner may still not include all the death events of patients, and the missing deaths can be recovered from a complete database of death records. Since the hospital discharge data can only be linked to the death record data by matching basic patient characteristics, a patient with a censored death time in the first dataset could be linked to multiple potential event records in the second dataset. We develop an integrative Cox proportional hazards regression in which the uncertainty in the matched event times is modeled probabilistically. The estimation procedure combines the ideas of profile likelihood and the expectation conditional maximization (ECM) algorithm. Simulation studies demonstrate that, under realistic settings of imperfect data linkage, the proposed method outperforms several competing approaches, including multiple imputation. A marginal screening analysis using the proposed integrative Cox model is performed to identify risk factors associated with death following suicide-related hospitalization in Connecticut. The identified diagnostic codes are consistent with the existing literature and provide several new insights into suicide risk, prediction and prevention.
BART with targeted smoothing: An analysis of patient-specific stillbirth risk
Jennifer E. Starling, Jared S. Murray, Carlos M. Carvalho, Radek K. Bukowski, James G. Scott. Source: The Annals of Applied Statistics, Volume 14, Number 1, 28--50. Published 15 Apr 2020 (projecteuclid.org).
Abstract: This article introduces BART with Targeted Smoothing, or tsBART, a new Bayesian tree-based model for nonparametric regression. The goal of tsBART is to introduce smoothness over a single target covariate $t$ while not necessarily requiring smoothness over other covariates $x$. tsBART is based on the Bayesian Additive Regression Trees (BART) model, an ensemble of regression trees. tsBART extends BART by parameterizing each tree’s terminal nodes with smooth functions of $t$ rather than independent scalars. Like BART, tsBART captures complex nonlinear relationships and interactions among the predictors. But unlike BART, tsBART guarantees that the response surface will be smooth in the target covariate. This improves interpretability and helps to regularize the estimate. After introducing and benchmarking the tsBART model, we apply it to our motivating example: pregnancy outcomes data from the National Center for Health Statistics. Our aim is to provide patient-specific estimates of stillbirth risk across gestational age $(t)$ and based on maternal and fetal risk factors $(x)$. Obstetricians expect stillbirth risk to vary smoothly over gestational age but not necessarily over other covariates, and tsBART has been designed precisely to reflect this structural knowledge. The results of our analysis show the clear superiority of the tsBART model for quantifying stillbirth risk, thereby providing patients and doctors with better information for managing the risk of fetal mortality. All methods described here are implemented in the R package tsbart.
A general theory for preferential sampling in environmental networks
Joe Watson, James V. Zidek, Gavin Shaddick. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2662--2700. Published 27 Nov 2019 (projecteuclid.org).
Abstract: This paper presents a general model framework for detecting the preferential sampling of environmental monitors recording an environmental process across space and/or time. This is achieved by considering the joint distribution of an environmental process with a site-selection process that considers where and when sites are placed to measure the process. The environmental process may be spatial, temporal or spatio-temporal in nature. By sharing random effects between the two processes, the joint model is able to establish whether site placement was stochastically dependent on the environmental process under study. Furthermore, if stochastic dependence is identified between the two processes, then inferences about the probability distribution of the spatio-temporal process will change, as will predictions made of the process across space and time. The embedding into a spatio-temporal framework also allows for the modelling of the dynamic site-selection process itself. Real-world factors affecting both the size and location of the network can be easily modelled and quantified. Depending upon the choice of the population of locations considered for selection across space and time under the site-selection process, different insights about the precise nature of preferential sampling can be obtained. The general framework developed in the paper is designed to be easily and quickly fit using the R-INLA package. We apply this framework to a case study involving particulate air pollution over the UK, where a major reduction in the size of a monitoring network occurred through time. It is demonstrated that a significant response-biased reduction in the air quality monitoring network occurred, namely the relocation of monitoring sites to locations with the highest pollution levels and the routine removal of sites at locations with the lowest. We also show that the network was consistently unrepresentative of the levels of particulate matter seen across much of GB throughout the operating life of the network. Finally, we show that this may have led to a severe overreporting of the population-average exposure levels experienced across GB. This could have great impacts on estimates of the health effects of black smoke levels.
Bayesian indicator variable selection to incorporate hierarchical overlapping group structure in multi-omics applications
Li Zhu, Zhiguang Huo, Tianzhou Ma, Steffi Oesterreich, George C. Tseng. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2611--2636. Published 27 Nov 2019 (projecteuclid.org).
Abstract: Variable selection is a pervasive problem in modern high-dimensional data analysis, where the number of features often exceeds the sample size (the small-n-large-p problem). Incorporating group structure knowledge to improve variable selection has been widely studied. Here, we consider prior knowledge of a hierarchical overlapping group structure to improve variable selection in a regression setting. In genomics applications, for instance, a biological pathway contains tens to hundreds of genes, and a gene can be mapped to multiple experimentally measured features (such as its mRNA expression, copy number variation and methylation levels at possibly multiple sites). In addition to the hierarchical structure, the groups at the same level may overlap (e.g., two pathways can share common genes). Incorporating such hierarchical overlapping groups in the traditional penalized regression setting remains a difficult optimization problem. Alternatively, we propose a Bayesian indicator model that can elegantly serve the purpose. We evaluate the model in simulations and two breast cancer examples, and demonstrate its superior performance over existing models. The result not only enhances prediction accuracy but also improves variable selection and model interpretation, leading to deeper biological insight into the disease.
On Bayesian new edge prediction and anomaly detection in computer networks
Silvia Metelli, Nicholas Heard. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2586--2610. Published 27 Nov 2019 (projecteuclid.org).
Abstract: Monitoring computer network traffic for anomalous behaviour presents an important security challenge. Arrivals of new edges in a network graph represent connections between a client and server pair not previously observed, and in rare cases these might suggest the presence of intruders or malicious implants. We propose a Bayesian model and anomaly detection method for simultaneously characterising existing network structure and modelling likely new edge formation. The method is demonstrated on real computer network authentication data and successfully identifies some machines which are known to be compromised.
A hierarchical curve-based approach to the analysis of manifold data
Liberty Vittert, Adrian W. Bowman, Stanislav Katina. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2539--2563. Published 27 Nov 2019 (projecteuclid.org).
Abstract: One of the data structures generated by medical imaging technology is high resolution point clouds representing anatomical surfaces. Stereophotogrammetry and laser scanning are two widely available sources of this kind of data. A standardised surface representation is required to provide a meaningful correspondence across different images as a basis for statistical analysis. Point locations with anatomical definitions, referred to as landmarks, have been the traditional approach. Landmarks can also be taken as the starting point for more general surface representations, often using templates which are warped on to an observed surface by matching landmark positions and subsequent local adjustment of the surface. The aim of the present paper is to provide a new approach which places anatomical curves at the heart of the surface representation and its analysis. Curves provide intermediate structures which capture the principal features of the manifold (surface) of interest through its ridges and valleys. As landmarks are often available these are used as anchoring points, but surface curvature information is the principal guide in estimating the curve locations. The surface patches between these curves are relatively flat and can be represented in a standardised manner by appropriate surface transects to give a complete surface model. This new approach does not require the use of a template, reference sample or any external information to guide the method and, when compared with a surface based approach, the estimation of curves is shown to have improved performance. In addition, examples involving applications to mussel shells and human faces show that the analysis of curve information can deliver more targeted and effective insight than the use of full surface information.
A simple, consistent estimator of SNP heritability from genome-wide association studies
Armin Schwartzman, Andrew J. Schork, Rong Zablocki, Wesley K. Thompson. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2509--2538. Published 27 Nov 2019 (projecteuclid.org).
Abstract: Analysis of genome-wide association studies (GWAS) is characterized by a large number of univariate regressions where a quantitative trait is regressed on hundreds of thousands to millions of single-nucleotide polymorphism (SNP) allele counts, one at a time. This article proposes an estimator of the SNP heritability of the trait, defined here as the fraction of the variance of the trait explained by the SNPs in the study. The proposed GWAS heritability (GWASH) estimator is easy to compute, highly interpretable and is consistent as the number of SNPs and the sample size increase. More importantly, it can be computed from summary statistics typically reported in GWAS, not requiring access to the original data. The estimator takes full account of the linkage disequilibrium (LD) or correlation between the SNPs in the study through moments of the LD matrix, estimable from auxiliary datasets. Unlike other proposed estimators in the literature, we establish the theoretical properties of the GWASH estimator and obtain analytical estimates of the precision, allowing for power and sample size calculations for SNP heritability estimates and forming a firm foundation for future methodological development.
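To convey the flavor of a summary-statistic heritability estimator, here is a deliberately simplified sketch under the drastic assumption of independent SNPs, so that the LD-matrix moments the GWASH estimator relies on drop out. This is not the GWASH formula itself, only a caricature of the moment-based idea: under a simple polygenic model with independent SNPs, the expected squared z-score is inflated above 1 in proportion to heritability.

```python
def heritability_sketch(z_scores, n_samples):
    """Toy moment estimator of SNP heritability from GWAS z-scores.

    Assumes independent SNPs, so that E[z^2] = 1 + n * h2 / p under a
    simple polygenic model (p SNPs, n samples). The actual GWASH
    estimator additionally corrects for LD through estimated moments
    of the LD matrix; that correction is omitted here.
    """
    p = len(z_scores)
    mean_chisq = sum(z * z for z in z_scores) / p
    return p * (mean_chisq - 1.0) / n_samples
```

The sketch is consistent with the abstract's premise: only per-SNP summary statistics are needed, never the raw genotype data.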
New formulation of the logistic-Gaussian process to analyze trajectory tracking data
Gianluca Mastrantonio, Clara Grazian, Sara Mancinelli, Enrico Bibbona. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2483--2508. Published 27 Nov 2019 (projecteuclid.org).
Abstract: Improved communication systems, shrinking battery sizes and the price drop of tracking devices have led to an increasing availability of trajectory tracking data. These data are often analyzed to understand animal behavior. In this work, we propose a new model that represents the animal movement as a mixture of characteristic patterns, which we interpret as different behaviors. The probability that the animal is behaving according to a specific pattern, at each time instant, is nonparametrically estimated using the logistic-Gaussian process. Owing to a new formalization and the way we specify the coregionalization matrix of the associated multivariate Gaussian process, our model is invariant with respect to the choice of the reference element and of the ordering of the probability vector components. We fit the model under a Bayesian framework and show that the Markov chain Monte Carlo algorithm we propose is straightforward to implement. We perform a simulation study with the aim of showing the ability of the estimation procedure to retrieve the model parameters. We also test the performance of the information criterion we used to select the number of behaviors. The model is then applied to a real dataset in which a wolf was observed before and after procreation. The results are easy to interpret, and clear differences emerge between the two phases.
Empirical Bayes analysis of RNA sequencing experiments with auxiliary information
Kun Liang. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2452--2482. Published 27 Nov 2019 (projecteuclid.org).
Abstract: Finding differentially expressed genes is a common task in high-throughput transcriptome studies. While traditional statistical methods rank the genes by their test statistics alone, we analyze an RNA sequencing dataset using the auxiliary information of gene length and the test statistics from a related microarray study. Given the auxiliary information, we propose a novel nonparametric empirical Bayes procedure to estimate the posterior probability of differential expression for each gene. We demonstrate the advantage of our procedure in extensive simulation studies and a psoriasis RNA sequencing study. The companion R package calm is available on Bioconductor.
Outline analyses of the called strike zone in Major League Baseball
Dale L. Zimmerman, Jun Tang, Rui Huang. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2416--2451. Published 27 Nov 2019 (projecteuclid.org).
Abstract: We extend statistical shape analytic methods known as outline analysis for application to the strike zone, a central feature of the game of baseball. Although the strike zone is rigorously defined by Major League Baseball’s official rules, umpires make mistakes in calling pitches as strikes (and balls) and may even adhere to a strike zone somewhat different from that prescribed by the rule book. Our methods yield inference on geometric attributes (centroid, dimensions, orientation and shape) of this “called strike zone” (CSZ) and on the effects that years, umpires, player attributes, game situation factors and their interactions have on those attributes. The methodology consists of first using kernel discriminant analysis to determine a noisy outline representing the CSZ corresponding to each factor combination, then fitting existing elliptic Fourier and new generalized superelliptic models for closed curves to that outline, and finally analyzing the fitted model coefficients using standard methods of regression analysis, factorial analysis of variance and variance component estimation. We apply these methods to PITCHf/x data comprising more than three million called pitches from the 2008–2016 Major League Baseball seasons to address numerous questions about the CSZ. We find that all geometric attributes of the CSZ, except its size, became significantly more like those of the rule-book strike zone over 2008–2016, and that several player attribute and game situation factors had statistically and practically significant effects on many of them. We also establish that the variation in the horizontal center, width and area of an individual umpire’s CSZ from pitch to pitch is smaller than the variation among CSZs from different umpires.
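The outline-fitting step can be illustrated in miniature: a closed curve sampled at points around its boundary can be summarized by a truncated Fourier series of its complex coordinates, keeping only a few harmonics. This sketch is a plain Fourier descriptor, simpler than the elliptic Fourier and generalized superelliptic models used in the paper, but it shows how a noisy outline becomes a small coefficient vector amenable to regression and ANOVA.

```python
import cmath
import math

def fourier_outline(points, n_harmonics):
    """Fit a truncated Fourier series to a closed outline.

    points: list of (x, y) vertices sampled around the curve.
    Returns a function t -> (x, y) reconstructing the smoothed outline
    from harmonics -n_harmonics..n_harmonics (a plain Fourier
    descriptor, not the elliptic Fourier model of the paper).
    """
    z = [complex(x, y) for x, y in points]
    n = len(z)

    # Discrete Fourier coefficient c_k = (1/n) * sum_j z_j e^{-2 pi i k j / n}
    def coef(k):
        return sum(zj * cmath.exp(-2j * math.pi * k * j / n)
                   for j, zj in enumerate(z)) / n

    ks = range(-n_harmonics, n_harmonics + 1)
    cs = {k: coef(k) for k in ks}

    def curve(t):  # t in [0, 1)
        w = sum(cs[k] * cmath.exp(2j * math.pi * k * t) for k in ks)
        return (w.real, w.imag)

    return curve
```

A circle sampled at 64 points is reproduced exactly by a single harmonic; real called-strike-zone outlines would need a handful of harmonics, whose fitted coefficients become the response in the downstream analyses.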
Propensity score weighting for causal inference with multiple treatments
Fan Li, Fan Li. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2389--2415. Published 27 Nov 2019 (projecteuclid.org).
Abstract: Causal or unconfounded descriptive comparisons between multiple groups are common in observational studies. Motivated by a racial disparity study in health services research, we propose a unified propensity score weighting framework, the balancing weights, for estimating causal effects with multiple treatments. These weights incorporate the generalized propensity scores to balance the weighted covariate distribution of each treatment group, all weighted toward a common prespecified target population. The class of balancing weights includes several existing approaches, such as the inverse probability weights and trimming weights, as special cases. Within this framework, we propose a set of target estimands based on linear contrasts. We further develop the generalized overlap weights, constructed as the product of the inverse probability weights and the harmonic mean of the generalized propensity scores. The generalized overlap weighting scheme corresponds to the target population with the most overlap in covariates across the multiple treatments. These weights are bounded and thus bypass the problem of extreme propensities. We show that the generalized overlap weights minimize the total asymptotic variance of the moment weighting estimators for the pairwise contrasts within the class of balancing weights. We consider two balance check criteria and propose a new sandwich variance estimator for estimating the causal effects with generalized overlap weights. We apply these methods to study the racial disparities in medical expenditure between several racial groups using the 2009 Medical Expenditure Panel Survey (MEPS) data. Simulations were carried out to compare with existing methods.
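The construction described in the abstract, inverse probability weights tilted by the harmonic mean of the generalized propensity scores, has a direct computational form. A minimal sketch follows; the propensity matrix is assumed given (e.g. from a fitted multinomial model), and weights are shown unnormalized.

```python
def generalized_overlap_weights(propensities, treatment):
    """Generalized overlap weights for multiple treatments.

    propensities: list of rows, each row the generalized propensity
    scores e_1(x), ..., e_K(x) for one unit (rows sum to 1).
    treatment: index of the treatment each unit actually received.

    Each weight is the inverse probability weight 1/e_{Z}(x) times the
    harmonic mean of the K propensity scores, K / sum_k (1/e_k(x)),
    so extreme propensities cannot blow the weight up (it stays
    bounded above by K).
    """
    weights = []
    for row, z in zip(propensities, treatment):
        harmonic_mean = len(row) / sum(1.0 / e for e in row)
        weights.append(harmonic_mean / row[z])
    return weights
```

A unit with equal propensities across all treatments gets weight 1, while a unit with a near-zero propensity for its received treatment is down-weighted rather than exploding, which is the boundedness property the abstract highlights.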
A nonparametric spatial test to identify factors that shape a microbiome
Susheela P. Singh, Ana-Maria Staicu, Robert R. Dunn, Noah Fierer, Brian J. Reich. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2341--2362. Published 27 Nov 2019 (projecteuclid.org).
Abstract: The advent of high-throughput sequencing technologies has made data from DNA material readily available, leading to a surge of microbiome-related research establishing links between markers of microbiome health and specific outcomes. However, to harness the power of microbial communities we must understand not only how they affect us, but also how they can be influenced to improve outcomes. This area has been dominated by methods that reduce community composition to summary metrics, which can fail to fully exploit the complexity of community data. Recently, methods have been developed to model the abundance of taxa in a community, but they can be computationally intensive and do not account for spatial effects underlying microbial settlement. These spatial effects are particularly relevant in the microbiome setting because we expect communities that are close together to be more similar than those that are far apart. In this paper, we propose a flexible Bayesian spike-and-slab variable selection model for presence-absence indicators that accounts for spatial dependence and cross-dependence between taxa while reducing dimensionality in both directions. We show by simulation that in the presence of spatial dependence, popular distance-based hypothesis testing methods fail to preserve their advertised size, and the proposed method improves variable selection. Finally, we present an application of our method to an indoor fungal community found within homes across the contiguous United States.
A latent discrete Markov random field approach to identifying and classifying historical forest communities based on spatial multivariate tree species counts
Stephen Berg, Jun Zhu, Murray K. Clayton, Monika E. Shea, David J. Mladenoff. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2312--2340. Published 27 Nov 2019 (projecteuclid.org).
Abstract: The Wisconsin Public Land Survey database describes historical forest composition at high spatial resolution and is of interest in ecological studies of forest composition in Wisconsin just prior to significant Euro-American settlement. For such studies it is useful to identify recurring subpopulations of tree species known as communities, but standard clustering approaches for subpopulation identification do not account for dependence between spatially nearby observations. Here, we develop and fit a latent discrete Markov random field model for the purpose of identifying and classifying historical forest communities based on spatially referenced multivariate tree species counts across Wisconsin. We show empirically for the actual dataset and through simulation that our latent Markov random field modeling approach improves prediction and parameter estimation performance. For model fitting we introduce a new stochastic approximation algorithm which enables computationally efficient estimation and classification of large amounts of spatial multivariate count data.
Objective Bayes model selection of Gaussian interventional essential graphs for the identification of signaling pathways
Federico Castelletti, Guido Consonni. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2289--2311. Published 27 Nov 2019 (projecteuclid.org).
Abstract: A signalling pathway is a sequence of chemical reactions initiated by a stimulus which in turn affects a receptor, and then through some intermediate steps cascades down to the final cell response. Based on the technique of flow cytometry, samples of cell-by-cell measurements are collected under each experimental condition, resulting in a collection of interventional data (assuming no latent variables are involved). Usually several external interventions are applied at different points of the pathway, the ultimate aim being the structural recovery of the underlying signalling network, which we model as a causal Directed Acyclic Graph (DAG) using intervention calculus. The advantage of using interventional data, rather than purely observational data, is that identifiability of the true data-generating DAG is enhanced. More technically, a Markov equivalence class of DAGs, whose members are statistically indistinguishable based on observational data alone, can be further decomposed, using additional interventional data, into smaller distinct Interventional Markov equivalence classes. We present a Bayesian methodology for structural learning of Interventional Markov equivalence classes based on observational and interventional samples of multivariate Gaussian observations. Our approach is objective, meaning that it is based on default parameter priors requiring no personal elicitation; some flexibility is however allowed through a tuning parameter which regulates sparsity in the prior on model space. Based on an analytical expression for the marginal likelihood of a given Interventional Essential Graph, and a suitable MCMC scheme, our analysis produces an approximate posterior distribution on the space of Interventional Markov equivalence classes, which can be used to provide uncertainty quantification for features of substantive scientific interest, such as the posterior probability of inclusion of selected edges or paths.
Fitting a deeply nested hierarchical model to a large book review dataset using a moment-based estimator
Ningshan Zhang, Kyle Schmaus, Patrick O. Perry. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2260--2288. Published 27 Nov 2019 (projecteuclid.org).
Abstract: We consider a particular instance of a common problem in recommender systems: using a database of book reviews to inform user-targeted recommendations. In our dataset, books are categorized into genres and subgenres. To exploit this nested taxonomy, we use a hierarchical model that enables information pooling across similar items at many levels within the genre hierarchy. The main challenge in deploying this model is computational. The data sizes are large, and fitting the model at scale using off-the-shelf maximum likelihood procedures is prohibitive. To get around this computational bottleneck, we extend a moment-based fitting procedure proposed for fitting single-level hierarchical models to the general case of arbitrarily deep hierarchies. This extension is an order of magnitude faster than standard maximum likelihood procedures. The fitting method can be deployed beyond recommender systems to general contexts with deeply nested hierarchical generalized linear mixed models.
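The single-level version of such a moment-based procedure is the classical one-way random-effects ANOVA estimator: variance components are read directly off between-group and within-group mean squares, with no iterative likelihood maximization. A minimal sketch for a balanced design follows; the paper's contribution is extending this kind of closed-form moment matching to arbitrarily deep hierarchies, which this sketch does not attempt.

```python
def moment_variance_components(groups):
    """Balanced one-way random effects model y_ij = mu + a_i + e_ij.

    Estimates (sigma2_error, sigma2_group) by the method of moments:
    E[MS_within] = sigma2_e and E[MS_between] = sigma2_e + m * sigma2_a,
    where m is the common group size.
    """
    k = len(groups)          # number of groups
    m = len(groups[0])       # observations per group (balanced design)
    group_means = [sum(g) / m for g in groups]
    grand_mean = sum(group_means) / k
    ms_within = sum((y - gm) ** 2
                    for g, gm in zip(groups, group_means)
                    for y in g) / (k * (m - 1))
    ms_between = m * sum((gm - grand_mean) ** 2 for gm in group_means) / (k - 1)
    sigma2_e = ms_within
    sigma2_a = max((ms_between - ms_within) / m, 0.0)  # truncate at zero
    return sigma2_e, sigma2_a
```

Because every quantity is a simple sum over the data, the estimator is a single pass, which is the source of the speed advantage over iterative maximum likelihood that the abstract reports.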
y Principal nested shape space analysis of molecular dynamics data By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Ian L. Dryden, Kwang-Rae Kim, Charles A. Laughton, Huiling Le. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2213--2234.Abstract: Molecular dynamics simulations produce huge datasets of temporal sequences of molecules. It is of interest to summarize the shape evolution of the molecules in a succinct, low-dimensional representation. However, Euclidean techniques such as principal components analysis (PCA) can be problematic as the data may lie far from a flat manifold. Principal nested spheres gives a fundamentally different decomposition of data from the usual Euclidean subspace based PCA [Biometrika 99 (2012) 551–568]. Subspaces of successively lower dimension are fitted to the data in a backwards manner with the aim of retaining signal and dispensing with noise at each stage. We adapt the methodology to 3D subshape spaces and provide some practical fitting algorithms. The methodology is applied to cluster analysis of peptides, where different states of the molecules can be identified. Also, the temporal transitions between cluster states are explored. Full Article
y Microsimulation model calibration using incremental mixture approximate Bayesian computation By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Carolyn M. Rutter, Jonathan Ozik, Maria DeYoreo, Nicholson Collier. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2189--2212.Abstract: Microsimulation models (MSMs) are used to inform policy by predicting population-level outcomes under different scenarios. MSMs simulate individual-level event histories that mark the disease process (such as the development of cancer) and the effect of policy actions (such as screening) on these events. MSMs often have many unknown parameters; calibration is the process of searching the parameter space to select parameters that result in accurate MSM prediction of a wide range of targets. We develop Incremental Mixture Approximate Bayesian Computation (IMABC) for MSM calibration which results in a simulated sample from the posterior distribution of model parameters given calibration targets. IMABC begins with a rejection-based ABC step, drawing a sample of points from the prior distribution of model parameters and accepting points that result in simulated targets that are near observed targets. Next, the sample is iteratively updated by drawing additional points from a mixture of multivariate normal distributions and accepting points that result in accurate predictions. Posterior estimates are obtained by weighting the final set of accepted points to account for the adaptive sampling scheme. We demonstrate IMABC by calibrating CRC-SPIN 2.0, an updated version of an MSM for colorectal cancer (CRC) that has been used to inform national CRC screening guidelines. Full Article
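The rejection-based first step of IMABC can be illustrated with a toy sketch. The `prior_sample` and `simulate` callables below are hypothetical stand-ins for a real microsimulation model with a scalar target; the mixture-refinement and weighting stages of IMABC are omitted.

```python
import random

def abc_rejection(prior_sample, simulate, observed, tol, n_draws, rng):
    """Rejection-ABC step: draw parameters from the prior, simulate the
    calibration target, and accept draws whose simulated target lands
    within `tol` of the observed target. (Toy scalar-target sketch.)"""
    accepted = []
    for _ in range(n_draws):
        theta = prior_sample(rng)
        sim = simulate(theta, rng)
        if abs(sim - observed) <= tol:
            accepted.append(theta)
    return accepted

# Toy model: the target is the parameter observed with a little noise.
rng = random.Random(42)
kept = abc_rejection(
    prior_sample=lambda r: r.uniform(0.0, 5.0),
    simulate=lambda th, r: th + r.gauss(0.0, 0.1),
    observed=2.0, tol=0.3, n_draws=2000, rng=rng,
)
```

The accepted draws concentrate near the parameter value consistent with the observed target; IMABC then refines such a sample by proposing from a mixture of multivariate normals centered at accepted points.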
y Fire seasonality identification with multimodality tests By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Jose Ameijeiras-Alonso, Akli Benali, Rosa M. Crujeiras, Alberto Rodríguez-Casal, José M. C. Pereira. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2120--2139.Abstract: Understanding the role of vegetation fires in the Earth system is an important environmental problem. Although fire occurrence is influenced by natural factors, human activity related to land use and management has altered the temporal patterns of fire in several regions of the world. Hence, for a better insight into fire regimes it is of special interest to analyze where human activity has altered fire seasonality. For doing so, multimodality tests are a useful tool for determining the number of annual fire peaks. The periodicity of fires and their complex distributional features motivate the use of nonparametric circular statistics. The unsatisfactory performance of previous circular nonparametric proposals for testing multimodality justifies the introduction of a new approach, considering an adapted version of the excess mass statistic, jointly with a bootstrap calibration algorithm. A systematic application of the test on the Russia–Kazakhstan area is presented in order to determine how many fire peaks can be identified in this region. A False Discovery Rate correction, accounting for the spatial dependence of the data, is also required. Full Article
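As a crude illustration of the question the multimodality test formalizes (how many annual fire peaks?), one can count circular local maxima in monthly fire counts. This naive peak count is noise-sensitive; the paper instead uses an adapted excess mass statistic with bootstrap calibration. The monthly data below are invented for illustration.

```python
def count_circular_peaks(counts):
    """Count local maxima in a circular sequence (e.g. monthly fire counts),
    wrapping December back around to January. A naive stand-in for a
    formal multimodality test."""
    n = len(counts)
    return sum(
        1 for i in range(n)
        if counts[i] > counts[i - 1] and counts[i] > counts[(i + 1) % n]
    )

# A hypothetical bimodal fire season: spring and late-summer peaks.
monthly = [3, 8, 21, 14, 5, 4, 9, 25, 18, 6, 3, 2]
```

Here `count_circular_peaks(monthly)` finds two peaks (March and August), the kind of bimodal pattern that human land management can introduce into an otherwise unimodal fire season.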
y Statistical inference for partially observed branching processes with application to cell lineage tracking of in vivo hematopoiesis By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Jason Xu, Samson Koelle, Peter Guttorp, Chuanfeng Wu, Cynthia Dunbar, Janis L. Abkowitz, Vladimir N. Minin. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2091--2119.Abstract: Single-cell lineage tracking strategies enabled by recent experimental technologies have produced significant insights into cell fate decisions, but lack the quantitative framework necessary for rigorous statistical analysis of mechanistic models describing cell division and differentiation. In this paper, we develop such a framework with corresponding moment-based parameter estimation techniques for continuous-time, multi-type branching processes. Such processes provide a probabilistic model of how cells divide and differentiate, and we apply our method to study hematopoiesis, the mechanism of blood cell production. We derive closed-form expressions for higher moments in a general class of such models. These analytical results allow us to efficiently estimate parameters of much richer statistical models of hematopoiesis than those used in previous statistical studies. To our knowledge, the method provides the first rate-inference procedure for fitting such models to time series data generated from cellular barcoding experiments. After validating the methodology in simulation studies, we apply our estimator to hematopoietic lineage tracking data from rhesus macaques. Our analysis provides a more complete understanding of cell fate decisions during hematopoiesis in nonhuman primates, which may be more relevant to human biology and clinical strategies than previous findings from murine studies.
For example, in addition to the previously estimated hematopoietic stem cell self-renewal rate, we are able to estimate fate decision probabilities and to compare structurally distinct models of hematopoiesis using cross validation. These estimates of fate decision probabilities and our model selection results should help biologists compare competing hypotheses about how progenitor cells differentiate. The methodology is transferable to a large class of stochastic compartmental and multi-type branching models, commonly used in studies of cancer progression, epidemiology and many other fields. Full Article
y Estimating the rate constant from biosensor data via an adaptive variational Bayesian approach By projecteuclid.org Published On :: Wed, 27 Nov 2019 22:01 EST Ye Zhang, Zhigang Yao, Patrik Forssén, Torgny Fornstedt. Source: The Annals of Applied Statistics, Volume 13, Number 4, 2011--2042.Abstract: The means to obtain the rate constants of a chemical reaction is a fundamental open problem in both science and industry. Traditional techniques for finding rate constants require either chemical modifications of the reactants or indirect measurements. The rate constant map method is a modern technique to study binding equilibrium and kinetics in chemical reactions. Finding a rate constant map from biosensor data is an ill-posed inverse problem that is usually solved by regularization. In this work, rather than finding a deterministic regularized rate constant map that does not provide uncertainty quantification of the solution, we develop an adaptive variational Bayesian approach to estimate the distribution of the rate constant map, from which some intrinsic properties of a chemical reaction can be explored, including information about rate constants. Our new approach is more realistic than the existing approaches used for biosensors and allows us to estimate the dynamics of the interactions, which are usually hidden in a deterministic approximate solution. We verify the performance of the new proposed method by numerical simulations, and compare it with the Markov chain Monte Carlo algorithm. The results illustrate that the variational method can reliably capture the posterior distribution in a computationally efficient way. Finally, the developed method is also tested on the real biosensor data (parathyroid hormone), where we provide two novel analysis tools—the thresholding contour map and the high order moment map—to estimate the number of interactions as well as their rate constants. Full Article
y A semiparametric modeling approach using Bayesian Additive Regression Trees with an application to evaluate heterogeneous treatment effects By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Bret Zeldow, Vincent Lo Re III, Jason Roy. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1989--2010.Abstract: Bayesian Additive Regression Trees (BART) is a flexible machine learning algorithm capable of capturing nonlinearities between an outcome and covariates and interactions among covariates. We extend BART to a semiparametric regression framework in which the conditional expectation of an outcome is a function of treatment, its effect modifiers, and confounders. The confounders are allowed to have unspecified functional form, while treatment and effect modifiers that are directly related to the research question are given a linear form. The result is a Bayesian semiparametric linear regression model where the posterior distribution of the parameters of the linear part can be interpreted as in parametric Bayesian regression. This is useful in situations where a subset of the variables are of substantive interest and the others are nuisance variables that we would like to control for. An example of this occurs in causal modeling with the structural mean model (SMM). Under certain causal assumptions, our method can be used as a Bayesian SMM. Our methods are demonstrated with simulation studies and an application to a dataset involving adults with HIV/Hepatitis C coinfection who newly initiate antiretroviral therapy. The methods are available in an R package called semibart. Full Article
y Radio-iBAG: Radiomics-based integrative Bayesian analysis of multiplatform genomic data By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Youyi Zhang, Jeffrey S. Morris, Shivali Narang Aerry, Arvind U. K. Rao, Veerabhadran Baladandayuthapani. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1957--1988.Abstract: Technological innovations have produced large multi-modal datasets that include imaging and multi-platform genomics data. Integrative analyses of such data have the potential to reveal important biological and clinical insights into complex diseases like cancer. In this paper, we present Bayesian approaches for integrative analysis of radiological imaging and multi-platform genomic data, wherein our goals are to simultaneously identify genomic and radiomic, that is, radiology-based imaging markers, along with the latent associations between these two modalities, and to detect the overall prognostic relevance of the combined markers. For this task, we propose Radio-iBAG: Radiomics-based Integrative Bayesian Analysis of Multiplatform Genomic Data, a multi-scale Bayesian hierarchical model that involves several innovative strategies: it incorporates integrative analysis of multi-platform genomic data sets to capture fundamental biological relationships; explores the associations between radiomic markers accompanying genomic information with clinical outcomes; and detects genomic and radiomic markers associated with clinical prognosis. We also introduce the use of sparse Principal Component Analysis (sPCA) to extract a sparse set of approximately orthogonal meta-features each containing information from a set of related individual radiomic features, reducing dimensionality and combining like features. Our methods are motivated by and applied to The Cancer Genome Atlas glioblastoma multiforme data set, wherein we integrate magnetic resonance imaging-based biomarkers along with genomic, epigenomic and transcriptomic data.
Our model identifies important magnetic resonance imaging features and the associated genomic platforms that are related to patient survival times. Full Article
y Bayesian methods for multiple mediators: Relating principal stratification and causal mediation in the analysis of power plant emission controls By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Chanmin Kim, Michael J. Daniels, Joseph W. Hogan, Christine Choirat, Corwin M. Zigler. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1927--1956.Abstract: Emission control technologies installed on power plants are a key feature of many air pollution regulations in the US. While such regulations are predicated on the presumed relationships between emissions, ambient air pollution and human health, many of these relationships have never been empirically verified. The goal of this paper is to develop new statistical methods to quantify these relationships. We frame this problem as one of mediation analysis to evaluate the extent to which the effect of a particular control technology on ambient pollution is mediated through causal effects on power plant emissions. Since power plants emit various compounds that contribute to ambient pollution, we develop new methods for multiple intermediate variables that are measured contemporaneously, may interact with one another, and may exhibit joint mediating effects. Specifically, we propose new methods leveraging two related frameworks for causal inference in the presence of mediating variables: principal stratification and causal mediation analysis. We define principal effects based on multiple mediators, and also introduce a new decomposition of the total effect of an intervention on ambient pollution into the natural direct effect and natural indirect effects for all combinations of mediators. Both approaches are anchored to the same observed-data models, which we specify with Bayesian nonparametric techniques. We provide assumptions for estimating principal causal effects, then augment these with an additional assumption required for causal mediation analysis. 
The two analyses, interpreted in tandem, provide the first empirical investigation of the presumed causal pathways that motivate important air quality regulatory policies. Full Article
y Wavelet spectral testing: Application to nonstationary circadian rhythms By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Jessica K. Hargreaves, Marina I. Knight, Jon W. Pitchford, Rachael J. Oakenfull, Sangeeta Chawla, Jack Munns, Seth J. Davis. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1817--1846.Abstract: Rhythmic data are ubiquitous in the life sciences. Biologists need reliable statistical tests to identify whether a particular experimental treatment has caused a significant change in a rhythmic signal. When these signals display nonstationary behaviour, as is common in many biological systems, the established methodologies may be misleading. Therefore, there is a real need for new methodology that enables the formal comparison of nonstationary processes. As circadian behaviour is best understood in the spectral domain, here we develop novel hypothesis testing procedures in the (wavelet) spectral domain, embedding replicate information when available. The data are modelled as realisations of locally stationary wavelet processes, allowing us to define and rigorously estimate their evolutionary wavelet spectra. Motivated by three complementary applications in circadian biology, our new methodology allows the identification of three specific types of spectral difference. We demonstrate the advantages of our methodology over alternative approaches, by means of a comprehensive simulation study and real data applications, using both published and newly generated circadian datasets. In contrast to the current standard methodologies, our method successfully identifies differences within the motivating circadian datasets, and facilitates wider ranging analyses of rhythmic biological data in general. Full Article
y Bayesian modeling of the structural connectome for studying Alzheimer’s disease By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Arkaprava Roy, Subhashis Ghosal, Jeffrey Prescott, Kingshuk Roy Choudhury. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1791--1816.Abstract: We study possible relations between Alzheimer’s disease progression and the structure of the connectome, which is white matter connecting different regions of the brain. Regression models in covariates including age, gender and disease status for the extent of white matter connecting each pair of regions of the brain are proposed. Subject inhomogeneity is also incorporated in the model through random effects with an unknown distribution. As there is a large number of pairs of regions, we also adopt a dimension reduction technique through graphon (J. Combin. Theory Ser. B 96 (2006) 933–957) functions which reduces the functions of pairs of regions to functions of regions. The connecting graphon functions are considered unknown but the assumed smoothness allows putting priors of low complexity on these functions. We pursue a nonparametric Bayesian approach by assigning a Dirichlet process scale mixture of zero-mean normal prior on the distributions of the random effects and finite random series of tensor products of B-spline priors on the underlying graphon functions. We develop efficient Markov chain Monte Carlo techniques for drawing samples for the posterior distributions using Hamiltonian Monte Carlo (HMC). The proposed Bayesian method overwhelmingly outperforms a competing method based on ANCOVA models in the simulation setup. The proposed Bayesian approach is applied on a dataset of 100 subjects and 83 brain regions and key regions implicated in the changing connectome are identified. Full Article
y A hierarchical Bayesian model for single-cell clustering using RNA-sequencing data By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Yiyi Liu, Joshua L. Warren, Hongyu Zhao. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1733--1752.Abstract: Understanding the heterogeneity of cells is an important biological question. The development of single-cell RNA-sequencing (scRNA-seq) technology provides high resolution data for such inquiry. A key challenge in scRNA-seq analysis is the high variability of measured RNA expression levels and frequent dropouts (missing values) due to limited input RNA compared to bulk RNA-seq measurement. Existing clustering methods do not perform well for these noisy and zero-inflated scRNA-seq data. In this manuscript we propose a Bayesian hierarchical model, called BasClu, to appropriately characterize important features of scRNA-seq data in order to more accurately cluster cells. We demonstrate the effectiveness of our method with extensive simulation studies and applications to three real scRNA-seq datasets. Full Article
y A Bayesian mark interaction model for analysis of tumor pathology images By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Qiwei Li, Xinlei Wang, Faming Liang, Guanghua Xiao. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1708--1732.Abstract: With the advance of imaging technology, digital pathology imaging of tumor tissue slides is becoming a routine clinical procedure for cancer diagnosis. This process produces massive imaging data that capture histological details in high resolution. Recent developments in deep-learning methods have enabled us to identify and classify individual cells from digital pathology images at large scale. Reliable statistical approaches to model the spatial pattern of cells can provide new insight into tumor progression and shed light on the biological mechanisms of cancer. We consider the problem of modeling spatial correlations among three commonly seen cells observed in tumor pathology images. A novel geostatistical marking model with interpretable underlying parameters is proposed in a Bayesian framework. We use auxiliary variable MCMC algorithms to sample from the posterior distribution with an intractable normalizing constant. We demonstrate how this model-based analysis can lead to sharper inferences than ordinary exploratory analyses, by means of application to three benchmark datasets and a case study on the pathology images of $188$ lung cancer patients. The case study shows that the spatial correlation between tumor and stromal cells predicts patient prognosis. This statistical methodology not only presents a new model for characterizing spatial correlations in a multitype spatial point pattern conditioning on the locations of the points, but also provides a new perspective for understanding the role of cell–cell interactions in cancer progression. Full Article
y Sequential decision model for inference and prediction on nonuniform hypergraphs with application to knot matching from computational forestry By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Seong-Hwan Jun, Samuel W. K. Wong, James V. Zidek, Alexandre Bouchard-Côté. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1678--1707.Abstract: In this paper, we consider the knot-matching problem arising in computational forestry. The knot-matching problem is an important problem that needs to be solved to advance the state of the art in automatic strength prediction of lumber. We show that this problem can be formulated as a quadripartite matching problem and develop a sequential decision model that admits efficient parameter estimation, along with a sequential Monte Carlo sampler that can be utilized for rapid sampling of graph matchings. We demonstrate the effectiveness of our methods on 30 manually annotated boards and present findings from various simulation studies that provide further evidence of their efficacy. Full Article
y RCRnorm: An integrated system of random-coefficient hierarchical regression models for normalizing NanoString nCounter data By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Gaoxiang Jia, Xinlei Wang, Qiwei Li, Wei Lu, Ximing Tang, Ignacio Wistuba, Yang Xie. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1617--1647.Abstract: Formalin-fixed paraffin-embedded (FFPE) samples have great potential for biomarker discovery, retrospective studies and diagnosis or prognosis of diseases. Their application, however, is hindered by the unsatisfactory performance of traditional gene expression profiling techniques on damaged RNAs. The NanoString nCounter platform is well suited for profiling of FFPE samples and measures gene expression with high sensitivity, which may greatly facilitate realization of scientific and clinical values of FFPE samples. However, methodological development for normalization, a critical step when analyzing this type of data, is far behind. Existing methods designed for the platform use information from different types of internal controls separately and rely on an overly-simplified assumption that expression of housekeeping genes is constant across samples for global scaling. Thus, these methods are not optimized for the nCounter system, not to mention that they were not developed for FFPE samples. We construct an integrated system of random-coefficient hierarchical regression models to capture main patterns and characteristics observed from NanoString data of FFPE samples and develop a Bayesian approach to estimate parameters and normalize gene expression across samples. Our method, labeled RCRnorm, incorporates information from all aspects of the experimental design and simultaneously removes biases from various sources. It eliminates the unrealistic assumption on housekeeping genes and offers great interpretability.
Furthermore, it is applicable to freshly frozen or similar samples that can be generally viewed as a reduced case of FFPE samples. Simulations and applications showed the superior performance of RCRnorm. Full Article
y Modeling seasonality and serial dependence of electricity price curves with warping functional autoregressive dynamics By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Ying Chen, J. S. Marron, Jiejie Zhang. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1590--1616.Abstract: Electricity prices are high dimensional, serially dependent and have seasonal variations. We propose a Warping Functional AutoRegressive (WFAR) model that simultaneously accounts for the cross time-dependence and seasonal variations of the large dimensional data. In particular, electricity price curves are obtained by smoothing over the $24$ discrete hourly prices on each day. In the functional domain, seasonal phase variations are separated from level amplitude changes in a warping process with the Fisher–Rao distance metric, and the aligned (season-adjusted) electricity price curves are modeled in the functional autoregression framework. In a real application, the WFAR model provides superior out-of-sample forecast accuracy in both a normal functioning market, Nord Pool, and an extreme situation, the California market. The forecast performance as well as the relative accuracy improvement are stable for different markets and different time periods. Full Article
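Once the warping step has aligned the daily price curves, the autoregressive part of WFAR can be caricatured with a scalar-coefficient FAR(1) forecast. This is a toy stand-in: the paper's autoregressive operator acts on whole functions, whereas here a single made-up coefficient `rho` shrinks the last day's deviation toward the pointwise mean, and raw hourly grids replace smoothed curves.

```python
def far1_forecast(curves, rho):
    """One-step forecast under a toy FAR(1): next curve = pointwise mean
    plus rho times the last curve's deviation from that mean. `curves`
    is a list of equal-length price grids (e.g. 24 hourly prices per day)."""
    n_hours = len(curves[0])
    mean = [sum(c[h] for c in curves) / len(curves) for h in range(n_hours)]
    return [mean[h] + rho * (curves[-1][h] - mean[h]) for h in range(n_hours)]

history = [[30.0, 42.0], [34.0, 46.0]]   # two days, two "hours" each
forecast = far1_forecast(history, rho=0.5)
```

With two flat example days, the forecast lands halfway between the last curve and the historical mean at every hour, which is exactly the mean-reverting behavior a stationary functional autoregression encodes.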
y Fast dynamic nonparametric distribution tracking in electron microscopic data By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Yanjun Qian, Jianhua Z. Huang, Chiwoo Park, Yu Ding. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1537--1563.Abstract: In situ transmission electron microscope (TEM) adds a promising instrument to the exploration of the nanoscale world, allowing motion pictures to be taken while nano objects are initiating, crystallizing and morphing into different sizes and shapes. To enable in-process control of nanocrystal production, this technology innovation hinges upon a solution addressing a statistical problem, which is the capability of online tracking a dynamic, time-varying probability distribution reflecting the nanocrystal growth. Because no known parametric density functions can adequately describe the evolving distribution, a nonparametric approach is inevitable. Towards this objective, we propose to incorporate the dynamic evolution of the normalized particle size distribution into a state space model, in which the density function is represented by a linear combination of B-splines and the spline coefficients are treated as states. The closed-form algorithm runs online updates faster than the frame rate of the in situ TEM video, making it suitable for in-process control purposes. Imposing the constraints of curve smoothness and temporal continuity improves the accuracy and robustness while tracking the probability distribution. We test our method on three published TEM videos. For all of them, the proposed method is able to outperform several alternative approaches. Full Article
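The B-spline representation of the tracked density can be sketched with the Cox–de Boor recursion. The knot vector and coefficients below are illustrative only; the state-space updating of the coefficients and the smoothness constraints described in the abstract are omitted.

```python
def bspline_basis(i, k, t, knots):
    """Cox–de Boor recursion: value at t of the i-th B-spline basis
    function of order k (degree k-1) over a nondecreasing knot vector."""
    if k == 1:
        return 1.0 if knots[i] <= t < knots[i + 1] else 0.0
    out = 0.0
    d = knots[i + k - 1] - knots[i]
    if d > 0.0:
        out += (t - knots[i]) / d * bspline_basis(i, k - 1, t, knots)
    d = knots[i + k] - knots[i + 1]
    if d > 0.0:
        out += (knots[i + k] - t) / d * bspline_basis(i + 1, k - 1, t, knots)
    return out

def density_value(coeffs, t, knots, k=4):
    """Density as a linear combination of B-spline bases; in the tracked
    model the coefficients play the role of the state vector, updated
    frame by frame."""
    return sum(c * bspline_basis(i, k, t, knots) for i, c in enumerate(coeffs))

knots = [0.0] * 4 + [1.0, 2.0, 3.0] + [4.0] * 4   # clamped cubic knots on [0, 4]
val = density_value([1.0] * 7, 2.5, knots)         # all-ones coefficients
```

Setting every coefficient to one recovers the partition-of-unity property of the basis (the value is exactly 1 in the interior), a convenient sanity check before wiring the coefficients into a state-space filter.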
y Identifying multiple changes for a functional data sequence with application to freeway traffic segmentation By projecteuclid.org Published On :: Wed, 16 Oct 2019 22:03 EDT Jeng-Min Chiou, Yu-Ting Chen, Tailen Hsing. Source: The Annals of Applied Statistics, Volume 13, Number 3, 1430--1463.Abstract: Motivated by the study of road segmentation partitioned by shifts in traffic conditions along a freeway, we introduce a two-stage procedure, Dynamic Segmentation and Backward Elimination (DSBE), for identifying multiple changes in the mean functions for a sequence of functional data. The Dynamic Segmentation procedure searches for all possible changepoints using the derived global optimality criterion coupled with the local strategy of at-most-one-changepoint by dividing the entire sequence into individual subsequences that are recursively adjusted until convergence. Then, the Backward Elimination procedure verifies these changepoints by iteratively testing the unlikely changes to ensure their significance until no more changepoints can be removed. By combining the local strategy with the global optimal changepoint criterion, the DSBE algorithm is conceptually simple and easy to implement and performs better than the binary segmentation-based approach at detecting small multiple changes. The consistency property of the changepoint estimators and the convergence of the algorithm are proved. We apply DSBE to detect changes in traffic streams through real freeway traffic data. The practical performance of DSBE is also investigated through intensive simulation studies for various scenarios. Full Article
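The at-most-one-changepoint building block that DSBE applies within each subsequence can be illustrated for a scalar series; for functional data, the squared deviations below would be replaced by squared $L^2$ distances between curves and their segment mean functions, and the split would be kept only if it passes a significance test.

```python
def best_single_changepoint(xs):
    """Return the split index minimizing the total within-segment sum of
    squared deviations, or None if no split improves on the unsplit fit.
    Scalar toy version of an at-most-one-changepoint search."""
    def sse(seg):
        m = sum(seg) / len(seg)
        return sum((x - m) ** 2 for x in seg)

    best_k, best_cost = None, sse(xs)
    for k in range(1, len(xs)):
        cost = sse(xs[:k]) + sse(xs[k:])
        if cost < best_cost:
            best_k, best_cost = k, cost
    return best_k
```

A level shift is found exactly at the jump, and a constant series returns no changepoint; DSBE's dynamic-segmentation stage recursively applies such a step to subsequences, and its backward-elimination stage then re-tests each candidate.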
y Introduction to papers on the modeling and analysis of network data—II By projecteuclid.org Published On :: Thu, 05 Aug 2010 15:41 EDT Stephen E. Fienberg. Source: Ann. Appl. Stat., Volume 4, Number 2, 533--534. Full Article
y Stratonovich type integration with respect to fractional Brownian motion with Hurst parameter less than $1/2$ By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Jorge A. León. Source: Bernoulli, Volume 26, Number 3, 2436--2462.Abstract: Let $B^{H}$ be a fractional Brownian motion with Hurst parameter $H\in(0,1/2)$ and $p:\mathbb{R}\rightarrow\mathbb{R}$ a polynomial function. The main purpose of this paper is to introduce a Stratonovich type stochastic integral with respect to $B^{H}$, whose domain includes the process $p(B^{H})$. That is, an integral that allows us to integrate $p(B^{H})$ with respect to $B^{H}$, which does not happen with the symmetric integral given by Russo and Vallois (Probab. Theory Related Fields 97 (1993) 403–421) in general. Towards this end, we combine the approaches utilized by León and Nualart (Stochastic Process. Appl. 115 (2005) 481–492), and Russo and Vallois (Probab. Theory Related Fields 97 (1993) 403–421), whose aims are to extend the domain of the divergence operator for Gaussian processes and to define some stochastic integrals, respectively. Then, we study the relation between this Stratonovich integral and the extension of the divergence operator (see León and Nualart (Stochastic Process. Appl. 115 (2005) 481–492)), an Itô formula and the existence of a unique solution of some Stratonovich stochastic differential equations. These last results have been analyzed by Alòs, León and Nualart (Taiwanese J. Math. 5 (2001) 609–632), where the Hurst parameter $H$ belongs to the interval $(1/4,1/2)$. Full Article
y Local law and Tracy–Widom limit for sparse stochastic block models By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Jong Yun Hwang, Ji Oon Lee, Wooseok Yang. Source: Bernoulli, Volume 26, Number 3, 2400--2435.Abstract: We consider the spectral properties of sparse stochastic block models, where $N$ vertices are partitioned into $K$ balanced communities. Under an assumption that the intra-community probability and inter-community probability are of similar order, we prove a local semicircle law up to the spectral edges, with an explicit formula on the deterministic shift of the spectral edge. We also prove that the fluctuation of the extremal eigenvalues is given by the GOE Tracy–Widom law after rescaling and centering the entries of sparse stochastic block models. Applying the result to sparse stochastic block models, we rigorously prove that there is a large gap between the outliers and the spectral edge without centering. Full Article
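The outlier-versus-bulk picture described in the abstract can be seen numerically: for a two-balanced-community SBM, the top adjacency eigenvalue concentrates near $N(p_{in}+p_{out})/2$, well above the bulk spectral edge. The sketch below uses dense toy parameters rather than the sparse regime the paper analyzes, and power iteration in place of a full eigensolver.

```python
import math
import random

def sbm_adjacency(n, p_in, p_out, rng):
    """Symmetric adjacency matrix of a two-balanced-community stochastic
    block model: edge probability p_in within a community, p_out across."""
    A = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            p = p_in if (i < n // 2) == (j < n // 2) else p_out
            if rng.random() < p:
                A[i][j] = A[j][i] = 1.0
    return A

def top_eigenvalue(A, iters=200):
    """Power iteration for the largest eigenvalue of a nonnegative
    symmetric matrix (Perron-Frobenius guarantees it is the dominant one)."""
    n = len(A)
    v = [1.0] * n
    for _ in range(iters):
        w = [sum(A[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient at the converged unit vector.
    return sum(v[i] * sum(A[i][j] * v[j] for j in range(n)) for i in range(n))

rng = random.Random(7)
lam = top_eigenvalue(sbm_adjacency(60, 0.8, 0.2, rng))  # roughly 60 * 0.5 = 30
```

With `n = 60`, `p_in = 0.8`, `p_out = 0.2`, the top eigenvalue sits near 30 while the semicircle bulk is an order of magnitude smaller, which is the "large gap between the outliers and the spectral edge" the theorem makes rigorous without centering.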
y Frequency domain theory for functional time series: Variance decomposition and an invariance principle By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Piotr Kokoszka, Neda Mohammadi Jouzdani. Source: Bernoulli, Volume 26, Number 3, 2383--2399.Abstract: This paper is concerned with frequency domain theory for functional time series, which are temporally dependent sequences of functions in a Hilbert space. We consider a variance decomposition, which is more suitable for such a data structure than the variance decomposition based on the Karhunen–Loève expansion. The decomposition we study uses eigenvalues of spectral density operators, which are functional analogs of the spectral density of a stationary scalar time series. We propose estimators of the variance components and derive convergence rates for their mean square error as well as their asymptotic normality. The latter is derived from a frequency domain invariance principle for the estimators of the spectral density operators. This principle is established for a broad class of linear time series models. It is a main contribution of the paper. Full Article
y Bayesian linear regression for multivariate responses under group sparsity By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Bo Ning, Seonghyun Jeong, Subhashis Ghosal. Source: Bernoulli, Volume 26, Number 3, 2353--2382.Abstract: We study frequentist properties of a Bayesian high-dimensional multivariate linear regression model with correlated responses. The predictors are separated into many groups and the group structure is pre-determined. Two features of the model are unique: (i) group sparsity is imposed on the predictors; (ii) the covariance matrix is unknown and its dimensions can also be high. We choose a product of independent spike-and-slab priors on the regression coefficients and a new prior on the covariance matrix based on its eigendecomposition. Each spike-and-slab prior is a mixture of a point mass at zero and a multivariate density involving the $\ell_{2,1}$-norm. We first obtain the posterior contraction rate and bounds on the effective dimension of the model holding with high posterior probability. We then show that the multivariate regression coefficients can be recovered under certain compatibility conditions. Finally, we quantify the uncertainty for the regression coefficients with frequentist validity through a Bernstein–von Mises type theorem. The result leads to selection consistency for the Bayesian method. We derive the posterior contraction rate using the general theory by constructing a suitable test from first principles using moment bounds for certain likelihood ratios. This leads to posterior concentration around the truth with respect to the average Rényi divergence of order $1/2$. This technique of obtaining the required tests for posterior contraction rate could be useful in many other problems. Full Article
y A refined Cramér-type moderate deviation for sums of local statistics By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Xiao Fang, Li Luo, Qi-Man Shao. Source: Bernoulli, Volume 26, Number 3, 2319--2352.Abstract: We prove a refined Cramér-type moderate deviation result by taking into account the skewness in the normal approximation for sums of local statistics of independent random variables. We apply the main result to $k$-runs, U-statistics and subgraph counts in the Erdős–Rényi random graph. To prove our main result, we develop exponential concentration inequalities and higher-order tail probability expansions via Stein’s method. Full Article
y Weighted Lépingle inequality By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Pavel Zorin-Kranich. Source: Bernoulli, Volume 26, Number 3, 2311--2318.Abstract: We prove an estimate for weighted $p$th moments of the pathwise $r$-variation of a martingale in terms of the $A_{p}$ characteristic of the weight. The novelty of the proof is that we avoid real interpolation techniques. Full Article
y Concentration of the spectral norm of Erdős–Rényi random graphs By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Gábor Lugosi, Shahar Mendelson, Nikita Zhivotovskiy. Source: Bernoulli, Volume 26, Number 3, 2253--2274.Abstract: We present results on the concentration properties of the spectral norm $\|A_{p}\|$ of the adjacency matrix $A_{p}$ of an Erdős–Rényi random graph $G(n,p)$. First, we consider the Erdős–Rényi random graph process and prove that $\|A_{p}\|$ is uniformly concentrated over the range $p\in[C\log n/n,1]$. The analysis is based on delocalization arguments, uniform laws of large numbers, together with the entropy method to prove concentration inequalities. As an application of our techniques, we prove sharp sub-Gaussian moment inequalities for $\|A_{p}\|$ for all $p\in[c\log^{3}n/n,1]$ that improve the general bounds of Alon, Krivelevich, and Vu ( Israel J. Math. 131 (2002) 259–267) and some of the more recent results of Erdős et al. ( Ann. Probab. 41 (2013) 2279–2375). Both results are consistent with the asymptotic result of Füredi and Komlós ( Combinatorica 1 (1981) 233–241) that holds for fixed $p$ as $n\to\infty$. Full Article
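The concentration phenomenon in this abstract is easy to observe empirically. The sketch below is a simulation for intuition only, not the authors' entropy-method argument; the choices $n = 400$, $p = 5\log n/n$, and 20 repetitions are assumptions for the demo. It draws several independent $G(n,p)$ graphs in the range $p \ge C\log n/n$ and checks that the spectral norm fluctuates on a scale much smaller than its typical value.

```python
import numpy as np

def spectral_norm_gnp(n, p, rng):
    """Spectral norm (largest singular value) of one G(n, p) adjacency matrix."""
    upper = np.triu(rng.random((n, n)) < p, 1)  # independent edges above the diagonal
    A = (upper + upper.T).astype(float)
    return np.linalg.norm(A, 2)

rng = np.random.default_rng(2)
n = 400
p = 5 * np.log(n) / n  # inside the range [C log n / n, 1] from the abstract
norms = [spectral_norm_gnp(n, p, rng) for _ in range(20)]
print("mean:", np.mean(norms), "std:", np.std(norms))
```

The standard deviation across draws is an order of magnitude smaller than the mean, in line with the sub-Gaussian moment bounds stated above.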
y On Sobolev tests of uniformity on the circle with an extension to the sphere By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Sreenivasa Rao Jammalamadaka, Simos Meintanis, Thomas Verdebout. Source: Bernoulli, Volume 26, Number 3, 2226--2252.Abstract: Circular and spherical data arise in many applications, especially in biology, Earth sciences and astronomy. In dealing with such data, one of the preliminary steps, before any further inference, is to test whether the data are isotropic, that is, uniformly distributed around the circle or the sphere. In view of its importance, there is a considerable literature on the topic. In the present work, we provide new tests of uniformity on the circle based on original asymptotic results. Our tests are motivated by the shape of locally and asymptotically maximin tests of uniformity against generalized von Mises distributions. We show that they are uniformly consistent. Empirical power comparisons with several competing procedures are presented via simulations. The new tests detect particularly well multimodal alternatives such as mixtures of von Mises distributions. A practically oriented combination of the new tests with already existing Sobolev tests is proposed. An extension to testing uniformity on the sphere, along with some simulations, is included. The procedures are illustrated on a real dataset. Full Article
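To make the testing problem concrete: the classical Rayleigh test is the simplest Sobolev-type test of circular uniformity and serves as a baseline for the new procedures in this abstract. The sketch below implements it (this is the standard textbook test, not the authors' new statistics; sample sizes and the von Mises concentration $\kappa = 2$ are assumptions for the demo). Under uniformity the statistic is asymptotically $\chi^{2}_{2}$, whose survival function is exactly $e^{-x/2}$.

```python
import numpy as np

def rayleigh_test(theta):
    """Rayleigh test of uniformity on the circle for angles theta (radians).

    Returns (statistic, asymptotic p-value). Powerful against unimodal
    von Mises-type alternatives, but blind to some multimodal ones."""
    n = len(theta)
    C, S = np.cos(theta).sum(), np.sin(theta).sum()
    stat = 2.0 * (C**2 + S**2) / n   # asymptotically chi-squared with 2 df
    pval = np.exp(-stat / 2.0)       # chi-squared(2) survival function
    return stat, pval

rng = np.random.default_rng(3)
_, p_unif = rayleigh_test(rng.uniform(0, 2 * np.pi, 300))  # uniform data: no rejection
_, p_vm = rayleigh_test(rng.vonmises(0.0, 2.0, 300))       # unimodal alternative: rejection
print("uniform p-value:", p_unif, "von Mises p-value:", p_vm)
```

The Rayleigh test's weakness against multimodal alternatives (e.g. von Mises mixtures with antipodal modes) is precisely the gap the new tests in the paper are designed to fill.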
y Exponential integrability and exit times of diffusions on sub-Riemannian and metric measure spaces By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Anton Thalmaier, James Thompson. Source: Bernoulli, Volume 26, Number 3, 2202--2225.Abstract: In this article, we derive moment estimates, exponential integrability, concentration inequalities and exit times estimates for canonical diffusions firstly on sub-Riemannian limits of Riemannian foliations and secondly in the nonsmooth setting of $\operatorname{RCD}^{*}(K,N)$ spaces. In each case, the necessary ingredients are Itô’s formula and a comparison theorem for the Laplacian, for which we refer to the recent literature. As an application, we derive pointwise Carmona-type estimates on eigenfunctions of Schrödinger operators. Full Article
y Directional differentiability for supremum-type functionals: Statistical applications By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Javier Cárcamo, Antonio Cuevas, Luis-Alberto Rodríguez. Source: Bernoulli, Volume 26, Number 3, 2143--2175.Abstract: We show that various functionals related to the supremum of a real function defined on an arbitrary set or a measure space are Hadamard directionally differentiable. We specifically consider the supremum norm, the supremum, the infimum, and the amplitude of a function. The (usually non-linear) derivatives of these maps adopt simple expressions under suitable assumptions on the underlying space. As an application, we improve and extend to the multidimensional case the results in Raghavachari ( Ann. Statist. 1 (1973) 67–73) regarding the limiting distributions of Kolmogorov–Smirnov type statistics under the alternative hypothesis. Similar results are obtained for analogous statistics associated with copulas. We additionally solve an open problem about the Berk–Jones statistic proposed by Jager and Wellner (In A Festschrift for Herman Rubin (2004) 319–331 IMS). Finally, the asymptotic distribution of maximum mean discrepancies over Donsker classes of functions is derived. Full Article
y Noncommutative Lebesgue decomposition and contiguity with applications in quantum statistics By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Akio Fujiwara, Koichi Yamagata. Source: Bernoulli, Volume 26, Number 3, 2105--2142.Abstract: We herein develop a theory of contiguity in the quantum domain based upon a novel quantum analogue of the Lebesgue decomposition. The theory thus formulated is pertinent to the weak quantum local asymptotic normality introduced in the previous paper [Yamagata, Fujiwara, and Gill, Ann. Statist. 41 (2013) 2197–2217], yielding substantial enlargement of the scope of quantum statistics. Full Article
y On sampling from a log-concave density using kinetic Langevin diffusions By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Arnak S. Dalalyan, Lionel Riou-Durand. Source: Bernoulli, Volume 26, Number 3, 1956--1988.Abstract: Langevin diffusion processes and their discretizations are often used for sampling from a target density. The most convenient framework for assessing the quality of such a sampling scheme corresponds to smooth and strongly log-concave densities defined on $\mathbb{R}^{p}$. The present work focuses on this framework and studies the behavior of the Monte Carlo algorithm based on discretizations of the kinetic Langevin diffusion. We first prove the geometric mixing property of the kinetic Langevin diffusion with a mixing rate that is optimal in terms of its dependence on the condition number. We then use this result for obtaining improved guarantees of sampling using the kinetic Langevin Monte Carlo method, when the quality of sampling is measured by the Wasserstein distance. We also consider the situation where the Hessian of the log-density of the target distribution is Lipschitz-continuous. In this case, we introduce a new discretization of the kinetic Langevin diffusion and prove that this leads to a substantial improvement of the upper bound on the sampling error measured in Wasserstein distance. Full Article
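The sampler studied in this abstract can be sketched with a crude Euler–Maruyama discretization of the kinetic (underdamped) Langevin diffusion $dX_t = V_t\,dt$, $dV_t = -\gamma V_t\,dt - \nabla U(X_t)\,dt + \sqrt{2\gamma}\,dW_t$. Note this is a simplification: the paper analyzes a sharper discretization (and a new one under Lipschitz Hessians), not plain Euler; the step size, friction $\gamma$, and burn-in below are assumptions for the demo. The target is a standard Gaussian, a smooth and strongly log-concave density with $U(x) = \|x\|^2/2$.

```python
import numpy as np

def klmc_step(x, v, grad_U, h, gamma, rng):
    """One Euler-Maruyama step of the kinetic Langevin diffusion
    (position x, velocity v, potential gradient grad_U, step h, friction gamma)."""
    noise = rng.standard_normal(x.shape)
    v_new = v - h * (gamma * v + grad_U(x)) + np.sqrt(2.0 * gamma * h) * noise
    x_new = x + h * v  # position advances with the previous velocity
    return x_new, v_new

grad_U = lambda x: x  # standard Gaussian target: U(x) = |x|^2 / 2
rng = np.random.default_rng(1)
x, v = np.zeros(2), np.zeros(2)
samples = []
for t in range(20000):
    x, v = klmc_step(x, v, grad_U, h=0.05, gamma=2.0, rng=rng)
    if t >= 2000:  # discard burn-in
        samples.append(x.copy())
samples = np.asarray(samples)
print("empirical mean:", samples.mean(axis=0), "empirical var:", samples.var(axis=0))
```

The empirical mean and variance of the retained positions approach those of the target (0 and 1 per coordinate), up to the $O(h)$ discretization bias that the paper's refined schemes are designed to reduce.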
y Busemann functions and semi-infinite O’Connell–Yor polymers By projecteuclid.org Published On :: Mon, 27 Apr 2020 04:02 EDT Tom Alberts, Firas Rassoul-Agha, Mackenzie Simper. Source: Bernoulli, Volume 26, Number 3, 1927--1955.Abstract: We prove that given any fixed asymptotic velocity, the finite length O’Connell–Yor polymer has an infinite length limit satisfying the law of large numbers with this velocity. By a Markovian property of the quenched polymer this reduces to showing the existence of Busemann functions: almost sure limits of ratios of random point-to-point partition functions. The key ingredients are the Burke property of the O’Connell–Yor polymer and a comparison lemma for the ratios of partition functions. We also show the existence of infinite length limits in the Brownian last passage percolation model. Full Article