Latest co news

Communications and networking : 14th EAI International Conference, ChinaCom 2019, Shanghai, China, November 29 - December 1, 2019, proceedings.

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: ChinaCom (Conference) (14th : 2019 : Shanghai, China)

Callnumber: Online

ISBN: 9783030411176

Full Article

Common problems in the newborn nursery : an evidence and case-based guide

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9783319956725 (electronic bk.)

Full Article

Commercial status of plant breeding in India

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Tiwari, Aparna, author.

Callnumber: Online

ISBN: 9789811519062

Full Article

Combustion emissions

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Schofield, Keith.

Callnumber: Online

ISBN: 9780128191279 (electronic bk.)

Full Article

Biscuit, cookie and cracker process and recipes

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Sykes, Glyn, author

Callnumber: Online

ISBN: 9780128206133 (electronic bk.)

Full Article

Biology and ecology of venomous marine cnidarians

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Santhanam, Ramasamy, 1946- author

Callnumber: Online

ISBN: 9789811516030 (electronic bk.)

Full Article

Bioeconomy for beginners

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Bioökonomie für Einsteiger. English

Callnumber: Online

ISBN: 9783662603901 (electronic bk.)

Full Article

Binary code fingerprinting for cybersecurity : application to malicious code fingerprinting

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Alrabaee, Saed, author

Callnumber: Online

ISBN: 9783030342388 (electronic bk.)

Full Article

Berquist's musculoskeletal imaging companion

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Peterson, Jeffrey J., author.

Callnumber: Online

ISBN: 9781496314994

Full Article

Anxiety disorders : rethinking and understanding recent discoveries

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9789813297050 (electronic bk.)

Full Article

Anatomical chart company atlas of pathophysiology

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Atlas of pathophysiology.

Callnumber: Online

ISBN: 9781496370921

Full Article

African edible insects as alternative source of food, oil, protein and bioactive components

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9783030329525 (electronic bk.)

Full Article

Advanced age geriatric care : a comprehensive guide

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9783319969985 (electronic bk.)

Full Article

A treatise on topical corticosteroids in dermatology : use, misuse and abuse

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Callnumber: Online

ISBN: 9789811046094

Full Article

100 cases in clinical pharmacology, therapeutics and prescribing

By dal.novanet.ca
Published On :: Fri, 1 May 2020 19:44:43 -0300

Author: Layne, Kerry, author.

Callnumber: Online

ISBN: 9780429624537 (electronic bk.)

Full Article

Notice of Construction - Kennedy Rd. and Ravenshoe Rd.

By www.eastgwillimbury.ca
Published On :: Sun, 03 May 2020 16:28:03 GMT

Full Article

Notice of Construction - Woodbine Ave.

By www.eastgwillimbury.ca
Published On :: Fri, 24 Apr 2020 18:41:27 GMT

Full Article

COVID-19 Update

By www.eastgwillimbury.ca
Published On :: Wed, 06 May 2020 18:48:44 GMT

Full Article

Hays County Joins the Texas Purchasing Group by BidNet Direct

By www.prweb.com
Published On ::

Hays County announced it has joined the Texas Purchasing Group and will be publishing and distributing upcoming bid opportunities on the system along with their current platform in these unprecedented...

(PRWeb April 09, 2020)

Read the full story at https://www.prweb.com/releases/hays_county_joins_the_texas_purchasing_group_by_bidnet_direct/prweb17021429.htm

Full Article

Domestic Gag Rule Reduces Contraceptive Access For Nearly 370,000...

By www.prweb.com
Published On ::

According to data released by Power to Decide, an estimated 369,960 New Jersey women of reproductive age (13-44) in need of publicly funded contraception live in counties impacted by the...

(PRWeb April 09, 2020)

Read the full story at https://www.prweb.com/releases/domestic_gag_rule_reduces_contraceptive_access_for_nearly_370_000_women_living_in_new_jersey/prweb17040987.htm

Full Article

Wine Retailers Seek Alcohol Shipping Compromise with 18 States

By www.prweb.com
Published On ::

National Association of Wine Retailers Release Letter Delivered to Attorneys General and Alcohol Regulatory Chiefs Concerning Unconstitutional and Unenforceable Wine Shipping Bans

(PRWeb April 15, 2020)

Read the full story at https://www.prweb.com/releases/wine_retailers_seek_alcohol_shipping_compromise_with_18_states/prweb17050617.htm

Full Article

In Battle to Fight Coronavirus Pandemic, LeadingAge Nursing Home...

By www.prweb.com
Published On ::

Aging Services Providers Dedicated to Fulfilling Their Critical Role in Public Health System

(PRWeb April 18, 2020)

Read the full story at https://www.prweb.com/releases/in_battle_to_fight_coronavirus_pandemic_leadingage_nursing_home_members_support_texas_action_to_gather_and_leverage_data/prweb17055806.htm

Full Article

New Partnerships Emerge for COVID-19 Relief: Dade County Farm Bureau...

By www.prweb.com
Published On ::

Harvested produce crops feed Florida Department of Corrections’ (FDC) more than 87,000 inmates; action saves food costs while reducing COVID-19 related supply chain impacts.

(PRWeb April 20, 2020)

Read the full story at https://www.prweb.com/releases/new_partnerships_emerge_for_covid_19_relief_dade_county_farm_bureau_teams_with_state_leaders_to_launch_farm_to_inmate_program/prweb17052045.htm

Full Article

STRmix Now Being Used by Suffolk County Crime Lab, Contra Costa...

By www.prweb.com
Published On ::

New organizations bring total number of U.S. forensic labs using STRmix to 55.

(PRWeb April 23, 2020)

Read the full story at https://www.prweb.com/releases/strmix_now_being_used_by_suffolk_county_crime_lab_contra_costa_sheriffs_office/prweb17057336.htm

Full Article

Jamboree Begins Construction on Capstone Development to Change...

By www.prweb.com
Published On ::

In a public-private partnership to develop housing, resident services and hope for 102 working families in Haster Orangewood community, Jamboree Housing Corporation and the City of Anaheim announce...

(PRWeb April 27, 2020)

Read the full story at https://www.prweb.com/releases/jamboree_begins_construction_on_capstone_development_to_change_trajectory_of_neighborhood_in_anaheim_ca/prweb17073166.htm

Full Article

Colorado Court Rules STRmix Is “Relevant and Reliable” Practice for...

By www.prweb.com
Published On ::

Defendant’s Motion to Exclude Expert Testimony regarding evidence generated by STRmix denied.

(PRWeb May 08, 2020)

Read the full story at https://www.prweb.com/releases/colorado_court_rules_strmix_is_relevant_and_reliable_practice_for_interpreting_likelihood_ratios/prweb17101548.htm

Full Article

Almost sure uniqueness of a global minimum without convexity

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Gregory Cox.

Source: The Annals of Statistics, Volume 48, Number 1, 584--606.

Abstract:
This paper establishes the argmin of a random objective function to be unique almost surely. This paper first formulates a general result that proves almost sure uniqueness without convexity of the objective function. The general result is then applied to a variety of applications in statistics. Four applications are discussed, including uniqueness of M-estimators, both classical likelihood and penalized likelihood estimators, and two applications of the argmin theorem, threshold regression and weak identification.

Full Article

Efficient estimation of linear functionals of principal components

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Vladimir Koltchinskii, Matthias Löffler, Richard Nickl.

Source: The Annals of Statistics, Volume 48, Number 1, 464--490.

Abstract:
We study principal component analysis (PCA) for mean zero i.i.d. Gaussian observations $X_{1},\dots,X_{n}$ in a separable Hilbert space $\mathbb{H}$ with unknown covariance operator $\Sigma$. The complexity of the problem is characterized by its effective rank $\mathbf{r}(\Sigma):=\frac{\operatorname{tr}(\Sigma)}{\|\Sigma\|}$, where $\mathrm{tr}(\Sigma)$ denotes the trace of $\Sigma$ and $\|\Sigma\|$ denotes its operator norm. We develop a method of bias reduction in the problem of estimation of linear functionals of eigenvectors of $\Sigma$. Under the assumption that $\mathbf{r}(\Sigma)=o(n)$, we establish the asymptotic normality and asymptotic properties of the risk of the resulting estimators and prove matching minimax lower bounds, showing their semiparametric optimality.
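
The effective rank defined in the abstract above can be illustrated with a small sketch. It assumes the covariance operator is finite-dimensional and already given by its eigenvalues, so that the trace is their sum and the operator norm is the largest one; the function name and the two example spectra are hypothetical and not taken from the paper.

```python
def effective_rank(eigenvalues):
    """Effective rank tr(Sigma) / ||Sigma|| computed from the eigenvalues of Sigma.

    Assumption: Sigma is symmetric positive semi-definite, so its trace is the
    sum of the eigenvalues and its operator norm is the largest eigenvalue.
    """
    if not eigenvalues or min(eigenvalues) < 0 or max(eigenvalues) <= 0:
        raise ValueError("need non-negative eigenvalues with at least one positive")
    return sum(eigenvalues) / max(eigenvalues)


# Hypothetical spectra, chosen only for illustration.
flat = [1.0, 1.0, 1.0, 1.0]    # identity covariance: effective rank equals the dimension
spiked = [4.0, 2.0, 1.0, 1.0]  # one dominant direction: effective rank drops
print(effective_rank(flat))    # 4.0
print(effective_rank(spiked))  # 2.0
```

The effective rank never exceeds the dimension and shrinks as the spectrum concentrates on a few directions, which is why it can serve as a complexity measure.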

Full Article

Uniformly valid confidence intervals post-model-selection

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

François Bachoc, David Preinerstorfer, Lukas Steinberger.

Source: The Annals of Statistics, Volume 48, Number 1, 440--463.

Abstract:
We suggest general methods to construct asymptotically uniformly valid confidence intervals post-model-selection. The constructions are based on principles recently proposed by Berk et al. ( Ann. Statist. 41 (2013) 802–837). In particular, the candidate models used can be misspecified, the target of inference is model-specific, and coverage is guaranteed for any data-driven model selection procedure. After developing a general theory, we apply our methods to practically important situations where the candidate set of models, from which a working model is selected, consists of fixed design homoskedastic or heteroskedastic linear models, or of binary regression models with general link functions. In an extensive simulation study, we find that the proposed confidence intervals perform remarkably well, even when compared to existing methods that are tailored only for specific model selection procedures.

Full Article

Consistent selection of the number of change-points via sample-splitting

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Changliang Zou, Guanghui Wang, Runze Li.

Source: The Annals of Statistics, Volume 48, Number 1, 413--439.

Abstract:
In multiple change-point analysis, one of the major challenges is to estimate the number of change-points. Most existing approaches attempt to minimize a Schwarz information criterion which balances a term quantifying model fit with a penalization term accounting for model complexity that increases with the number of change-points and limits overfitting. However, different penalization terms are required to adapt to different contexts of multiple change-point problems and the optimal penalization magnitude usually varies with the model and error distribution. We propose a data-driven selection criterion that is applicable to most kinds of popular change-point detection methods, including binary segmentation and optimal partitioning algorithms. The key idea is to select the number of change-points that minimizes the squared prediction error, which measures the fit of a specified model for a new sample. We develop a cross-validation estimation scheme based on an order-preserved sample-splitting strategy, and establish its asymptotic selection consistency under some mild conditions. Effectiveness of the proposed selection criterion is demonstrated on a variety of numerical experiments and real-data examples.
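
The selection idea in the abstract above might be sketched as follows. This is a simplified illustration under stated assumptions, not the authors' procedure: the model is a piecewise-constant mean, the order-preserved split takes alternating observations, the fit uses a plain dynamic program, and all function names are hypothetical.

```python
def segment_cost(prefix, prefix_sq, i, j):
    """Residual sum of squares of y[i:j] around its own mean."""
    s = prefix[j] - prefix[i]
    return (prefix_sq[j] - prefix_sq[i]) - s * s / (j - i)


def best_segmentation(y, k):
    """Change-point positions of the least-squares piecewise-constant fit with k changes."""
    n = len(y)
    prefix, prefix_sq = [0.0], [0.0]
    for v in y:
        prefix.append(prefix[-1] + v)
        prefix_sq.append(prefix_sq[-1] + v * v)
    inf = float("inf")
    # cost[m][j]: best cost of splitting y[:j] into m + 1 segments
    cost = [[inf] * (n + 1) for _ in range(k + 1)]
    back = [[0] * (n + 1) for _ in range(k + 1)]
    for j in range(1, n + 1):
        cost[0][j] = segment_cost(prefix, prefix_sq, 0, j)
    for m in range(1, k + 1):
        for j in range(m + 1, n + 1):
            for i in range(m, j):
                c = cost[m - 1][i] + segment_cost(prefix, prefix_sq, i, j)
                if c < cost[m][j]:
                    cost[m][j], back[m][j] = c, i
    cuts, j = [], n
    for m in range(k, 0, -1):  # walk the back-pointers from the last segment
        j = back[m][j]
        cuts.append(j)
    return sorted(cuts)


def cv_error(train, valid, k):
    """Squared prediction error on `valid` of the k-change fit estimated from `train`."""
    cuts = [0] + best_segmentation(train, k) + [len(train)]
    err = 0.0
    for a, b in zip(cuts, cuts[1:]):
        mean = sum(train[a:b]) / (b - a)
        # Validation point t is the neighbour of training point t in the original order.
        err += sum((v - mean) ** 2 for v in valid[a:b])
    return err


def select_num_changes(y, max_k):
    """Number of change-points (0..max_k) with the smallest validation error."""
    train, valid = y[0::2], y[1::2]  # order-preserved split into alternating points
    errors = [cv_error(train, valid, k) for k in range(max_k + 1)]
    return min(range(max_k + 1), key=errors.__getitem__)  # ties go to the smaller k


# Noise-free toy series with a single jump, for illustration only.
series = [0.0] * 10 + [5.0] * 10
print(select_num_changes(series, 3))  # 1
```

Because the split keeps the time order, each validation point can be predicted by the mean of the training segment that covers the same stretch of the series; no penalty term has to be tuned.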

Full Article

Concentration and consistency results for canonical and curved exponential-family models of random graphs

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Michael Schweinberger, Jonathan Stewart.

Source: The Annals of Statistics, Volume 48, Number 1, 374--396.

Abstract:
Statistical inference for exponential-family models of random graphs with dependent edges is challenging. We stress the importance of additional structure and show that additional structure facilitates statistical inference. A simple example of a random graph with additional structure is a random graph with neighborhoods and local dependence within neighborhoods. We develop the first concentration and consistency results for maximum likelihood and $M$-estimators of a wide range of canonical and curved exponential-family models of random graphs with local dependence. All results are nonasymptotic and applicable to random graphs with finite populations of nodes, although asymptotic consistency results can be obtained as well. In addition, we show that additional structure can facilitate subgraph-to-graph estimation, and present concentration results for subgraph-to-graph estimators. As an application, we consider popular curved exponential-family models of random graphs, with local dependence induced by transitivity and parameter vectors whose dimensions depend on the number of nodes.

Full Article

Testing for principal component directions under weak identifiability

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Davy Paindaveine, Julien Remy, Thomas Verdebout.

Source: The Annals of Statistics, Volume 48, Number 1, 324--345.

Abstract:
We consider the problem of testing, on the basis of a $p$-variate Gaussian random sample, the null hypothesis $\mathcal{H}_{0}:\boldsymbol{\theta}_{1}=\boldsymbol{\theta}_{1}^{0}$ against the alternative $\mathcal{H}_{1}:\boldsymbol{\theta}_{1}\neq \boldsymbol{\theta}_{1}^{0}$, where $\boldsymbol{\theta}_{1}$ is the “first” eigenvector of the underlying covariance matrix and $\boldsymbol{\theta}_{1}^{0}$ is a fixed unit $p$-vector. In the classical setup where eigenvalues $\lambda_{1}>\lambda_{2}\geq \cdots \geq \lambda_{p}$ are fixed, the Anderson (Ann. Math. Stat. 34 (1963) 122–148) likelihood ratio test (LRT) and the Hallin, Paindaveine and Verdebout (Ann. Statist. 38 (2010) 3245–3299) Le Cam optimal test for this problem are asymptotically equivalent under the null hypothesis, hence also under sequences of contiguous alternatives. We show that this equivalence does not survive asymptotic scenarios where $\lambda_{n1}/\lambda_{n2}=1+O(r_{n})$ with $r_{n}=O(1/\sqrt{n})$. For such scenarios, the Le Cam optimal test still asymptotically meets the nominal level constraint, whereas the LRT severely overrejects the null hypothesis. Consequently, the former test should be favored over the latter one whenever the two largest sample eigenvalues are close to each other. By relying on Le Cam’s asymptotic theory of statistical experiments, we study the non-null and optimality properties of the Le Cam optimal test in the aforementioned asymptotic scenarios and show that the null robustness of this test is not obtained at the expense of power. Our asymptotic investigation is extensive in the sense that it allows $r_{n}$ to converge to zero at an arbitrary rate. While we restrict to single-spiked spectra of the form $\lambda_{n1}>\lambda_{n2}=\cdots =\lambda_{np}$ to make our results as striking as possible, we extend our results to the more general elliptical case. Finally, we present an illustrative real data example.

Full Article

Bootstrap confidence regions based on M-estimators under nonstandard conditions

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Stephen M. S. Lee, Puyudi Yang.

Source: The Annals of Statistics, Volume 48, Number 1, 274--299.

Abstract:
Suppose that a confidence region is desired for a subvector $\theta $ of a multidimensional parameter $\xi =(\theta ,\psi )$, based on an M-estimator $\hat{\xi }_{n}=(\hat{\theta }_{n},\hat{\psi }_{n})$ calculated from a random sample of size $n$. Under nonstandard conditions $\hat{\xi }_{n}$ often converges at a nonregular rate $r_{n}$, in which case consistent estimation of the distribution of $r_{n}(\hat{\theta }_{n}-\theta )$, a pivot commonly chosen for confidence region construction, is most conveniently effected by the $m$ out of $n$ bootstrap. The above choice of pivot has three drawbacks: (i) the shape of the region is either subjectively prescribed or controlled by a computationally intensive depth function; (ii) the region is not transformation equivariant; (iii) $\hat{\xi }_{n}$ may not be uniquely defined. To resolve the above difficulties, we propose a one-dimensional pivot derived from the criterion function, and prove that its distribution can be consistently estimated by the $m$ out of $n$ bootstrap, or by a modified version of the perturbation bootstrap. This leads to a new method for constructing confidence regions which are transformation equivariant and have shapes driven solely by the criterion function. A subsampling procedure is proposed for selecting $m$ in practice. Empirical performance of the new method is illustrated with examples drawn from different nonstandard M-estimation settings. Extension of our theory to row-wise independent triangular arrays is also explored.

Full Article

Spectral and matrix factorization methods for consistent community detection in multi-layer networks

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Subhadeep Paul, Yuguo Chen.

Source: The Annals of Statistics, Volume 48, Number 1, 230--250.

Abstract:
We consider the problem of estimating a consensus community structure by combining information from multiple layers of a multi-layer network using methods based on the spectral clustering or a low-rank matrix factorization. As a general theme, these “intermediate fusion” methods involve obtaining a low column rank matrix by optimizing an objective function and then using the columns of the matrix for clustering. However, the theoretical properties of these methods remain largely unexplored. In the absence of statistical guarantees on the objective functions, it is difficult to determine if the algorithms optimizing the objectives will return good community structures. We investigate the consistency properties of the global optimizer of some of these objective functions under the multi-layer stochastic blockmodel. For this purpose, we derive several new asymptotic results showing consistency of the intermediate fusion techniques along with the spectral clustering of mean adjacency matrix under a high dimensional setup, where the number of nodes, the number of layers and the number of communities of the multi-layer graph grow. Our numerical study shows that the intermediate fusion techniques outperform late fusion methods, namely spectral clustering on aggregate spectral kernel and module allegiance matrix in sparse networks, while they outperform the spectral clustering of mean adjacency matrix in multi-layer networks that contain layers with both homophilic and heterophilic communities.

Full Article

Optimal rates for community estimation in the weighted stochastic block model

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Min Xu, Varun Jog, Po-Ling Loh.

Source: The Annals of Statistics, Volume 48, Number 1, 183--204.

Abstract:
Community identification in a network is an important problem in fields such as social science, neuroscience and genetics. Over the past decade, stochastic block models (SBMs) have emerged as a popular statistical framework for this problem. However, SBMs have an important limitation in that they are suited only for networks with unweighted edges; in various scientific applications, disregarding the edge weights may result in a loss of valuable information. We study a weighted generalization of the SBM, in which observations are collected in the form of a weighted adjacency matrix and the weight of each edge is generated independently from an unknown probability density determined by the community membership of its endpoints. We characterize the optimal rate of misclustering error of the weighted SBM in terms of the Renyi divergence of order 1/2 between the weight distributions of within-community and between-community edges, substantially generalizing existing results for unweighted SBMs. Furthermore, we present a computationally tractable algorithm based on discretization that achieves the optimal error rate. Our method is adaptive in the sense that the algorithm, without assuming knowledge of the weight densities, performs as well as the best algorithm that knows the weight densities.
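
The Renyi divergence of order 1/2 mentioned in the abstract above has a simple closed form for discrete distributions. The sketch below assumes the edge weights have already been discretized into a few bins; the two example histograms are hypothetical and not taken from the paper.

```python
import math


def renyi_half(p, q):
    """Renyi divergence of order 1/2 between two discrete distributions.

    The general order-a formula (1 / (a - 1)) * log(sum_i p_i^a * q_i^(1 - a))
    reduces at a = 1/2 to -2 * log(sum_i sqrt(p_i * q_i)), which is symmetric
    in p and q.
    """
    if len(p) != len(q):
        raise ValueError("distributions must have the same number of bins")
    overlap = sum(math.sqrt(a * b) for a, b in zip(p, q))  # Bhattacharyya coefficient
    if overlap == 0.0:
        return math.inf  # disjoint supports: perfectly distinguishable
    return -2.0 * math.log(overlap)


# Hypothetical binned weight distributions for the two kinds of edges.
within_community = [0.1, 0.2, 0.7]
between_community = [0.3, 0.4, 0.3]
print(renyi_half(within_community, between_community))
```

A larger value means the within-community and between-community weight distributions are easier to tell apart, which matches the role the divergence plays in the error rate described above.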

Full Article

Model assisted variable clustering: Minimax-optimal recovery and algorithms

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

Florentina Bunea, Christophe Giraud, Xi Luo, Martin Royer, Nicolas Verzelen.

Source: The Annals of Statistics, Volume 48, Number 1, 111--137.

Abstract:
The problem of variable clustering is that of estimating groups of similar components of a $p$-dimensional vector $X=(X_{1},\ldots ,X_{p})$ from $n$ independent copies of $X$. There exists a large number of algorithms that return data-dependent groups of variables, but their interpretation is limited to the algorithm that produced them. An alternative is model-based clustering, in which one begins by defining population level clusters relative to a model that embeds notions of similarity. Algorithms tailored to such models yield estimated clusters with a clear statistical interpretation. We take this view here and introduce the class of $G$-block covariance models as a background model for variable clustering. In such models, two variables in a cluster are deemed similar if they have similar associations with all other variables. This can arise, for instance, when groups of variables are noise corrupted versions of the same latent factor. We quantify the difficulty of clustering data generated from a $G$-block covariance model in terms of cluster proximity, measured with respect to two related, but different, cluster separation metrics. We derive minimax cluster separation thresholds, which are the metric values below which no algorithm can recover the model-defined clusters exactly, and show that they are different for the two metrics. We therefore develop two algorithms, COD and PECOK, tailored to $G$-block covariance models, and study their minimax-optimality with respect to each metric. Of independent interest is the fact that the analysis of the PECOK algorithm, which is based on a corrected convex relaxation of the popular $K$-means algorithm, provides the first statistical analysis of such algorithms for variable clustering. Additionally, we compare our methods with another popular clustering method, spectral clustering. Extensive simulation studies, as well as our data analyses, confirm the applicability of our approach.

Full Article

Robust sparse covariance estimation by thresholding Tyler’s M-estimator

By projecteuclid.org
Published On :: Mon, 17 Feb 2020 04:02 EST

John Goes, Gilad Lerman, Boaz Nadler.

Source: The Annals of Statistics, Volume 48, Number 1, 86--110.

Abstract:
Estimating a high-dimensional sparse covariance matrix from a limited number of samples is a fundamental task in contemporary data analysis. Most proposals to date, however, are not robust to outliers or heavy tails. Toward bridging this gap, in this work we consider estimating a sparse shape matrix from $n$ samples following a possibly heavy-tailed elliptical distribution. We propose estimators based on thresholding either Tyler’s M-estimator or its regularized variant. We prove that in the joint limit as the dimension $p$ and the sample size $n$ tend to infinity with $p/n\to \gamma>0$, our estimators are minimax rate optimal. Results on simulated data support our theoretical analysis.

Full Article

Joint convergence of sample autocovariance matrices when $p/n\to 0$ with application

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Monika Bhattacharjee, Arup Bose.

Source: The Annals of Statistics, Volume 47, Number 6, 3470--3503.

Abstract:
Consider a high-dimensional linear time series model where the dimension $p$ and the sample size $n$ grow in such a way that $p/n\to 0$. Let $\hat{\Gamma }_{u}$ be the $u$th order sample autocovariance matrix. We first show that the LSD of any symmetric polynomial in $\{\hat{\Gamma }_{u},\hat{\Gamma }_{u}^{*},u\geq 0\}$ exists under independence and moment assumptions on the driving sequence together with weak assumptions on the coefficient matrices. This LSD result, with some additional effort, implies the asymptotic normality of the trace of any polynomial in $\{\hat{\Gamma }_{u},\hat{\Gamma }_{u}^{*},u\geq 0\}$. We also study similar results for several independent MA processes. We show applications of the above results to statistical inference problems such as in estimation of the unknown order of a high-dimensional MA process and in graphical and significance tests for hypotheses on coefficient matrices of one or several such independent processes.

Full Article

Minimax posterior convergence rates and model selection consistency in high-dimensional DAG models based on sparse Cholesky factors

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Kyoungjae Lee, Jaeyong Lee, Lizhen Lin.

Source: The Annals of Statistics, Volume 47, Number 6, 3413--3437.

Abstract:
In this paper we study the high-dimensional sparse directed acyclic graph (DAG) models under the empirical sparse Cholesky prior. Among our results, strong model selection consistency or graph selection consistency is obtained under more general conditions than those in the existing literature. Compared to Cao, Khare and Ghosh [ Ann. Statist. (2019) 47 319–348], the required conditions are weakened in terms of the dimensionality, sparsity and lower bound of the nonzero elements in the Cholesky factor. Furthermore, our result does not require the irrepresentable condition, which is necessary for Lasso-type methods. We also derive the posterior convergence rates for precision matrices and Cholesky factors with respect to various matrix norms. The obtained posterior convergence rates are the fastest among those of the existing Bayesian approaches. In particular, we prove that our posterior convergence rates for Cholesky factors are the minimax or at least nearly minimax depending on the relative size of true sparseness for the entire dimension. The simulation study confirms that the proposed method outperforms the competing methods.

Full Article

Hypothesis testing on linear structures of high-dimensional covariance matrix

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Shurong Zheng, Zhao Chen, Hengjian Cui, Runze Li.

Source: The Annals of Statistics, Volume 47, Number 6, 3300--3334.

Abstract:
This paper is concerned with tests of significance on high-dimensional covariance structures, and aims to develop a unified framework for testing commonly used linear covariance structures. We first construct a consistent estimator for parameters involved in the linear covariance structure, and then develop two tests for the linear covariance structures based on entropy loss and quadratic loss used for covariance matrix estimation. To study the asymptotic properties of the proposed tests, we study related high-dimensional random matrix theory, and establish several highly useful asymptotic results. With the aid of these asymptotic results, we derive the limiting distributions of these two tests under the null and alternative hypotheses. We further show that the quadratic loss based test is asymptotically unbiased. We conduct a Monte Carlo simulation study to examine the finite sample performance of the two tests. Our simulation results show that the limiting null distributions approximate their null distributions quite well, and the corresponding asymptotic critical values control the Type I error rate very well. Our numerical comparison implies that the proposed tests outperform existing ones in terms of controlling Type I error rate and power. Our simulation indicates that the test based on quadratic loss seems to have better power than the test based on entropy loss.

Full Article

Quantile regression under memory constraint

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Xi Chen, Weidong Liu, Yichen Zhang.

Source: The Annals of Statistics, Volume 47, Number 6, 3244--3273.

Abstract:
This paper studies the inference problem in quantile regression (QR) for a large sample size $n$ but under a limited memory constraint, where the memory can only store a small batch of data of size $m$. A natural method is the naive divide-and-conquer approach, which splits data into batches of size $m$, computes the local QR estimator for each batch and then aggregates the estimators via averaging. However, this method only works when $n=o(m^{2})$ and is computationally expensive. This paper proposes a computationally efficient method, which only requires an initial QR estimator on a small batch of data and then successively refines the estimator via multiple rounds of aggregations. Theoretically, as long as $n$ grows polynomially in $m$, we establish the asymptotic normality for the obtained estimator and show that our estimator with only a few rounds of aggregations achieves the same efficiency as the QR estimator computed on all the data. Moreover, our result allows the case that the dimensionality $p$ goes to infinity. The proposed method can also be applied to address the QR problem in a distributed computing environment (e.g., in a large-scale sensor network) or for real-time streaming data.
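
The naive divide-and-conquer baseline described in the abstract above might look like the following sketch. It is deliberately reduced to the intercept-only case, where the local QR estimator is a sample quantile; it does not implement the refined multi-round method the paper proposes. The function names are hypothetical.

```python
import math


def sample_quantile(values, tau):
    """A tau-th sample quantile, i.e. an intercept-only quantile regression fit."""
    if not values:
        raise ValueError("empty batch")
    ordered = sorted(values)
    # Smallest order statistic whose empirical CDF reaches tau.
    index = max(0, math.ceil(tau * len(ordered)) - 1)
    return ordered[index]


def divide_and_conquer_quantile(stream, tau, m):
    """Average of per-batch quantiles, holding at most m observations in memory."""
    total, batches, batch = 0.0, 0, []
    for x in stream:
        batch.append(x)
        if len(batch) == m:
            total += sample_quantile(batch, tau)
            batches += 1
            batch = []
    if batch:
        # Simplification: a final, shorter batch gets the same weight as the others.
        total += sample_quantile(batch, tau)
        batches += 1
    if batches == 0:
        raise ValueError("empty stream")
    return total / batches


# Toy stream 1, 2, ..., 100 processed ten observations at a time.
print(divide_and_conquer_quantile(range(1, 101), 0.5, 10))  # 50.0
```

Only one batch is ever held in memory, which is the point of the approach; the price is that each local estimate is computed from just $m$ observations, and averaging does not remove the resulting bias.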

Full Article

Adaptive estimation of the rank of the coefficient matrix in high-dimensional multivariate response regression models

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Xin Bing, Marten H. Wegkamp.

Source: The Annals of Statistics, Volume 47, Number 6, 3157--3184.

Abstract:
We consider the multivariate response regression problem with a regression coefficient matrix of low, unknown rank. In this setting, we analyze a new criterion for selecting the optimal reduced rank. This criterion differs notably from the one proposed in Bunea, She and Wegkamp ( Ann. Statist. 39 (2011) 1282–1309) in that it does not require estimation of the unknown variance of the noise, nor does it depend on a delicate choice of a tuning parameter. We develop an iterative, fully data-driven procedure, that adapts to the optimal signal-to-noise ratio. This procedure finds the true rank in a few steps with overwhelming probability. At each step, our estimate increases, while at the same time it does not exceed the true rank. Our finite sample results hold for any sample size and any dimension, even when the number of responses and of covariates grow much faster than the number of observations. We perform an extensive simulation study that confirms our theoretical findings. The new method performs better and is more stable than the procedure of Bunea, She and Wegkamp ( Ann. Statist. 39 (2011) 1282–1309) in both low- and high-dimensional settings.

Full Article

Randomized incomplete $U$-statistics in high dimensions

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Xiaohui Chen, Kengo Kato.

Source: The Annals of Statistics, Volume 47, Number 6, 3127--3156.

Abstract:
This paper studies inference for the mean vector of a high-dimensional $U$-statistic. In the era of big data, the dimension $d$ of the $U$-statistic and the sample size $n$ of the observations tend to be both large, and the computation of the $U$-statistic is prohibitively demanding. Data-dependent inferential procedures such as the empirical bootstrap for $U$-statistics are even more computationally expensive. To overcome such a computational bottleneck, incomplete $U$-statistics obtained by sampling fewer terms of the $U$-statistic are attractive alternatives. In this paper, we introduce randomized incomplete $U$-statistics with sparse weights whose computational cost can be made independent of the order of the $U$-statistic. We derive nonasymptotic Gaussian approximation error bounds for the randomized incomplete $U$-statistics in high dimensions, namely in cases where the dimension $d$ is possibly much larger than the sample size $n$, for both nondegenerate and degenerate kernels. In addition, we propose generic bootstrap methods for the incomplete $U$-statistics that are computationally much less demanding than existing bootstrap methods, and establish finite sample validity of the proposed bootstrap methods. Our methods are illustrated on the application to nonparametric testing for the pairwise independence of a high-dimensional random vector under weaker assumptions than those appearing in the literature.
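
The idea of sampling fewer terms of a $U$-statistic can be sketched for a scalar, order-2 kernel. This is an illustrative simplification, not the paper's construction: each pair is kept independently with a fixed probability and the kept terms are averaged. The loop still visits every pair, so it saves kernel evaluations only. All names are hypothetical.

```python
import itertools
import random


def complete_u_statistic(data, kernel):
    """Average of a symmetric order-2 kernel over all pairs i < j."""
    pairs = list(itertools.combinations(data, 2))
    return sum(kernel(x, y) for x, y in pairs) / len(pairs)


def incomplete_u_statistic(data, kernel, keep_prob, seed=0):
    """Average of the kernel over pairs kept independently with probability keep_prob."""
    rng = random.Random(seed)
    total, kept = 0.0, 0
    for x, y in itertools.combinations(data, 2):
        if rng.random() < keep_prob:  # Bernoulli sampling of the terms
            total += kernel(x, y)
            kept += 1
    if kept == 0:
        raise ValueError("no pairs were sampled; increase keep_prob")
    return total / kept


def half_squared_difference(x, y):
    """Kernel whose complete U-statistic is the unbiased sample variance."""
    return 0.5 * (x - y) ** 2


data = [1.0, 2.0, 4.0, 7.0]
print(complete_u_statistic(data, half_squared_difference))         # 7.0
print(incomplete_u_statistic(data, half_squared_difference, 0.5))  # varies with the seed
```

With `keep_prob` equal to 1 the incomplete statistic coincides with the complete one; smaller values trade accuracy for fewer kernel evaluations.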

Full Article

Active ranking from pairwise comparisons and when parametric assumptions do not help

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Reinhard Heckel, Nihar B. Shah, Kannan Ramchandran, Martin J. Wainwright.

Source: The Annals of Statistics, Volume 47, Number 6, 3099--3126.

Abstract:
We consider sequential or active ranking of a set of $n$ items based on noisy pairwise comparisons. Items are ranked according to the probability that a given item beats a randomly chosen item, and ranking refers to partitioning the items into sets of prespecified sizes according to their scores. This notion of ranking includes as special cases the identification of the top-$k$ items and the total ordering of the items. We first analyze a sequential ranking algorithm that counts the number of comparisons won, and uses these counts to decide whether to stop, or to compare another pair of items, chosen based on confidence intervals specified by the data collected up to that point. We prove that this algorithm succeeds in recovering the ranking using a number of comparisons that is optimal up to logarithmic factors. This guarantee does not depend on whether the underlying pairwise probability matrix satisfies a particular structural property, unlike a significant body of past work on pairwise ranking based on parametric models such as the Thurstone or Bradley–Terry–Luce models. It has been a long-standing open question as to whether or not imposing these parametric assumptions allows for improved ranking algorithms. For stochastic comparison models, in which the pairwise probabilities are bounded away from zero, our second contribution is to resolve this issue by proving a lower bound for parametric models. This shows, perhaps surprisingly, that these popular parametric modeling choices offer at most logarithmic gains for stochastic comparisons.
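The counting-with-confidence-intervals idea can be sketched for the top-$k$ special case. The code below is a simplified elimination-style illustration, not the algorithm analyzed in the paper: the function name `active_top_k`, the `compare` callback, and the Hoeffding-based confidence radius are all assumptions made for this example.

```python
import math
import random


def active_top_k(n, k, compare, delta=0.05, max_rounds=100_000, rng=None):
    """Return the set of items judged to be in the top k.

    compare(i, j) is a hypothetical callback returning True when item i wins
    a noisy comparison against item j.
    """
    rng = rng or random.Random()
    wins = [0] * n
    pulls = [0] * n
    undecided = set(range(n))
    top = set()
    for _ in range(max_rounds):
        if not undecided:
            break
        # Each undecided item plays one comparison against a random opponent.
        for i in sorted(undecided):
            j = rng.choice([m for m in range(n) if m != i])
            wins[i] += bool(compare(i, j))
            pulls[i] += 1
        # Hoeffding-style confidence interval around each empirical win rate.
        lower, upper = [], []
        for i in range(n):
            t = pulls[i]
            radius = math.sqrt(math.log(4 * n * t * t / delta) / (2 * t))
            lower.append(wins[i] / t - radius)
            upper.append(wins[i] / t + radius)
        for i in sorted(undecided):
            beaten = sum(upper[j] < lower[i] for j in range(n) if j != i)
            beaten_by = sum(lower[j] > upper[i] for j in range(n) if j != i)
            if beaten >= n - k:      # confidently above at least n - k items
                top.add(i)
                undecided.discard(i)
            elif beaten_by >= k:     # confidently below at least k items
                undecided.discard(i)
    # If max_rounds is exhausted, items still undecided are simply left out.
    return top
```

Sampling stops for an item as soon as its interval separates it from enough other items, so items far from the top-$k$ boundary receive few comparisons and most of the budget goes to the items near it.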

Full Article

Sorted concave penalized regression

By projecteuclid.org
Published On :: Wed, 30 Oct 2019 22:03 EDT

Long Feng, Cun-Hui Zhang.

Source: The Annals of Statistics, Volume 47, Number 6, 3069--3098.

Abstract:
The Lasso is biased. Concave penalized least squares estimation (PLSE) takes advantage of signal strength to reduce this bias, leading to sharper error bounds in prediction, coefficient estimation and variable selection. For prediction and estimation, the bias of the Lasso can also be reduced by taking a smaller penalty level than what selection consistency requires, but such a smaller penalty level depends on the sparsity of the true coefficient vector. The sorted $\ell_{1}$ penalized estimation (Slope) was proposed for adaptation to such smaller penalty levels. However, the advantages of concave PLSE and Slope do not subsume each other. We propose sorted concave penalized estimation to combine the advantages of concave and sorted penalizations. We prove that sorted concave penalties adaptively choose the smaller penalty level and at the same time benefit from signal strength, especially when a significant proportion of signals are stronger than the corresponding adaptively selected penalty levels. A local convex approximation for sorted concave penalties, which extends the local linear and quadratic approximations for separable concave penalties, is developed to facilitate the computation of sorted concave PLSE and is proven to possess the desired prediction and estimation error bounds. Our analysis of prediction and estimation errors requires nothing beyond the restricted eigenvalue condition on the design, and additionally provides selection consistency under a required minimum signal strength condition. Thus, our results also sharpen existing results on concave PLSE by removing the upper sparse eigenvalue component of the sparse Riesz condition.
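The two ingredients being combined can be written down as a short sketch. It uses the standard sorted $\ell_{1}$ (Slope) penalty and, as one common example of a separable concave penalty, the minimax concave penalty (MCP). The function names are hypothetical, and the paper's combined sorted concave penalty is not reproduced here.

```python
def sorted_l1_penalty(coefs, lambdas):
    # Slope penalty: the largest |coefficient| is paired with the largest
    # penalty level, the second largest with the second largest, and so on.
    magnitudes = sorted((abs(b) for b in coefs), reverse=True)
    weights = sorted(lambdas, reverse=True)
    if len(magnitudes) != len(weights):
        raise ValueError("coefs and lambdas must have the same length")
    return sum(w * m for w, m in zip(weights, magnitudes))


def mcp_penalty(t, lam, gamma):
    # Minimax concave penalty: behaves like lam * |t| near zero, then
    # flattens to the constant gamma * lam^2 / 2 once |t| >= gamma * lam,
    # so strong signals incur no additional shrinkage.
    a = abs(t)
    if a <= gamma * lam:
        return lam * a - a * a / (2.0 * gamma)
    return 0.5 * gamma * lam * lam
```

For coefficients `[1.0, -3.0, 2.0]` and penalty levels `[3.0, 2.0, 1.0]`, the sorted penalty pairs 3 with 3, 2 with 2 and 1 with 1, giving 9 + 4 + 1 = 14.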

Full Article

Inference for the mode of a log-concave density

By projecteuclid.org
Published On :: Fri, 02 Aug 2019 22:04 EDT

Charles R. Doss, Jon A. Wellner.

Source: The Annals of Statistics, Volume 47, Number 5, 2950--2976.

Abstract:
We study a likelihood ratio test for the location of the mode of a log-concave density. Our test is based on comparison of the log-likelihoods corresponding to the unconstrained maximum likelihood estimator of a log-concave density and the constrained maximum likelihood estimator where the constraint is that the mode of the density is fixed, say at $m$. The constrained estimation problem is studied in detail in Doss and Wellner (2018). Here, the results of that paper are used to show that, under the null hypothesis (and strict curvature of $-\log f$ at the mode), the likelihood ratio statistic is asymptotically pivotal: that is, it converges in distribution to a limiting distribution which is free of nuisance parameters, thus playing the role of the $\chi_{1}^{2}$ distribution in classical parametric statistical problems. By inverting this family of tests, we obtain new (likelihood ratio based) confidence intervals for the mode of a log-concave density $f$. These new intervals do not depend on any smoothing parameters. We study the new confidence intervals via Monte Carlo methods and illustrate them with two real data sets. The new intervals seem to have several advantages over existing procedures. Software implementing the test and confidence intervals is available in the R package logcondens.mode.

Full Article

Test for high-dimensional correlation matrices

By projecteuclid.org
Published On :: Fri, 02 Aug 2019 22:04 EDT

Shurong Zheng, Guanghui Cheng, Jianhua Guo, Hongtu Zhu.

Source: The Annals of Statistics, Volume 47, Number 5, 2887--2921.

Abstract:
Testing correlation structures has attracted extensive attention in the literature due to both its importance in real applications and several major theoretical challenges. The aim of this paper is to develop a general framework for testing correlation structures in the one-, two- and multiple-sample testing problems under a high-dimensional setting where both the sample size and the data dimension go to infinity. Our test statistics are designed to deal with both dense and sparse alternatives. We systematically investigate the asymptotic null distribution, power function and unbiasedness of each test statistic. Theoretically, we make great efforts to deal with the nonindependency of all random matrices of the sample correlation matrices. We use simulation studies and real data analysis to illustrate the versatility and practicability of our test statistics.

Full Article

Eigenvalue distributions of variance components estimators in high-dimensional random effects models

By projecteuclid.org
Published On :: Fri, 02 Aug 2019 22:04 EDT

Zhou Fan, Iain M. Johnstone.

Source: The Annals of Statistics, Volume 47, Number 5, 2855--2886.

Abstract:
We study the spectra of MANOVA estimators for variance component covariance matrices in multivariate random effects models. When the dimensionality of the observations is large and comparable to the number of realizations of each random effect, we show that the empirical spectra of such estimators are well approximated by deterministic laws. The Stieltjes transforms of these laws are characterized by systems of fixed-point equations, which are numerically solvable by a simple iterative procedure. Our proof uses operator-valued free probability theory, and we establish a general asymptotic freeness result for families of rectangular orthogonally invariant random matrices, which is of independent interest. Our work is motivated in part by the estimation of components of covariance between multiple phenotypic traits in quantitative genetics, and we specialize our results to common experimental designs that arise in this application.

Full Article

Exact lower bounds for the agnostic probably-approximately-correct (PAC) machine learning model

By projecteuclid.org
Published On :: Fri, 02 Aug 2019 22:04 EDT

Aryeh Kontorovich, Iosif Pinelis.

Source: The Annals of Statistics, Volume 47, Number 5, 2822--2854.

Abstract:
We provide an exact nonasymptotic lower bound on the minimax expected excess risk (EER) in the agnostic probably-approximately-correct (PAC) machine learning classification model and identify minimax learning algorithms as certain maximally symmetric and minimally randomized “voting” procedures. Based on this result, an exact asymptotic lower bound on the minimax EER is provided. This bound is of the simple form $c_{\infty}/\sqrt{\nu}$ as $\nu\to\infty$, where $c_{\infty}=0.16997\dots$ is a universal constant, $\nu=m/d$, $m$ is the size of the training sample and $d$ is the Vapnik–Chervonenkis dimension of the hypothesis class. It is shown that the differences between these asymptotic and nonasymptotic bounds, as well as the differences between these two bounds and the maximum EER of any learning algorithms that minimize the empirical risk, are asymptotically negligible, and all these differences are due to ties in the mentioned “voting” procedures. A few easy-to-compute nonasymptotic lower bounds on the minimax EER are also obtained, which are shown to be close to the exact asymptotic lower bound $c_{\infty}/\sqrt{\nu}$ even for rather small values of the ratio $\nu=m/d$. As an application of these results, we substantially improve existing lower bounds on the tail probability of the excess risk. Among the tools used are Bayes estimation and apparently new identities and inequalities for binomial distributions.
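As a small worked example, the asymptotic form $c_{\infty}/\sqrt{\nu}$ with $\nu=m/d$ can be evaluated directly. The sketch below assumes the constant truncated to the five digits quoted in the abstract, and the function name is hypothetical. It evaluates only the asymptotic expression, not the exact nonasymptotic bound.

```python
import math

# Universal constant from the abstract, truncated to the digits quoted there.
C_INF = 0.16997


def asymptotic_eer_lower_bound(m, d):
    # m: training sample size; d: VC dimension of the hypothesis class.
    nu = m / d
    return C_INF / math.sqrt(nu)
```

For instance, with $m=1000$ and $d=10$ the ratio is $\nu=100$, so the expression evaluates to roughly $0.16997/10 \approx 0.017$.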

Full Article

On testing conditional qualitative treatment effects

By projecteuclid.org
Published On :: Tue, 21 May 2019 04:00 EDT

Chengchun Shi, Rui Song, Wenbin Lu.

Source: The Annals of Statistics, Volume 47, Number 4, 2348--2377.

Abstract:
Precision medicine is an emerging medical paradigm that focuses on finding the most effective treatment strategy tailored for individual patients. In the literature, most existing work has focused on estimating the optimal treatment regime. However, less attention has been devoted to hypothesis testing regarding the optimal treatment regime. In this paper, we first introduce the notion of conditional qualitative treatment effects (CQTE) of a set of variables given another set of variables and provide a class of equivalent representations for the null hypothesis of no CQTE. The proposed definition of CQTE does not assume any parametric form for the optimal treatment rule and plays an important role in assessing the incremental value of a set of new variables in optimal treatment decision making conditional on an existing set of prescriptive variables. We then propose novel testing procedures for no CQTE based on kernel estimation of the conditional contrast functions. We show that our test statistics have asymptotically correct size and nonnegligible power against some nonstandard local alternatives. The empirical performance of the proposed tests is evaluated by simulations and an application to an AIDS data set.

Full Article
