Latest m news

On the Complexity Analysis of the Primal Solutions for the Accelerated Randomized Dual Coordinate Ascent

By
Published On :: 2020

Dual first-order methods are essential techniques for large-scale constrained convex optimization. However, when recovering the primal solutions, we need $T(epsilon^{-2})$ iterations to achieve an $epsilon$-optimal primal solution when we apply an algorithm to the non-strongly convex dual problem with $T(epsilon^{-1})$ iterations to achieve an $epsilon$-optimal dual solution, where $T(x)$ can be $x$ or $sqrt{x}$. In this paper, we prove that the iteration complexity of the primal solutions and dual solutions have the same $Oleft(frac{1}{sqrt{epsilon}} ight)$ order of magnitude for the accelerated randomized dual coordinate ascent. When the dual function further satisfies the quadratic functional growth condition, by restarting the algorithm at any period, we establish the linear iteration complexity for both the primal solutions and dual solutions even if the condition number is unknown. When applied to the regularized empirical risk minimization problem, we prove the iteration complexity of $Oleft(nlog n+sqrt{frac{n}{epsilon}} ight)$ in both primal space and dual space, where $n$ is the number of samples. Our result takes out the $left(log frac{1}{epsilon} ight)$ factor compared with the methods based on smoothing/regularization or Catalyst reduction. As far as we know, this is the first time that the optimal $Oleft(sqrt{frac{n}{epsilon}} ight)$ iteration complexity in the primal space is established for the dual coordinate ascent based stochastic algorithms. We also establish the accelerated linear complexity for some problems with nonsmooth loss, e.g., the least absolute deviation and SVM.

On the Complexity Analysis of the Primal Solutions for the Accelerated Randomized Dual Coordinate Ascent

Graph-Dependent Implicit Regularisation for Distributed Stochastic Subgradient Descent

Noise Accumulation in High Dimensional Classification and Total Signal Index

Latent Simplex Position Model: High Dimensional Multi-view Clustering with Uncertainty Quantification

Learning Linear Non-Gaussian Causal Models in the Presence of Latent Variables

Optimal Bipartite Network Clustering

Switching Regression Models and Causal Inference in the Presence of Discrete Latent Variables

Greedy Attack and Gumbel Attack: Generating Adversarial Examples for Discrete Data

Dynamical Systems as Temporal Feature Spaces

A Convex Parametrization of a New Class of Universal Kernel Functions

pyts: A Python Package for Time Series Classification

Ancestral Gumbel-Top-k Sampling for Sampling Without Replacement

Skill Rating for Multiplayer Games. Introducing Hypernode Graphs and their Spectral Theory

Ensemble Learning for Relational Data

Sparse and low-rank multivariate Hawkes processes

Expected Policy Gradients for Reinforcement Learning

High-Dimensional Inference for Cluster-Based Graphical Models

Conjugate Gradients for Kernel Machines

Fast Rates for General Unbounded Loss Functions: From ERM to Generalized Bayes

Self-paced Multi-view Co-training

Robust Asynchronous Stochastic Gradient-Push: Asymptotically Optimal and Network-Independent Performance for Strongly Convex Functions

Exact Guarantees on the Absence of Spurious Local Minima for Non-negative Rank-1 Robust Principal Component Analysis

Kymatio: Scattering Transforms in Python

Multiparameter Persistence Landscapes

Generalized Optimal Matching Methods for Causal Inference

Unique Sharp Local Minimum in L1-minimization Complete Dictionary Learning

Community-Based Group Graphical Lasso

Smoothed Nonparametric Derivative Estimation using Weighted Difference Quotients

WONDER: Weighted One-shot Distributed Ridge Regression in High Dimensions

On Stationary-Point Hitting Time and Ergodicity of Stochastic Gradient Langevin Dynamics

Union of Low-Rank Tensor Spaces: Clustering and Completion

Representation Learning for Dynamic Graphs: A Survey

Estimation of a Low-rank Topic-Based Model for Information Cascades

(1 + epsilon)-class Classification: an Anomaly Detection Method for Highly Imbalanced or Incomplete Data Sets

Scalable Approximate MCMC Algorithms for the Horseshoe Prior

High-dimensional Gaussian graphical models on network-linked data

Identifiability of Additive Noise Models Using Conditional Variances

GADMM: Fast and Communication Efficient Framework for Distributed Machine Learning

Multi-Player Bandits: The Adversarial Case

Portraits of women in the collection

Top books to read at home

Q&A with Tara June Winch

Researching the Pacific: The Pacific Manuscripts Bureau

Cook commemoration sparks 1970 protest

Access thousands of newspapers and magazines with PressReader

Q&A with Adam Ferguson

Youth & Community Initiatives Funding available

Crime Prevention at Home

Health & Active Living Challenge

Mosquito Control Program

The finish line: Attachment of Signs

The Finish Line: Foam Shapes Revisited

The Finish Line: Adhesives vs. Mechanical Fasteners

The Finish Line: A (Faux) Monument for the Ages

The Finish Line: Right Solutions for the Right Problems

EPDs, HPDs and Red Lists (Oh My)!

Meeting Codes with Wall Assemblies

Green Advocacy vs. Informed Consent

Green Building Mistakes

Embodied Energy of Building Materials

Farming with Shipping Containers

Passive Houses Gain Momentum

Climate Change

How Much Rain does a Rainscreen Screen?

How Much Rain Does a Rainscreen Screen? (Part 2)

Subscribe To Our Newsletter