Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums

18/07/2021

Chaobing Song, Stephen Wright, Jelena Diakonikolas

Keywords: Optimization, Convex Optimization

Abstract: Structured nonsmooth convex finite-sum optimization appears in many machine learning applications, including support vector machines and least absolute deviation. For the primal-dual formulation of this problem, we propose a novel algorithm called Variance Reduction via Primal-Dual Accelerated Dual Averaging (VRPDA$^2$). In the nonsmooth and general convex setting, VRPDA$^2$ has the overall complexity $O(nd\log\min\{1/\epsilon, n\} + d/\epsilon)$ in terms of the primal-dual gap, where $n$ denotes the number of samples, $d$ the dimension of the primal variables, and $\epsilon$ the desired accuracy. In the nonsmooth and strongly convex setting, the overall complexity of VRPDA$^2$ becomes $O(nd\log\min\{1/\epsilon, n\} + d/\sqrt{\epsilon})$ in terms of both the primal-dual gap and the distance between the iterate and the optimal solution. Both of these results improve significantly on state-of-the-art complexity estimates, which are $O(nd\log\min\{1/\epsilon, n\} + \sqrt{n}d/\epsilon)$ for the nonsmooth and general convex setting and $O(nd\log\min\{1/\epsilon, n\} + \sqrt{n}d/\sqrt{\epsilon})$ for the nonsmooth and strongly convex setting, and they are obtained with a simpler and more straightforward algorithm and analysis. Moreover, both complexities are better than lower bounds for general convex finite-sum optimization, because our approach makes use of additional, commonly occurring structure. Numerical experiments reveal competitive performance of VRPDA$^2$ compared to state-of-the-art approaches.

The talk and the corresponding paper were published at the ICML 2021 virtual conference.

Similar Papers

12/07/2020

Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: Joint Gradient Estimation and Tracking

Haoran Sun, Songtao Lu, Mingyi Hong

Keywords: Optimization - Non-convex

06/12/2020

Variance Reduction via Accelerated Dual Averaging for Finite-Sum Optimization

Chaobing Song, Yong Jiang, Yi Ma

Keywords: Algorithms -> Boosting and Ensemble Methods; Applications -> Hardware and Systems; Applications -> Natural Language Processing; Applications -> Privacy, Anonymity, and Security

12/07/2020

On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems

Darren Lin, Chi Jin, Michael Jordan

Keywords: Optimization - Non-convex

02/02/2021

Enhancing Parameter-Free Frank Wolfe with an Extra Subproblem

Bingcong Li, Lingda Wang, Georgios B. Giannakis, Zhizhen Zhao

06/12/2020

Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems

Luo Luo, Haishan Ye, Zhichao Huang, Tong Zhang

18/07/2021

Private Stochastic Convex Optimization: Optimal Rates in L1 Geometry

Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar

Keywords: Deep Learning, Algorithms, Multitask and Transfer Learning; Algorithms, Online Learning, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

18/07/2021

One-sided Frank-Wolfe algorithms for saddle problems

Vladimir Kolmogorov, Thomas Pock

Keywords: Optimization, Convex Optimization

06/12/2021

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization

Tian Ye, Simon Du

Keywords: theory, optimization

13/04/2021

Adaptive sampling for fast constrained maximization of submodular functions

Francesco Quinzan, Vanja Doskoc, Andreas Göbel, Tobias Friedrich

18/07/2021

Leveraging Non-uniformity in First-order Non-convex Optimization

Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Keywords: Theory, RL, Decisions and Control Theory

18/07/2021

PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization

Zhize Li, Hongyan Bao, Xiangliang Zhang, Peter Richtarik

Keywords: Optimization

06/12/2020

Escaping Saddle-Point Faster under Interpolation-like Conditions

Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra

06/12/2021

Unifying Width-Reduced Methods for Quasi-Self-Concordant Optimization

Deeksha Adil, Brian Bullins, Sushant Sachdeva

Keywords: optimization

03/05/2021

Local Search Algorithms for Rank-Constrained Convex Optimization

Kyriakos Axiotis, Maxim Sviridenko

Keywords: matrix completion, rank-constrained convex optimization, low rank

06/12/2020

Hybrid Variance-Reduced SGD Algorithms For Minimax Problems with Nonconvex-Linear Function

Quoc Tran Dinh, Deyi Liu, Lam Nguyen

06/12/2021

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser, Gleb Novikov, David Steurer, Stefan Tiegel

Keywords: optimization

06/12/2020

Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations

Zhuoran Yang, Chi Jin, Zhaoran Wang, Mengdi Wang, Michael Jordan

06/12/2020

Large-Scale Methods for Distributionally Robust Optimization

Daniel Levy, Yair Carmon, John Duchi, Aaron Sidford

12/07/2020

Sparse Convex Optimization via Adaptively Regularized Hard Thresholding

Kyriakos Axiotis, Maxim Sviridenko

Keywords: Optimization - General

06/12/2021

Calibration and Consistency of Adversarial Surrogate Losses

Pranjal Awasthi, Natalie Frank, Anqi Mao, Mehryar Mohri, Yutao Zhong

Keywords: theory, optimization, machine learning, robustness, adversarial robustness and security

26/08/2020

Ordered SGD: A New Stochastic Optimization Framework for Empirical Risk Minimization

Kenji Kawaguchi, Haihao Lu

09/07/2020

How to trap a gradient flow

Dan Mikulincer, Sebastien Bubeck

Keywords: Non-convex optimization

09/07/2020

Second-Order Information in Non-Convex Stochastic Optimization: Power and Limitations

Yossi Arjevani, Yair Carmon, John Duchi, Dylan Foster, Ayush Sekhari, Karthik Sridharan

Keywords: Non-convex optimization, Stochastic optimization

13/04/2021

Efficient methods for structured nonconvex-nonconcave min-max optimization

Jelena Diakonikolas, Constantinos Daskalakis, Michael Jordan

06/12/2021

An Online Riemannian PCA for Stochastic Canonical Correlation Analysis

Zihang Meng, Rudrasis Chakraborty, Vikas Singh

Keywords: optimization, fairness

26/04/2020

Gradientless Descent: High-Dimensional Zeroth-Order Optimization

Daniel Golovin, John Karro, Greg Kochanski, Chansoo Lee, Xingyou Song, Qiuyi Zhang

Keywords: Zeroth Order Optimization

18/07/2021

Submodular Maximization subject to a Knapsack Constraint: Combinatorial Algorithms with Near-optimal Adaptive Complexity

Yorgos Amanatidis, Federico Fusco, Philip Lazos, Stefano Leonardi, Alberto Marchetti-Spaccamela, Rebecca Reiffenhäuser

Keywords: Optimization, Combinatorial Optimization

18/07/2021

Randomized Algorithms for Submodular Function Maximization with a $k$-System Constraint

Shuang Cui, Kai Han, Tianshuai Zhu, Jing Tang, Benwei Wu, He Huang

Keywords: Optimization

06/12/2020

Faster Randomized Infeasible Interior Point Methods for Tall/Wide Linear Programs

Agniva Chowdhury, Palma London, Haim Avron, Petros Drineas

06/12/2020

A Catalyst Framework for Minimax Optimization

Junchi Yang, Siqi Zhang, Negar Kiyavash, Niao He

13/04/2021

Rate-improved inexact augmented lagrangian method for constrained nonconvex optimization

Zichong Li, Pin-Yu Chen, Sijia Liu, Songtao Lu, Yangyang Xu

26/08/2020

On the Convergence of SARAH and Beyond

Bingcong Li, Meng Ma, Georgios B. Giannakis

26/08/2020

Stochastic Recursive Variance-Reduced Cubic Regularization Methods

Dongruo Zhou, Quanquan Gu

18/07/2021

Regret and Cumulative Constraint Violation Analysis for Online Convex Optimization with Long Term Constraints

Xinlei Yi, Xiuxian Li, Tao Yang, Lihua Xie, Tianyou Chai, Karl Johansson

Keywords: Algorithms, Online Learning Algorithms

06/12/2021

Escape saddle points by a simple gradient-descent based algorithm

Chenyi Zhang, Tongyang Li

Keywords: optimization

06/12/2020

Projection Efficient Subgradient Method and Optimal Nonsmooth Frank-Wolfe Method

Kiran Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

06/12/2020

Projection Robust Wasserstein Distance and Riemannian Optimization

Darren Lin, Chenyou Fan, Nhat Ho, Marco Cuturi, Michael Jordan

Keywords: Optimization -> Non-Convex Optimization; Optimization -> Stochastic Optimization; Deep Learning -> Optimization for Deep Networks

06/12/2020

Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization

Yan Yan, Yi Xu, Qihang Lin, Wei Liu, Tianbao Yang

04/08/2021

Frank-Wolfe with Nearest Extreme Point Oracle

Dan Garber, Noam Wolf

06/12/2021

Stochastic $L^\natural$-convex Function Minimization

Haixiang Zhang, Zeyu Zheng, Javad Lavaei

Keywords: optimization