Random Shuffling Beats SGD Only After Many Epochs on Ill-Conditioned Problems

06/12/2021

Random Shuffling Beats SGD Only After Many Epochs on Ill-Conditioned Problems

Itay Safran, Ohad Shamir

Keywords: optimization, machine learning

Abstract Paper Similar Papers

Abstract: Recently, there has been much interest in studying the convergence rates of without-replacement SGD, and proving that it is faster than with-replacement SGD in the worst case. However, these works ignore or do not provide tight bounds in terms of the problem's geometry, including its condition number. Perhaps surprisingly, we prove that when the condition number is taken into account, without-replacement SGD \emph{does not} significantly improve on with-replacement SGD in terms of worst-case bounds, unless the number of epochs (passes over the data) is larger than the condition number. Since many problems in machine learning and other areas are both ill-conditioned and involve large datasets, this indicates that without-replacement does not necessarily improve over with-replacement sampling for realistic iteration budgets. We show this by providing new lower and upper bounds which are tight (up to log factors), for quadratic problems with commuting quadratic terms, precisely quantifying the dependence on the problem parameters.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

A Riemannian Block Coordinate Descent Method for Computing the Projection Robust Wasserstein Distance

Minhui Huang, Shiqian Ma, Lifeng Lai

Keywords Paper

Algorithms, Optimal Transport

0

0

0

1

5:14

26/08/2020

Approximate Cross-Validation in High Dimensions with Guarantees

William Stephenson, Tamara Broderick

Keywords Paper

0

0

1

1

14:35

12/07/2020

Black-Box Methods for Restoring Monotonicity

Evangelia Gergatsouli, Brendan Lucier, Christos Tzamos

Keywords Paper

Learning Theory

0

0

0

0

15:40

18/07/2021

A Novel Sequential Coreset Method for Gradient Descent Algorithms

Jiawei Huang, Ruomin Huang, wenjie liu and
Nikolaos Freris, Hu Ding

Keywords Paper

Optimization

0

0

0

0

5:15

06/12/2020

Approximate Cross-Validation with Low-Rank Data in High Dimensions

Will Stephenson, Madeleine Udell, Tamara Broderick

Keywords Paper

0

0

0

0

3:02

12/07/2020

Implicit differentiation of Lasso-type models for hyperparameter optimization

Quentin Bertrand, Quentin Klopfenstein, Mathieu Blondel and
Samuel Vaiter, Alexandre Gramfort, Joseph Salmon

Keywords Paper

Optimization - General

0

0

0

0

16:18

19/04/2021

Reanalyzing the most probable sentence problem: A case study in explicating the role of entropy in algorithmic complexity

Eric Corlett, Gerald Penn

Keywords Paper

0

0

0

0

11:08

06/12/2021

Calibration and Consistency of Adversarial Surrogate Losses

Pranjal Awasthi, Natalie Frank, Anqi Mao and
Mehryar Mohri, Yutao Zhong

Keywords Paper

theory, optimization, machine learning, robustness, adversarial robustness and security

0

0

0

0

13:30

18/07/2021

The Limits of Min-Max Optimization Algorithms: Convergence to Spurious Non-Critical Sets

Ya-Ping Hsieh, Panayotis Mertikopoulos, Volkan Cevher

Keywords Paper

Theory

0

0

0

0

16:38

18/07/2021

Submodular Maximization subject to a Knapsack Constraint: Combinatorial Algorithms with Near-optimal Adaptive Complexity

Yorgos Amanatidis, Federico Fusco, Philip Lazos and
Stefano Leonardi, Alberto Marchetti-Spaccamela, Rebecca Reiffenhäuser

Keywords Paper

Optimization, Combinatorial Optimization

0

0

0

0

5:15

23/08/2020

A block decomposition algorithm for sparse optimization

Ganzhao Yuan, Li Shen, Wei-Shi Zheng

Keywords Paper

NP-hard, nonconvex optimization, block coordinate descent, sparse optimization, convex optimization

0

0

0

0

18:12

12/07/2020

The FAST Algorithm for Submodular Maximization

Adam Breuer, Eric Balkanski, Yaron Singer

Keywords Paper

Optimization - General

0

0

0

0

14:16

06/12/2020

Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization

Yan Yan, Yi Xu, Qihang Lin and
Wei Liu, Tianbao Yang

Keywords Paper

0

0

0

0

3:02

12/07/2020

Rate-distortion optimization guided autoencoder for isometric embedding in Euclidean latent space

Keizo Kato, Jing Zhou, Tomotake Sasaki, Akira Nakagawa

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

14:48

02/02/2021

Improved Penalty Method via Doubly Stochastic Gradients for Bilevel Hyperparameter Optimization

Wanli Shi, Bin Gu

Keywords Paper

0

0

0

0

14:47

06/12/2020

Robust Meta-learning for Mixed Linear Regression with Small Batches

Weihao Kong, Raghav Somani, Sham Kakade, Sewoong Oh

Keywords Paper

0

0

0

0

3:20

06/12/2020

GPU-Accelerated Primal Learning for Extremely Fast Large-Scale Classification

John Halloran, David M Rocke

Keywords Paper

0

0

0

0

3:33

06/12/2021

Iteratively Reweighted Least Squares for Basis Pursuit with Global Linear Convergence Rate

Christian Kümmerle, Claudio Mayrink Verdun, Dominik Stöger

Keywords Paper

theory, optimization, machine learning

0

0

0

0

14:17

12/07/2020

Stronger and Faster Wasserstein Adversarial Attacks

Kaiwen Wu, Allen Wang, Yaoliang Yu

Keywords Paper

Adversarial Examples

0

0

0

0

14:56

09/07/2020

A Greedy Anytime Algorithm for Sparse PCA

Dan Vilenchik, Adam Soffer, Guy Holtzman

Keywords Paper

Non-convex optimization, Combinatorial optimization, Computational complexity, High-dimensional statistics, Unsupervised and semi-supervised learning

0

0

0

0

15:31

06/12/2020

The Adaptive Complexity of Maximizing a Gross Substitutes Valuation

Ron Kupfer, Sharon Qian, Eric Balkanski, Yaron Singer

Keywords Paper

0

0

0

0

3:03

13/04/2021

Revisiting projection-free online learning: The strongly convex case

Ben Kretzu, Dan Garber

Keywords Paper

0

0

0

0

2:56

22/06/2020

Efficiently learning structured distributions from untrusted batches

Sitan Chen, Jerry Li, Ankur Moitra

Keywords Paper

sum-of-squares, federated learning, VC complexity, Robust statistics

0

0

0

0

24:38

22/06/2020

Algorithms for heavy-tailed statistics: Regression, covariance estimation, and beyond

Yeshwanth Cherapanamjeri, Samuel B. Hopkins, Tarun Kathuria and
Prasad Raghavendra, Nilesh Tripuraneni

Keywords Paper

Sum-of-squares, Algorithms, Heavy-Tailed Estimation

0

0

0

0

20:29

12/07/2020

Layered Sampling for Robust Optimization Problems

Hu Ding, Zixiu Wang

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

13:00

06/12/2020

Projection Robust Wasserstein Distance and Riemannian Optimization

Darren Lin, Chenyou Fan, Nhat Ho and
Marco Cuturi, Michael Jordan

Keywords Paper

Optimization -> Non-Convex Optimization; Optimization -> Stochastic Optimization, Deep Learning -> Optimization for Deep Networks

0

0

0

1

3:01

12/07/2020

Can Stochastic Zeroth-Order Frank-Wolfe Method Converge Faster for Non-Convex Problems?

Hongchang Gao, Heng Huang

Keywords Paper

Optimization - General

0

0

0

0

13:19

02/02/2021

Gradient Descent Averaging and Primal-dual Averaging for Strongly Convex Optimization

Wei Tao, Wei Li, Zhisong Pan, Qing Tao

Keywords Paper

0

0

0

0

15:10

12/07/2020

Estimating the Error of Randomized Newton Methods: A Bootstrap Approach

Miles Lopes, Jessie X.T. Chen

Keywords Paper

Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

0

0

0

0

13:22

06/12/2020

Escaping Saddle-Point Faster under Interpolation-like Conditions

Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra

Keywords Paper

0

0

0

0

3:19

13/04/2021

Self-concordant analysis of generalized linear bandits with forgetting

Yoan Russac, Louis Faury, Olivier Cappé, Aurélien Garivier

Keywords Paper

0

0

0

0

3:06

06/12/2021

Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks

Tolga Birdal, Aaron Lou, Leonidas Guibas, Umut Simsekli

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:38

18/07/2021

Unbalanced minibatch Optimal Transport; applications to Domain Adaptation

Kilian Fatras, Thibault Séjourné, Rémi Flamary, Nicolas Courty

Keywords Paper

Algorithms, Optimal Transport

0

0

0

2

4:57

17/08/2020

NASOQ: Numerically accurate sparsity-oriented QP solver

Kazem Cheshmi, Danny M. Kaufman, Shoaib Kamil, Maryam Mehri Dehnavi

Keywords Paper

indefinite factorization, numerical optimization, contact simulation, sparse row modification, mesh deformation, quadratic programming, sparse linear algebra

0

0

0

0

15:27

06/12/2020

Fast Epigraphical Projection-based Incremental Algorithms for Wasserstein Distributionally Robust Support Vector Machine

Jiajin Li, Caihua Chen, Anthony Man-Cho So

Keywords Paper

Algorithms -> Meta-Learning; Applications -> Object Recognition; Data, Challenges, Implementations, and Software -> Benchmarks;, Algorithms -> Multitask and Transfer Learning

0

0

0

0

3:02

06/12/2020

Stochastic Gradient Descent in Correlated Settings: A Study on Gaussian Processes

Hao Chen, Lili Zheng, Raed AL Kontar, Garvesh Raskutti

Keywords Paper

0

0

0

0

3:12

02/02/2021

Improving Causal Discovery By Optimal Bayesian Network Learning

Ni Y Lu, Kun Zhang, Changhe Yuan

Keywords Paper

0

0

0

0

15:12

12/07/2020

Train Big, Then Compress: Rethinking Model Size for Efficient Training and Inference of Transformers

Zhuohan Li, Eric Wallace, Sheng Shen and
Kevin Lin, Kurt Keutzer, Dan Klein, Joseph Gonzalez

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

15:21

09/07/2020

Approximation Schemes for ReLU Regression

Ilias Diakonikolas, Surbhi Goel, Sushrut Karmalkar and
Adam Klivans, Mahdi Soltanolkotabi

Keywords Paper

PAC learning, Approximation algorithms, Convex optimization, Neural networks/deep learning

0

0

0

0

15:20

12/07/2020

Error Estimation for Sketched SVD

Miles Lopes, N. Benjamin Erichson, Michael Mahoney

Keywords Paper

Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

0

0

0

0

15:29