Escape saddle points by a simple gradient-descent based algorithm

06/12/2021

Chenyi Zhang, Tongyang Li

Keywords: optimization

Abstract: Escaping saddle points is a central research topic in nonconvex optimization. In this paper, we propose a simple gradient-based algorithm such that for a smooth function $f\colon\mathbb{R}^n\to\mathbb{R}$, it outputs an $\epsilon$-approximate second-order stationary point in $\tilde{O}(\log n/\epsilon^{1.75})$ iterations. Compared to the previous state-of-the-art algorithms by Jin et al. with $\tilde{O}(\log^4 n/\epsilon^{2})$ or $\tilde{O}(\log^6 n/\epsilon^{1.75})$ iterations, our algorithm is polynomially better in terms of $\log n$ and matches their complexities in terms of $1/\epsilon$. For the stochastic setting, our algorithm outputs an $\epsilon$-approximate second-order stationary point in $\tilde{O}(\log^{2} n/\epsilon^{4})$ iterations. Technically, our main contribution is an idea of implementing a robust Hessian power method using only gradients, which can find negative curvature near saddle points and achieve the polynomial speedup in $\log n$ compared to the perturbed gradient descent methods. Finally, we also perform numerical experiments that support our results.
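As a rough illustration of the "Hessian power method using only gradients" idea mentioned in the abstract, the sketch below runs a power iteration on $I - \nabla^2 f(x)/\ell$, approximating each Hessian-vector product by a finite difference of gradients. It is a minimal sketch under stated assumptions, not the authors' algorithm: the toy function, the names `grad_f` and `negative_curvature_direction`, the smoothness bound `ell`, the probe radius `r`, and the iteration count are all hypothetical choices made for illustration.

```python
import numpy as np


def grad_f(x):
    """Gradient of the toy function f(x) = 0.5*x0^2 - 0.5*x1^2 + 0.25*x1^4.

    The origin is a saddle point: the Hessian there is diag(1, -1).
    (Toy example chosen for illustration; not taken from the paper.)
    """
    return np.array([x[0], -x[1] + x[1] ** 3])


def negative_curvature_direction(grad, x, ell, r=1e-3, iters=50, seed=0):
    """Estimate a direction of most negative curvature at x using gradients only.

    ell is an assumed upper bound on the Hessian's largest eigenvalue, so that
    the dominant eigenvector of (I - H/ell) is the eigenvector of H with the
    smallest eigenvalue.
    """
    rng = np.random.default_rng(seed)
    v = rng.standard_normal(x.shape)
    v /= np.linalg.norm(v)
    g0 = grad(x)
    for _ in range(iters):
        # Finite-difference Hessian-vector product: H v ~ (grad(x + r v) - grad(x)) / r
        hv = (grad(x + r * v) - g0) / r
        # One power-iteration step on (I - H/ell), followed by renormalization
        v = v - hv / ell
        v /= np.linalg.norm(v)
    # Rayleigh-quotient estimate of the curvature along v
    curvature = v @ (grad(x + r * v) - g0) / r
    return v, curvature


saddle = np.zeros(2)
v, curv = negative_curvature_direction(grad_f, saddle, ell=2.0)
print("direction:", v)
print("estimated curvature:", curv)
```

On this toy saddle the estimated direction aligns with the second coordinate axis and the estimated curvature is close to -1, so a step along either `v` or `-v` decreases the function value. The renormalization after every step keeps the probe point within distance `r` of `x`, which is what keeps the finite-difference approximation accurate.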

The talk and the respective paper are published at the NeurIPS 2021 virtual conference.

Similar Papers

18/07/2021

One-sided Frank-Wolfe algorithms for saddle problems

Vladimir Kolmogorov, Thomas Pock

Keywords: Optimization, Convex Optimization

5:07

26/08/2020

One Sample Stochastic Frank-Wolfe

Mingrui Zhang, Zebang Shen, Aryan Mokhtari, Hamed Hassani, Amin Karbasi

6:05

18/07/2021

PAGE: A Simple and Optimal Probabilistic Gradient Estimator for Nonconvex Optimization

Zhize Li, Hongyan Bao, Xiangliang Zhang, Peter Richtarik

Keywords: Optimization

11:53

09/07/2020

How to trap a gradient flow

Dan Mikulincer, Sebastien Bubeck

Keywords: Non-convex optimization

15:01

06/12/2020

Escaping Saddle-Point Faster under Interpolation-like Conditions

Abhishek Roy, Krishnakumar Balasubramanian, Saeed Ghadimi, Prasant Mohapatra


3:19

18/07/2021

Leveraging Non-uniformity in First-order Non-convex Optimization

Jincheng Mei, Yue Gao, Bo Dai, Csaba Szepesvari, Dale Schuurmans

Keywords: Theory, RL, Decisions and Control Theory

4:49

26/04/2020

Gradientless Descent: High-Dimensional Zeroth-Order Optimization

Daniel Golovin, John Karro, Greg Kochanski, Chansoo Lee, Xingyou Song, Qiuyi Zhang

Keywords: Zeroth Order Optimization

5:20

06/12/2020

Stochastic Recursive Gradient Descent Ascent for Stochastic Nonconvex-Strongly-Concave Minimax Problems

Luo Luo, Haishan Ye, Zhichao Huang, Tong Zhang


2:00

12/07/2020

Accelerated Stochastic Gradient-free and Projection-free Methods

Feihu Huang, Lue Tao, Songcan Chen

Keywords: Optimization - Non-convex

13:05

18/07/2021

Variance Reduction via Primal-Dual Accelerated Dual Averaging for Nonsmooth Convex Finite-Sums

Chaobing Song, Stephen Wright, Jelena Diakonikolas

Keywords: Optimization, Convex Optimization

18:04

26/08/2020

Stochastic Recursive Variance-Reduced Cubic Regularization Methods

Dongruo Zhou, Quanquan Gu


15:42

06/12/2020

Continuous Submodular Maximization: Beyond DR-Submodularity

Moran Feldman, Amin Karbasi


3:15

06/12/2021

An Online Riemannian PCA for Stochastic Canonical Correlation Analysis

Zihang Meng, Rudrasis Chakraborty, Vikas Singh

Keywords: optimization, fairness

14:14

06/12/2021

A Near-Optimal Algorithm for Stochastic Bilevel Optimization via Double-Momentum

Prashant Khanduri, Siliang Zeng, Mingyi Hong, Hoi-To Wai, Zhaoran Wang, Zhuoran Yang

Keywords: optimization

9:47

26/04/2020

Towards Better Understanding of Adaptive Gradient Algorithms in Generative Adversarial Nets

Mingrui Liu, Youssef Mroueh, Jerret Ross, Wei Zhang, Xiaodong Cui, Payel Das, Tianbao Yang

Keywords: Generative Adversarial Nets, Adaptive Gradient Algorithms

5:08

06/12/2021

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization

Tian Ye, Simon Du

Keywords: theory, optimization

14:51

09/07/2020

Learning Polynomials in Few Relevant Dimensions

Sitan Chen, Raghu Meka

Keywords: Regression, Convex optimization, High-dimensional statistics, Non-convex optimization

15:03

06/12/2020

Robustness Analysis of Non-Convex Stochastic Gradient Descent using Biased Expectations

Kevin Scaman, Cedric Malherbe


3:09

06/12/2021

PLUGIn: A simple algorithm for inverting generative models with recovery guarantees

Babhru Joshi, Xiaowei Li, Yaniv Plan, Ozgur Yilmaz

Keywords: deep learning, optimization, generative model

14:58

06/12/2021

Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis

Jikai Jin, Bohang Zhang, Haiyang Wang, Liwei Wang

Keywords: optimization

14:05

26/04/2020

Kernelized Wasserstein Natural Gradient

M Arbel, A Gretton, W Li, G Montufar

Keywords: kernel methods, natural gradient, information geometry, Wasserstein metric

4:56

06/12/2021

A first-order primal-dual method with adaptivity to local smoothness

Maria-Luiza Vladarean, Yura Malitsky, Volkan Cevher

Keywords: optimization

11:47

06/12/2021

High-probability Bounds for Non-Convex Stochastic Optimization with Heavy Tails

Ashok Cutkosky, Harsh Mehta

Keywords: deep learning, optimization

20:14

03/05/2021

Local Search Algorithms for Rank-Constrained Convex Optimization

Kyriakos Axiotis, Maxim Sviridenko

Keywords: matrix completion, rank-constrained convex optimization, low rank

4:59

09/07/2020

Near-Optimal Methods for Minimizing Star-Convex Functions and Beyond

Oliver Hinder, Aaron Sidford, Nimit S Sohoni

Keywords: Non-convex optimization

12:57

12/07/2020

Closing the convergence gap of SGD without replacement

Shashank Rajput, Anant Gupta, Dimitris Papailiopoulos

Keywords: Optimization - Convex

12:45

03/05/2021

Adaptive Extra-Gradient Methods for Min-Max Optimization and Games

Kimon Antonakopoulos, E. Belmega, Panayotis Mertikopoulos

Keywords: games, min-max optimization, regime agnostic methods, adaptive methods, mirror-prox

5:33

13/04/2021

Efficient methods for structured nonconvex-nonconcave min-max optimization

Jelena Diakonikolas, Constantinos Daskalakis, Michael Jordan


3:33

06/12/2021

Stochastic Bias-Reduced Gradient Methods

Hilal Asi, Yair Carmon, Arun Jambulapati, Yujia Jin, Aaron Sidford

Keywords: theory, optimization, privacy

11:42

12/07/2020

On Gradient Descent Ascent for Nonconvex-Concave Minimax Problems

Darren Lin, Chi Jin, Michael Jordan

Keywords: Optimization - Non-convex

15:14

26/04/2020

SNODE: Spectral Discretization of Neural ODEs for System Identification

Alessio Quaglino, Marco Gallieri, Jonathan Masci, Jan Koutník

Keywords: Recurrent neural networks, system identification, neural ODEs

5:00

12/07/2020

Sparse Convex Optimization via Adaptively Regularized Hard Thresholding

Kyriakos Axiotis, Maxim Sviridenko

Keywords: Optimization - General

13:44

06/12/2020

Sample Efficient Reinforcement Learning via Low-Rank Matrix Estimation

Devavrat Shah, Dogyoon Song, Zhi Xu, Yuzhe Yang


3:22

06/12/2021

Submodular + Concave

Siddharth Mitra, Moran Feldman, Amin Karbasi

Keywords: optimization

14:53

06/12/2021

The Benefits of Implicit Regularization from SGD in Least Squares Problems

Difan Zou, Jingfeng Wu, Vladimir Braverman, Quanquan Gu, Dean Foster, Sham Kakade

Keywords: optimization, machine learning

16:05

06/12/2021

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser, Gleb Novikov, David Steurer, Stefan Tiegel

Keywords: optimization

10:44

06/12/2021

A Comprehensively Tight Analysis of Gradient Descent for PCA

Zhiqiang Xu, Ping Li

Keywords: optimization

4:37

12/07/2020

Improving the Sample and Communication Complexity for Decentralized Non-Convex Optimization: Joint Gradient Estimation and Tracking

Haoran Sun, Songtao Lu, Mingyi Hong

Keywords: Optimization - Non-convex

13:56

06/12/2020

Deterministic Approximation for Submodular Maximization over a Matroid in Nearly Linear Time

Kai Han, Zongmai Cao, Shuang Cui, Benwei Wu

3:16

18/07/2021

Private Stochastic Convex Optimization: Optimal Rates in L1 Geometry

Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar

Keywords: Deep Learning, Algorithms, Multitask and Transfer Learning; Algorithms, Online Learning, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

17:27