SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients

06/12/2021

SUPER-ADAM: Faster and Universal Framework of Adaptive Gradients

Feihu Huang, Junyi Li, Heng Huang

Keywords: deep learning, optimization, machine learning

Abstract Paper Similar Papers

Abstract: Adaptive gradient methods have shown excellent performances for solving many machine learning problems. Although multiple adaptive methods were recently studied, they mainly focus on either empirical or theoretical aspects and also only work for specific problems by using some specific adaptive learning rates. It is desired to design a universal framework for practical algorithms of adaptive gradients with theoretical guarantee to solve general problems. To fill this gap, we propose a faster and universal framework of adaptive gradients(\emph{i.e.}, SUPER-ADAM) by introducing a universal adaptive matrix that includes most existing adaptive gradient forms. Moreover, our framework can flexibly integrate the momentum and variance reduced techniques. In particular, our novel framework provides the convergence analysis support for adaptive gradient methods under the nonconvex setting. In theoretical analysis, we prove that our SUPER-ADAM algorithm can achieve the best known complexity of $\tilde{O}(\epsilon^{-3})$ for finding an $\epsilon$-stationary point of nonconvex optimization, which matches the lower bound for stochastic smooth nonconvex optimization. In numerical experiments, we employ various deep learning tasks to validate that our algorithm consistently outperforms the existing adaptive algorithms. Code is available at https://github.com/LIJUNYI95/SuperAdam

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Submodular Maximization subject to a Knapsack Constraint: Combinatorial Algorithms with Near-optimal Adaptive Complexity

Yorgos Amanatidis, Federico Fusco, Philip Lazos and
Stefano Leonardi, Alberto Marchetti-Spaccamela, Rebecca Reiffenhäuser

Keywords Paper

Optimization, Combinatorial Optimization

0

0

0

0

5:15

06/12/2021

Accelerating Quadratic Optimization with Reinforcement Learning

Jeffrey Ichnowski, Paras Jain, Bartolomeo Stellato and
Goran Banjac, Michael Luo, Francesco Borrelli, Joseph Gonzalez, Ion Stoica, Ken Goldberg

Keywords Paper

optimization, reinforcement learning and planning, machine learning

0

0

0

0

12:36

26/08/2020

On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms

Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

Keywords Paper

0

0

0

0

15:02

06/12/2020

A Catalyst Framework for Minimax Optimization

Junchi Yang, Siqi Zhang, Negar Kiyavash, Niao He

Keywords Paper

0

0

0

0

3:01

06/12/2020

Learning Feature Sparse Principal Subspace

Lai Tian, Feiping Nie, Rong Wang, Xuelong Li

Keywords Paper

0

0

0

0

3:13

06/12/2020

Differentiable Expected Hypervolume Improvement for Parallel Multi-Objective Bayesian Optimization

Sam Daulton, Max Balandat, Eytan Bakshy

Keywords Paper

0

0

0

0

3:20

18/07/2021

Bilevel Optimization: Convergence Analysis and Enhanced Design

Kaiyi Ji, Junjie Yang, Yingbin LIANG

Keywords Paper

Optimization, Non-Convex Optimization

0

0

0

0

5:02

12/07/2020

Multi-Task Learning with User Preferences: Gradient Descent with Controlled Ascent in Pareto Optimization

Debabrata Mahapatra, Vaibhav Rajan

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

15:35

12/07/2020

Efficient Continuous Pareto Exploration in Multi-Task Learning

Pingchuan Ma, Tao Du, Wojciech Matusik

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

14:56

12/07/2020

A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton

Risheng Liu, Pan Mu, Xiaoming Yuan and
Shangzhi Zeng, Jin Zhang

Keywords Paper

Optimization - Non-convex

0

0

0

0

13:51

12/07/2020

A simpler approach to accelerated optimization: iterative averaging meets optimism

Pooria Joulani, Anant Raj, András György, Csaba Szepesvari

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

1

1

16:17

12/07/2020

Conditional gradient methods for stochastically constrained convex minimization

Maria-Luiza Vladarean, Ahmet Alacaoglu, Ya-Ping Hsieh, Volkan Cevher

Keywords Paper

Optimization - Convex

0

0

0

0

14:50

19/08/2021

On the Convergence of Stochastic Compositional Gradient Descent Ascent Method

Hongchang Gao, Xiaoqian Wang, Lei Luo, Xinghua Shi

Keywords Paper

Machine Learning, Adversarial Machine Learning, Cost-Sensitive Learning

0

0

0

0

13:42

06/12/2021

On the Convergence of Prior-Guided Zeroth-Order Optimization Algorithms

Shuyu Cheng, Guoqiang Wu, Jun Zhu

Keywords Paper

optimization, reinforcement learning and planning, adversarial robustness and security

0

0

0

0

13:49

06/12/2021

Fast Training Method for Stochastic Compositional Optimization Problems

Hongchang Gao, Heng Huang

Keywords Paper

optimization, machine learning, meta learning

0

0

0

0

14:00

22/06/2020

Top-𝑘-convolution and the quest for near-linear output-sensitive subset sum

Karl Bringmann, Vasileios Nakos

Keywords Paper

Subset Sum, pseudopolynomial, output-sensitive, convolution, restricted sumset

0

0

0

0

25:48

06/12/2020

Hybrid Variance-Reduced SGD Algorithms For Minimax Problems with Nonconvex-Linear Function

Quoc Tran Dinh, Deyi Liu, Lam Nguyen

Keywords Paper

0

0

0

0

3:07

03/05/2021

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian, Gabriel Loaiza-Ganem, Harry Braviner and
Anthony Caterini, Jesse C Cresswell, Tong Li, Animesh Garg

Keywords Paper

reinforcement learning, goal reaching, Q-learning

0

0

0

0

4:49

22/06/2020

Positive semidefinite programming: Mixed, parallel, and width-independent

Arun Jambulapati, Yin Tat Lee, Jerry Li and
Swati Padmanabhan, Kevin Tian

Keywords Paper

semidefinite programming, approximation algorithm, mixed packing and covering, width-independent algorithm, parallel algorithm

0

0

0

0

18:12

30/11/2020

Large-Scale Cross-Domain Few-Shot Learning

Jiechao Guan, Manli Zhang, Zhiwu Lu

Keywords Paper

0

0

0

0

7:26

06/12/2020

Effective Dimension Adaptive Sketching Methods for Faster Regularized Least-Squares Optimization

Jonathan Lacotte, Mert Pilanci

Keywords Paper

0

0

0

0

3:17

12/07/2020

Learning What to Defer for Maximum Independent Sets

Sungsoo Ahn, Younggyo Seo, Jinwoo Shin

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:47

09/07/2020

Provably Efficient Reinforcement Learning with Linear Function Approximation

Chi Jin, Zhuoran Yang, Zhaoran Wang, Michael Jordan

Keywords Paper

Reinforcement learning,

0

0

0

0

13:04

06/12/2021

Faster Matchings via Learned Duals

Michael Dinitz, Sungjin Im, Thomas Lavastida and
Benjamin Moseley, Sergei Vassilvitskii

Keywords Paper

theory, optimization

0

0

0

0

20:11

26/08/2020

'Bring Your Own Greedy'+Max: Near-Optimal 1/2-Approximations for Submodular Knapsack

Grigory Yaroslavtsev, Samson Zhou, Dmitrii Avdiukhin

Keywords Paper

0

0

0

0

13:14

26/04/2020

Sign-OPT: A Query-Efficient Hard-label Adversarial Attack

Minhao Cheng, Simranjit Singh, Patrick H. Chen and
Pin-Yu Chen, Sijia Liu, Cho-Jui Hsieh

Keywords Paper

0

0

0

0

4:56

06/12/2020

Fast Epigraphical Projection-based Incremental Algorithms for Wasserstein Distributionally Robust Support Vector Machine

Jiajin Li, Caihua Chen, Anthony Man-Cho So

Keywords Paper

Algorithms -> Meta-Learning; Applications -> Object Recognition; Data, Challenges, Implementations, and Software -> Benchmarks;, Algorithms -> Multitask and Transfer Learning

0

0

0

0

3:02

18/07/2021

Implicit rate-constrained optimization of non-decomposable objectives

Abhishek Kumar, Harikrishna Narasimhan, Andrew Cotter

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

3:48

06/12/2021

Taming Communication and Sample Complexities in Decentralized Policy Evaluation for Cooperative Multi-Agent Reinforcement Learning

Xin Zhang, Zhuqing Liu, Jia Liu and
Zhengyuan Zhu, Songtao Lu

Keywords Paper

theory, optimization, reinforcement learning and planning

0

0

0

0

14:54

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

30/11/2020

Progressive Batching for Efficient Non-linear Least Squares

Huu Le, Christopher Zach, Edward Rosten, Oliver J. Woodford

Keywords Paper

0

0

0

0

8:23

02/02/2021

Theoretical Analyses of Multi-Objective Evolutionary Algorithms on Multi-Modal Objectives

Benjamin Doerr, Weijie Zheng

Keywords Paper

0

0

0

0

17:30

06/12/2020

Optimal Epoch Stochastic Gradient Descent Ascent Methods for Min-Max Optimization

Yan Yan, Yi Xu, Qihang Lin and
Wei Liu, Tianbao Yang

Keywords Paper

0

0

0

0

3:02

06/12/2020

Boosting First-Order Methods by Shifting Objective: New Schemes with Faster Worst-Case Rates

Kaiwen Zhou, Anthony Man-Cho So, James Cheng

Keywords Paper

0

0

0

0

3:16

18/07/2021

Global Optimality Beyond Two Layers: Training Deep ReLU Networks via Convex Programs

Tolga Ergen, Mert Pilanci

Keywords Paper

Optimization, Convex Optimization

0

0

0

0

5:40

06/12/2020

Global Convergence and Variance Reduction for a Class of Nonconvex-Nonconcave Minimax Problems

Junchi Yang, Negar Kiyavash, Niao He

Keywords Paper

0

0

0

0

3:07

06/12/2021

Nonsmooth Implicit Differentiation for Machine-Learning and Optimization

Jérôme Bolte, Tam Le, Edouard Pauwels, Tony Silveti-Falls

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

12:32

03/05/2021

Few-Shot Bayesian Optimization with Deep Kernel Surrogates

Martin Wistuba, Josif Grabocka

Keywords Paper

automl, bayesian optimization, metalearning, few-shot learning

0

0

0

0

5:18

18/07/2021

Bilinear Classes: A Structural Framework for Provable Generalization in RL

Simon Du, Sham Kakade, Jason Lee and
Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

17:40

12/07/2020

Why Are Learned Indexes So Effective?

Paolo Ferragina, Fabrizio Lillo, Giorgio Vinciguerra

Keywords Paper

Applications - Other

0

0

0

0

13:22