Abstract:
We develop a framework for designing optimization methods that are optimal in terms of their average-case runtime. This yields a new class of methods that achieve acceleration through a model of the Hessian's expected spectral density. We derive explicit algorithms for the uniform, Marchenko-Pastur and exponential distributions. These methods are momentum-based gradient algorithms whose hyper-parameters can be estimated cheaply using only the norm and the trace of the Hessian, in stark contrast with classical accelerated methods such as Nesterov acceleration and Polyak momentum, which require knowledge of the Hessian's largest and smallest singular values. Empirical results on quadratic problems, logistic regression and neural networks show that the proposed methods always match, and in many cases significantly improve upon, classical accelerated methods.
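To make the hyper-parameter requirement concrete, the sketch below (Python, illustrative only) shows how the two Hessian statistics the abstract refers to, the trace and the operator norm, can be estimated for a quadratic objective using Hutchinson's trace estimator and power iteration, and how they would feed a generic momentum-style iteration. The coefficient choices (`step_size`, `momentum`) are placeholder heuristics for illustration, not the coefficient schedules derived in the paper.

```python
# Illustrative sketch (not the paper's algorithm): estimate the two Hessian
# statistics mentioned in the abstract -- trace and operator norm -- and plug
# them into a generic momentum-based gradient iteration on a quadratic.
import numpy as np

def hutchinson_trace(hvp, dim, n_samples=20, rng=None):
    """Estimate trace(H) using only Hessian-vector products."""
    rng = np.random.default_rng() if rng is None else rng
    estimate = 0.0
    for _ in range(n_samples):
        z = rng.choice([-1.0, 1.0], size=dim)   # Rademacher probe vector
        estimate += z @ hvp(z)
    return estimate / n_samples

def power_iteration_norm(hvp, dim, n_iters=50, rng=None):
    """Estimate the largest eigenvalue (operator norm) of H."""
    rng = np.random.default_rng() if rng is None else rng
    v = rng.standard_normal(dim)
    for _ in range(n_iters):
        v = hvp(v)
        v /= np.linalg.norm(v)
    return v @ hvp(v)

def momentum_gradient(grad, x0, step_size, momentum, n_iters=200):
    """Generic heavy-ball style iteration; coefficients are placeholders."""
    x, x_prev = x0.copy(), x0.copy()
    for _ in range(n_iters):
        x, x_prev = x - step_size * grad(x) + momentum * (x - x_prev), x
    return x

if __name__ == "__main__":
    rng = np.random.default_rng(0)
    A = rng.standard_normal((200, 100))
    H = A.T @ A / 200                          # Hessian of the quadratic
    b = rng.standard_normal(100)
    hvp = lambda v: H @ v                      # Hessian-vector product
    grad = lambda x: H @ x - b

    trace = hutchinson_trace(hvp, 100, rng=rng)
    lam_max = power_iteration_norm(hvp, 100, rng=rng)
    # Only these two spectral statistics are needed by the proposed methods;
    # the placeholder step size / momentum below are NOT the paper's formulas.
    x_hat = momentum_gradient(grad, np.zeros(100),
                              step_size=1.0 / lam_max, momentum=0.9)
    print(trace, lam_max, np.linalg.norm(grad(x_hat)))
```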