Approximation Schemes for ReLU Regression

09/07/2020

Approximation Schemes for ReLU Regression

Ilias Diakonikolas, Surbhi Goel, Sushrut Karmalkar, Adam Klivans, Mahdi Soltanolkotabi

Keywords: PAC learning, Approximation algorithms, Convex optimization, Neural networks/deep learning

Abstract Paper Similar Papers

Abstract: We consider the fundamental problem of ReLU regression, where the goal is to output the best fitting ReLU with respect to square loss given access to draws from some unknown distribution. We give the first efficient, constant-factor approximation algorithm for this problem assuming the underlying distribution satisfies some weak concentration and anti-concentration conditions (and includes, for example, all log-concave distributions). This solves the main open problem of Goel et al., who proved hardness results for any exact algorithm for ReLU regression (up to an additive $\epsilon$). Using more sophisticated techniques, we can improve our results and obtain a polynomial-time approximation scheme for any subgaussian distribution. Given the aforementioned hardness results, these guarantees can not be substantially improved.\n \nOur main insight is a new characterization of {\em surrogate losses} for nonconvex activations. While prior work had established the existence of convex surrogates for monotone activations, we show that properties of the underlying distribution actually induce strong convexity for the loss, allowing us to relate the global minimum to the activation's {\em Chow parameters.}

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at COLT 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Non-convex Distributionally Robust Optimization: Non-asymptotic Analysis

Jikai Jin, Bohang Zhang, Haiyang Wang, Liwei Wang

Keywords Paper

optimization

0

0

0

0

14:05

06/12/2020

Stability of Stochastic Gradient Descent on Nonsmooth Convex Losses

Raef Bassily, Vitaly Feldman, Cristóbal Guzmán, Kunal Talwar

Keywords Paper

0

0

0

0

3:11

06/12/2021

Calibration and Consistency of Adversarial Surrogate Losses

Pranjal Awasthi, Natalie Frank, Anqi Mao and
Mehryar Mohri, Yutao Zhong

Keywords Paper

theory, optimization, machine learning, robustness, adversarial robustness and security

0

0

0

0

13:30

06/12/2020

Projection Robust Wasserstein Distance and Riemannian Optimization

Darren Lin, Chenyou Fan, Nhat Ho and
Marco Cuturi, Michael Jordan

Keywords Paper

Optimization -> Non-Convex Optimization; Optimization -> Stochastic Optimization, Deep Learning -> Optimization for Deep Networks

0

0

0

1

3:01

06/12/2021

High-probability Bounds for Non-Convex Stochastic Optimization with Heavy Tails

Ashok Cutkosky, Harsh Mehta

Keywords Paper

deep learning, optimization

0

0

0

0

20:14

06/12/2021

Consistent Estimation for PCA and Sparse Regression with Oblivious Outliers

Tommaso d'Orsi, Chih-Hung Liu, Rajai Nasser and
Gleb Novikov, David Steurer, Stefan Tiegel

Keywords Paper

optimization

0

0

0

0

10:44

06/12/2020

Large-Scale Methods for Distributionally Robust Optimization

Daniel Levy, Yair Carmon, John Duchi, Aaron Sidford

Keywords Paper

0

0

0

0

3:11

26/08/2020

Ordered SGD: A New Stochastic Optimization Framework for Empirical Risk Minimization

Kenji Kawaguchi, Haihao Lu

Keywords Paper

0

0

0

0

14:10

06/12/2020

A novel variational form of the Schatten-$p$ quasi-norm

Paris Giampouras, Rene Vidal, Athanasios Rontogiannis, Benjamin Haeffele

Keywords Paper

0

0

0

0

3:14

06/12/2020

DAGs with No Fears: A Closer Look at Continuous Optimization for Learning Bayesian Networks

Dennis Wei, Tian Gao, Yue Yu

Keywords Paper

0

0

0

0

3:23

06/12/2021

Small random initialization is akin to spectral learning: Optimization and generalization guarantees for overparameterized low-rank matrix reconstruction

Dominik Stöger, Mahdi Soltanolkotabi

Keywords Paper

optimization

0

0

0

0

14:11

18/07/2021

On Lower Bounds for Standard and Robust Gaussian Process Bandit Optimization

Xu Cai, Jonathan Scarlett

Keywords Paper

Applications, Natural Language Processing, Applications, Network Analysis, Reinforcement Learning and Planning, Bandits

0

0

0

0

4:19

18/07/2021

Distributionally Robust Optimization with Markovian Data

Mengmeng Li, Tobias Sutter, Daniel Kuhn

Keywords Paper

Optimization

0

0

0

0

5:18

09/07/2020

Calibrated Surrogate Losses for Adversarially Robust Classification

Han Bao, Clayton Scott, Masashi Sugiyama

Keywords Paper

Loss functions, Adversarial learning and robustness, Classification, Excess risk bounds and generalization error bounds, Supervised learning

0

0

0

0

14:21

06/12/2021

Sparse Quadratic Optimisation over the Stiefel Manifold with Application to Permutation Synchronisation

Florian Bernard, Daniel Cremers, Johan Thunberg

Keywords Paper

vision, graph learning, clustering

0

0

0

0

11:13

13/04/2021

A study of condition numbers for first-order optimization

Charles Guille-Escuret, Manuela Girotti, Baptiste Goujaud, Ioannis Mitliagkas

Keywords Paper

0

0

0

0

2:46

18/07/2021

Consistent regression when oblivious outliers overwhelm

Tommaso d'Orsi, Gleb Novikov, David Steurer

Keywords Paper

Theory, Game Theory and Computational Economics, Theory, Theory, Computational Complexity

0

0

0

0

4:42

06/12/2020

Projection Efficient Subgradient Method and Optimal Nonsmooth Frank-Wolfe Method

Kiran Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

Keywords Paper

0

0

0

0

3:23

13/04/2021

SONIA: A symmetric blockwise truncated optimization algorithm

Majid Jahani, MohammadReza Nazari, Rachael Tappenden and
Albert Berahas, Martin Takac

Keywords Paper

0

0

0

0

2:55

09/07/2020

Second-Order Information in Non-Convex Stochastic Optimization: Power and Limitations

Yossi Arjevani, Yair Carmon, John Duchi and
Dylan Foster, Ayush Sekhari, Karthik Sridharan

Keywords Paper

Non-convex optimization, Stochastic optimization

0

0

0

0

11:57

06/12/2021

Policy Optimization in Adversarial MDPs: Improved Exploration via Dilated Bonuses

Haipeng Luo, Chen-Yu Wei, Chung-Wei Lee

Keywords Paper

optimization, reinforcement learning and planning, bandits

0

0

0

0

15:17

12/07/2020

Black-Box Methods for Restoring Monotonicity

Evangelia Gergatsouli, Brendan Lucier, Christos Tzamos

Keywords Paper

Learning Theory

0

0

0

0

15:40

13/04/2021

A dynamical view on optimization algorithms of overparameterized neural networks

Zhiqi Bu, Shiyun Xu, Kan Chen

Keywords Paper

0

0

0

0

3:05

06/12/2021

Approximate optimization of convex functions with outlier noise

Anindya De, Sanjeev Khanna, Huan Li, MohammadHesam NikpeySalekde

Keywords Paper

optimization

0

0

0

0

14:45

06/12/2020

Least Squares Regression with Markovian Data: Fundamental Limits and Algorithms

Dheeraj Nagaraj, Xian Wu, Guy Bresler and
Prateek Jain, Praneeth Netrapalli

Keywords Paper

0

0

0

0

3:34

06/12/2020

An Asymptotically Optimal Primal-Dual Incremental Algorithm for Contextual Linear Bandits

Andrea Tirinzoni, Matteo Pirotta, Marcello Restelli, Alessandro Lazaric

Keywords Paper

0

0

0

0

3:13

06/12/2020

Monotone operator equilibrium networks

Ezra Winston, J. Zico Kolter

Keywords Paper

0

0

0

0

3:29

12/07/2020

Eliminating the Invariance on the Loss Landscape of Linear Autoencoders

Reza Oftadeh, Jiayi Shen, Zhangyang Wang, Dylan Shell

Keywords Paper

Deep Learning - Theory

0

0

0

0

15:10

06/12/2021

Rank Overspecified Robust Matrix Recovery: Subgradient Method and Exact Recovery

Lijun Ding, Liwei Jiang, Yudong Chen and
Qing Qu, Zhihui Zhu

Keywords Paper

0

0

0

0

14:02

06/12/2021

Oracle Complexity in Nonsmooth Nonconvex Optimization

Guy Kornowski, Ohad Shamir

Keywords Paper

theory, deep learning, optimization

0

0

0

0

18:30

26/08/2020

Linearly Convergent Frank-Wolfe without Line-Search

Fabian Pedregosa, Geoffrey Negiar, Armin Askari, Martin Jaggi

Keywords Paper

0

0

0

0

10:14

06/12/2021

Risk Bounds and Calibration for a Smart Predict-then-Optimize Method

Heyuan Liu, Paul Grigas

Keywords Paper

theory, optimization, machine learning

0

0

0

0

14:56

18/07/2021

Instance Specific Approximations for Submodular Maximization

Eric Balkanski, Sharon Qian, Yaron Singer

Keywords Paper

Optimization, Combinatorial Optimization

0

0

0

0

5:09

06/12/2020

Revisiting Frank-Wolfe for Polytopes: Strict Complementarity and Sparsity

Dan Garber

Keywords Paper

0

0

0

0

3:22

26/08/2020

On the optimality of kernels for high-dimensional clustering

Leena C Vankadara, Debarghya Ghoshdastidar

Keywords Paper

0

0

0

0

12:25

18/07/2021

Private Stochastic Convex Optimization: Optimal Rates in L1 Geometry

Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar

Keywords Paper

Deep Learning, Algorithms, Multitask and Transfer Learning; Algorithms, Online Learning, Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

17:27

06/12/2020

Provably Efficient Reinforcement Learning with Kernel and Neural Function Approximations

Zhuoran Yang, Chi Jin, Zhaoran Wang and
Mengdi Wang, Michael Jordan

Keywords Paper

0

0

0

0

3:42

04/08/2021

Functions with average smoothness: structure, algorithms, and learning

Yair Ashlagi, Lee-Ad Gottlieb, Aryeh Kontorovich

Keywords Paper

0

0

0

0

17:22

06/12/2021

Global Convergence of Gradient Descent for Asymmetric Low-Rank Matrix Factorization

Tian Ye, Simon Du

Keywords Paper

theory, optimization

0

0

0

0

14:51

02/02/2021

Improved Penalty Method via Doubly Stochastic Gradients for Bilevel Hyperparameter Optimization

Wanli Shi, Bin Gu

Keywords Paper

0

0

0

0

14:47