09/07/2020

Algorithms and SQ Lower Bounds for PAC Learning One-Hidden-Layer ReLU Networks

Ilias Diakonikolas, Daniel M Kane, Vasilis Kontonis, Nikos Zarifis

Keywords: PAC learning

Abstract: We study the problem of PAC learning one-hidden-layer ReLU networks with $k$ hidden units on $\mathbb{R}^d$ under Gaussian marginals in the presence of additive label noise. For the case of positive coefficients, we give the first polynomial-time algorithm for this learning problem for $k$ up to $\tilde{\Omega}(\sqrt{\log d})$. Previously, no polynomial-time algorithm was known, even for $k=3$. This answers an open question posed by~\cite{Kliv17}. Importantly, our algorithm does not require any assumptions about the rank of the weight matrix, and its complexity is independent of its condition number. On the negative side, for the more general task of PAC learning one-hidden-layer ReLU networks with positive or negative coefficients, we prove a Statistical Query lower bound of $d^{\Omega(k)}$. Thus, we provide a separation between the two classes in terms of efficient learnability. Our upper and lower bounds are general, extending to broader families of activation functions.
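The abstract defines the learning setup: labeled examples drawn under Gaussian marginals from a one-hidden-layer ReLU network with additive label noise, where the positive-coefficient case is the one admitting a polynomial-time algorithm. Below is a minimal sketch of that data-generating model; the function name, parameter ranges, and the choice of Gaussian label noise are illustrative assumptions, not part of the paper.

```python
import numpy as np

def relu(z):
    return np.maximum(z, 0.0)

def sample_one_hidden_layer_relu(n, d, k, noise_std=0.1, seed=None):
    """Draw n examples (x, y) from f(x) = sum_i a_i * ReLU(<w_i, x>) + noise,
    with x ~ N(0, I_d) (Gaussian marginals) and positive coefficients a_i
    (the regime of the paper's algorithmic result). Illustrative sketch only."""
    rng = np.random.default_rng(seed)
    a = rng.uniform(0.5, 1.5, size=k)        # positive combining coefficients a_i > 0
    W = rng.standard_normal((k, d))          # weight vectors; no rank/conditioning assumptions needed
    X = rng.standard_normal((n, d))          # Gaussian marginals on R^d
    y_clean = relu(X @ W.T) @ a              # network output f(x)
    y = y_clean + noise_std * rng.standard_normal(n)  # additive label noise (Gaussian here for illustration)
    return X, y

# Example usage: small k, as in the k = O~(sqrt(log d)) regime of the upper bound.
X, y = sample_one_hidden_layer_relu(n=1000, d=100, k=3, seed=0)
```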

The talk and the corresponding paper were published at the COLT 2020 virtual conference.

