Quantifying the Benefit of Using Differentiable Learning over Tangent Kernels

18/07/2021

Eran Malach, Pritish Kamath, Emmanuel Abbe, Nati Srebro

Keywords: Theory, Deep learning Theory

Abstract: We study the relative power of learning with gradient descent on differentiable models, such as neural networks, versus using the corresponding tangent kernels. We show that under certain conditions, gradient descent achieves small error only if a related tangent kernel method achieves a non-trivial advantage over random guessing (a.k.a. weak learning), though this advantage might be very small even when gradient descent can achieve arbitrarily high accuracy. Complementing this, we show that without these conditions, gradient descent can in fact learn with small error even when no kernel method, in particular using the tangent kernel, can achieve a non-trivial advantage over random guessing.
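To make the comparison in the abstract concrete: the tangent kernel of a differentiable model f(x; θ) at parameters θ is commonly defined as K(x, x') = ⟨∇_θ f(x; θ), ∇_θ f(x'; θ)⟩. The sketch below is not taken from the paper. It assumes a small two-layer tanh network, and the width, initialization, and all function names are illustrative choices.

```python
import math
import random


def init_params(d, width, seed=0):
    """Randomly initialize a two-layer network (illustrative assumption)."""
    rng = random.Random(seed)
    W = [[rng.gauss(0, 1 / math.sqrt(d)) for _ in range(d)] for _ in range(width)]
    a = [rng.gauss(0, 1 / math.sqrt(width)) for _ in range(width)]
    return W, a


def model(params, x):
    """f(x; W, a) = sum_j a_j * tanh(<w_j, x>)."""
    W, a = params
    return sum(
        a_j * math.tanh(sum(w * xi for w, xi in zip(w_j, x)))
        for w_j, a_j in zip(W, a)
    )


def param_gradient(params, x):
    """Gradient of f with respect to all parameters, flattened as [W..., a...]."""
    W, a = params
    hidden = [math.tanh(sum(w * xi for w, xi in zip(w_j, x))) for w_j in W]
    grad = []
    for h, a_j in zip(hidden, a):
        # d f / d w_j = a_j * (1 - tanh^2(<w_j, x>)) * x
        grad.extend(a_j * (1.0 - h * h) * xi for xi in x)
    # d f / d a_j = tanh(<w_j, x>)
    grad.extend(hidden)
    return grad


def tangent_kernel(params, x1, x2):
    """K(x1, x2) = <grad_theta f(x1), grad_theta f(x2)> at the given parameters."""
    g1 = param_gradient(params, x1)
    g2 = param_gradient(params, x2)
    return sum(u * v for u, v in zip(g1, g2))


if __name__ == "__main__":
    params = init_params(d=3, width=4)
    xs = [[1.0, 0.0, -1.0], [0.5, 2.0, 0.0]]
    # Gram matrix of the tangent kernel at initialization.
    gram = [[tangent_kernel(params, u, v) for v in xs] for u in xs]
    for row in gram:
        print(row)
```

A tangent kernel method fits a linear predictor on the fixed features ∇_θ f(x; θ) computed at initialization, whereas gradient descent on the model itself keeps changing θ and therefore these features during training. That difference is what the abstract's comparison is about.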

The talk and the respective paper were published at the ICML 2021 virtual conference.

Similar Papers

06/12/2020

The Implications of Local Correlation on Learning Some Deep Functions

Eran Malach, Shai Shalev-Shwartz

3:07

13/04/2021

Exponential convergence rates of classification errors on learning with SGD and random features

Shingo Yashima, Atsushi Nitanda, Taiji Suzuki

2:58

06/12/2021

Statistically and Computationally Efficient Linear Meta-representation Learning

Kiran Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

Keywords: optimization, meta learning, representation learning, few shot learning

12:56

03/05/2021

Benefit of deep learning with non-convex noisy gradient descent: Provable excess risk bound and superiority to kernel methods

Taiji Suzuki, Shunta Akiyama

Keywords: local Rademacher complexity, minimax optimal rate, Excess risk, linear estimator, kernel method, fast learning rate

10:13

06/12/2021

Local Signal Adaptivity: Provable Feature Learning in Neural Networks Beyond Kernels

Stefani Karp, Ezra Winston, Yuanzhi Li, Aarti Singh

Keywords: theory, deep learning, optimization, machine learning, vision, kernel methods

13:22

06/12/2020

Learning Parities with Neural Networks

Amit Daniely, Eran Malach

3:21

12/07/2020

Privately Learning Markov Random Fields

Gautam Kamath, Janardhan Kulkarni, Steven Wu, Huanyu Zhang

Keywords: Privacy-preserving Statistics and Machine Learning

15:16

12/07/2020

Revisiting Spatial Invariance with Low-Rank Local Connectivity

Gamaleldin Elsayed, Prajit Ramachandran, Jon Shlens, Simon Kornblith

Keywords: Deep Learning - General

14:48

26/04/2020

Maxmin Q-learning: Controlling the Estimation Bias of Q-learning

Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White

Keywords: reinforcement learning, bias and variance reduction

4:27

06/12/2021

Learning where to learn: Gradient sparsity in meta and continual learning

Johannes von Oswald, Dominic Zhao, Seijin Kobayashi, Simon Schug, Massimo Caccia, Nicolas Zucchet, João Sacramento

Keywords: deep learning, optimization, meta learning, continual learning, few shot learning

12:20

04/08/2021

The Connection Between Approximation, Depth Separation and Learnability in Neural Networks

Eran Malach, Gilad Yehudai, Shai Shalev-Shwartz, Ohad Shamir

16:50

12/07/2020

Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords: Deep Learning - General

15:02

06/12/2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations

Minshuo Chen, Yu Bai, Jason Lee, Tuo Zhao, Huan Wang, Caiming Xiong, Richard Socher

3:07

03/05/2021

Exploring the Uncertainty Properties of Neural Networks’ Implicit Priors in the Infinite-Width Limit

Ben Adlam, Jaehoon Lee, Lechao Xiao, Jeffrey Pennington, Jasper Snoek

Keywords: Deep Learning, Bayesian Neural Networks, Neural Network Gaussian Process, Infinite-Width Limit, Uncertainty, Gaussian Process

4:34

20/07/2020

Exact asymptotics for phase retrieval and compressed sensing with random generative priors

Benjamin Aubin, Bruno Loureiro, Antoine Baker, Florent Krzakala, Lenka Zdeborová

16:19

06/12/2020

Why Do Deep Residual Networks Generalize Better than Deep Feedforward Networks? --- A Neural Tangent Kernel Perspective

Kaixuan Huang, Yuqing Wang, Molei Tao, Tuo Zhao

Keywords: Algorithms -> Uncertainty Estimation; Theory -> Frequentist Statistics; Theory -> Large Deviations and Asymptotic Analysis; The, Algorithms -> Kernel Methods

2:59

06/12/2020

The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks

Wei Hu, Lechao Xiao, Ben Adlam, Jeffrey Pennington

3:20

14/06/2020

Deep Learning for Handling Kernel/model Uncertainty in Image Deconvolution

Yuesong Nan, Hui Ji

Keywords: image deblurring, robust deblurring, error-in-variable model, deep learning, blur kernel correction, image restoration, image processing, low level vision

1:01

06/12/2021

An Analysis of Constant Step Size SGD in the Non-convex Regime: Asymptotic Normality and Bias

Lu Yu, Krishnakumar Balasubramanian, Stanislav Volgushev, Murat Erdogdu

Keywords: optimization, machine learning

10:21

12/07/2020

Low Bias Low Variance Gradient Estimates for Hierarchical Boolean Stochastic Networks

Adeel Pervez, Taco Cohen, Efstratios Gavves

Keywords: Deep Learning - Generative Models and Autoencoders

14:28

06/12/2020

Efficient Variational Inference for Sparse Deep Learning with Theoretical Guarantee

Jincheng Bai, Qifan Song, Guang Cheng

3:11

13/04/2021

Learning the truth from only one side of the story

Heinrich Jiang, Qijia Jiang, Aldo Pacchiano

2:54

06/12/2020

Learning with Optimized Random Features: Exponential Speedup by Quantum Machine Learning without Sparsity and Low-Rank Assumptions

Hayata Yamasaki, Sathyawageeswar Subramanian, Sho Sonoda, Masato Koashi

3:15

26/08/2020

Conditional Linear Regression

Diego Calderon, Brendan Juba, Sirui Li, Zongyi Li, Lisa Ruan

14:31

18/07/2021

Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed

Maria Refinetti, Sebastian Goldt, Florent Krzakala, Lenka Zdeborová

Keywords: Theory, Models of Learning and Generalization

4:24

18/07/2021

Toward Better Generalization Bounds with Locally Elastic Stability

Zhun Deng, Hangfeng He, Weijie Su

Keywords: Theory, Computational Learning Theory

4:59

02/02/2021

Longitudinal Deep Kernel Gaussian Process Regression

Junjie Liang, Yanting Wu, Dongkuan Xu, Vasant G Honavar

16:27

13/04/2021

Learning with gradient descent and weakly convex losses

Dominic Richards, Mike Rabbat

3:20

04/08/2021

From Local Pseudorandom Generators to Hardness of Learning

Amit Daniely, Gal Vardi

15:46

03/05/2021

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Colin Wei, Kendrick Shen, Yining Chen, Tengyu Ma

Keywords: deep learning theory, semi-supervised learning theory, unsupervised learning theory, domain adaptation theory

14:46

18/07/2021

Generative Particle Variational Inference via Estimation of Functional Gradients

Neale Ratzlaff, Jerry Bai, Fuxin Li, Wei Xu

Keywords: Deep Learning, Bayesian Deep Learning

5:11

06/12/2021

On the Power of Differentiable Learning versus PAC and SQ Learning

Emmanuel Abbe, Pritish Kamath, Eran Malach, Colin Sandon, Nathan Srebro

Keywords: theory, deep learning, optimization

14:57

13/04/2021

On the generalization properties of adversarial training

Yue Xing, Qifan Song, Guang Cheng

3:05

12/07/2020

Double Reinforcement Learning for Efficient and Robust Off-Policy Evaluation

Nathan Kallus, Masatoshi Uehara

Keywords: Reinforcement Learning - Theory

13:46

14/09/2020

Effective Version Space Reduction for Convolutional Neural Networks

Jiayu Liu, Ioannis Chiotellis, Rudolph Triebel, Daniel Cremers

Keywords: active learning, deep learning, version space, diameter reduction

14:45

26/04/2020

Sampling-Free Learning of Bayesian Quantized Neural Networks

Jiahao Su, Milan Cvitkovic, Furong Huang

Keywords: Bayesian neural networks, Quantized neural networks

4:45

19/08/2021

Towards Understanding the Spectral Bias of Deep Learning

Yuan Cao, Zhiying Fang, Yue Wu, Ding-Xuan Zhou, Quanquan Gu

Keywords: Machine Learning, Deep Learning, Kernel Methods

14:42

06/12/2021

Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble

Gaon An, Seungyong Moon, Jang-Hyun Kim, Hyun Oh Song

Keywords: deep learning, reinforcement learning and planning

13:50

06/12/2020

Understanding Double Descent Requires A Fine-Grained Bias-Variance Decomposition

Ben Adlam, Jeffrey Pennington

3:30

18/07/2021

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization

Zeke Xie, Li Yuan, Zhanxing Zhu, Masashi Sugiyama

Keywords: Optimization, Stochastic Optimization

5:17