A Constructive Prediction of the Generalization Error Across Scales

26/04/2020

A Constructive Prediction of the Generalization Error Across Scales

Jonathan S. Rosenfeld, Amir Rosenfeld, Yonatan Belinkov, Nir Shavit

Keywords: neural networks, deep learning, generalization error, scaling, scalability, vision, language

Abstract Paper Similar Papers

Abstract: The dependency of the generalization error of neural networks on model and dataset size is of critical importance both in practice and for understanding the theory of neural networks. Nevertheless, the functional form of this dependency remains elusive. In this work, we present a functional form which approximates well the generalization error in practice. Capitalizing on the successful concept of model scaling (e.g., width, depth), we are able to simultaneously construct such a form and specify the exact models which can attain it across model/data scales. Our construction follows insights obtained from observations conducted over a range of model/data scales, in various model types and datasets, in vision and language tasks. We show that the form both fits the observations well across scales, and provides accurate predictions from small- to large-scale models and data.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Analytic Insights into Structure and Rank of Neural Network Hessian Maps

Sidak Pal Singh, Gregor Bachmann, Thomas Hofmann

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

15:08

26/08/2020

Non-Parametric Calibration for Classification

Jonathan Wenger, Hedvig Kjellström, Rudolph Triebel )

Keywords Paper

0

0

0

0

15:29

03/05/2021

Do Wide and Deep Networks Learn the Same Things? Uncovering How Neural Network Representations Vary with Width and Depth

Thao Nguyen, Maithra Raghu, Simon Kornblith

Keywords Paper

Representation learning

0

0

0

0

4:45

03/05/2021

Estimating informativeness of samples with Smooth Unique Information

Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini and
Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

Keywords Paper

dataset summarization, ntk, stability theory, sample information, information theory

0

0

0

0

6:05

06/12/2021

Grounding Representation Similarity Through Statistical Testing

Frances Ding, Jean-Stanislas Denain, Jacob Steinhardt

Keywords Paper

deep learning, robustness, representation learning

0

0

0

0

9:02

03/08/2020

Prediction Intervals: Split Normal Mixture from Quality-Driven Deep Ensembles

Tárik S. Salem, Helge Langseth, Heri Ramampiaro

Keywords Paper

0

0

0

0

7:45

03/05/2021

Influence Functions in Deep Learning Are Fragile

Samyadeep Basu, Phil Pope, Soheil Feizi

Keywords Paper

Influence Functions, Interpretability

0

0

1

1

6:15

03/05/2021

Towards Robust Neural Networks via Close-loop Control

Zhuotong Chen, Qianxiao Li, Zheng Zhang

Keywords Paper

dynamical system, neural network robustness, optimal control

0

0

0

0

4:47

06/12/2020

Hold me tight! Influence of discriminative features on deep network boundaries

Guillermo Ortiz-Jimenez, Apostolos Modas, Seyed Moosavi-Dezfooli, Pascal Frossard

Keywords Paper

0

1

0

0

3:18

18/07/2021

Meta-Cal: Well-controlled Post-hoc Calibration by Ranking

Xingchen Ma, Matthew B Blaschko

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

4:28

14/06/2020

Dataless Model Selection With the Deep Frame Potential

Calvin Murdock, Simon Lucey

Keywords Paper

deep learning, sparse approximation theory, deep network architectures, model selection, sparsity, mutual coherence

0

0

0

1

5:00

06/12/2020

Improving model calibration with accuracy versus uncertainty optimization

Ranganath Krishnan, Omesh Tickoo

Keywords Paper

0

0

0

0

3:25

14/06/2020

Robust Design of Deep Neural Networks Against Adversarial Attacks Based on Lyapunov Theory

Arash Rahnama, Andre T. Nguyen, Edward Raff

Keywords Paper

adversarial machine learning, robustness, control theory, lyapunov theory, spectral norm regularization, stability and robustness analysis of dnns, dissipativity and passivity theory, adversarial attacks, learning theory, mathematical analysis of dnns

0

0

0

0

1:00

06/12/2021

Adjusting for Autocorrelated Errors in Neural Networks for Time Series

Fan-Keng Sun, Chris Lang, Duane Boning

Keywords Paper

deep learning

0

0

0

0

12:16

06/12/2021

Domain Adaptation with Invariant Representation Learning: What Transformations to Learn?

Petar Stojanov, Zijian Li, Mingming Gong and
Ruichu Cai, Jaime Carbonell, Kun Zhang

Keywords Paper

deep learning, machine learning, adversarial robustness and security, domain adaptation, representation learning, transfer learning

0

0

0

0

15:02

18/07/2021

On Linear Identifiability of Learned Representations

Geoffrey Roeder, Luke Metz, Durk Kingma

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

5:11

13/04/2021

Influence decompositions for neural network attribution

Kyle Reing, Greg Ver Steeg, Aram Galstyan

Keywords Paper

0

0

0

0

2:52

06/12/2021

Measuring Generalization with Optimal Transport

Ching-Yao Chuang, Youssef Mroueh, Kristjan Greenewald and
Antonio Torralba, Stefanie Jegelka

Keywords Paper

deep learning, optimal transport

0

0

1

1

14:47

06/12/2021

Adaptive wavelet distillation from neural networks through interpretations

Wooseok Ha, Chandan Singh, Francois Lanusse and
Srigokul Upadhyayula, Bin Yu

Keywords Paper

deep learning, interpretability

0

0

0

0

14:56

19/08/2021

RCA: A Deep Collaborative Autoencoder Approach for Anomaly Detection

Boyang Liu, Ding Wang, Kaixiang Lin and
Pang-Ning Tan, Jiayu Zhou

Keywords Paper

Data Mining, Anomaly/Outlier Detection, Unsupervised Learning

0

0

0

0

12:05

30/11/2020

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

Keywords Paper

0

0

0

0

10:07

12/07/2020

DeepMatch: Balancing Deep Covariate Representations for Causal Inference Using Adversarial Training

Nathan Kallus

Keywords Paper

Causality

0

0

0

0

15:38

13/04/2021

DebiNet: Debiasing linear models with nonlinear overparameterized neural networks

Shiyun Xu, Zhiqi Bu

Keywords Paper

0

0

0

0

2:56

06/12/2020

Posterior Re-calibration for Imbalanced Datasets

Junjiao Tian, Yen-Cheng Liu, Nathaniel Glaser and
Yen-Chang Hsu, Zsolt Kira

Keywords Paper

Algorithms -> Few-Shot Learning, Applications -> Computer Vision

0

0

0

0

3:23

06/12/2020

The Advantage of Conditional Meta-Learning for Biased Regularization and Fine Tuning

Giulia Denevi, Massimiliano Pontil, Carlo Ciliberto

Keywords Paper

0

0

0

0

3:24

03/05/2021

Enforcing robust control guarantees within neural network policies

Priya Donti, Melrose Roderick, Mahyar Fazlyab, Zico Kolter

Keywords Paper

reinforcement learning, differentiable optimization, robust control

0

0

0

1

5:09

06/12/2021

Explanation-based Data Augmentation for Image Classification

Sandareka Wickramanayake, Wynne Hsu, Mong Li Lee

Keywords Paper

deep learning, machine learning, vision, interpretability

0

0

0

0

14:23

26/04/2020

Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

Pu Zhao, Pin-Yu Chen, Payel Das and
Karthikeyan Natesan Ramamurthy, Xue Lin

Keywords Paper

mode connectivity, adversarial robustness, backdoor attack, error-injection attack, evasion attacks, loss landscapes

0

0

0

0

4:30

06/12/2020

Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks

Amir Rahimi, Amirreza Shaban, Ching-An Cheng and
Richard I Hartley, Byron Boots

Keywords Paper

0

0

0

0

3:10

12/07/2020

NADS: Neural Architecture Distribution Search for Uncertainty Awareness

Randy Ardywibowo, Shahin Boluki, Xinyu Gong and
Zhangyang Wang, Xiaoning Qian

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

7:16

06/12/2021

How does a Neural Network's Architecture Impact its Robustness to Noisy Labels?

Jingling Li, Mozhi Zhang, Keyulu Xu and
John P Dickerson, Jimmy Ba

Keywords Paper

deep learning, robustness, graph learning

0

0

0

0

15:07

06/12/2020

Almost Surely Stable Deep Dynamics

Nathan Lawrence, Philip Loewen, Michael Forbes and
Johan Backstrom, Bhushan Gopaluni

Keywords Paper

0

0

0

0

3:25

06/12/2020

Adaptation Properties Allow Identification of Optimized Neural Codes

Luke Rast, Jan Drugowitsch

Keywords Paper

0

0

0

0

3:17

03/05/2021

Trusted Multi-View Classification

Zongbo Han, Changqing Zhang, Huazhu FU, Joey T Zhou

Keywords Paper

Uncertainty Machine Learning, Multi-View Learning, Multi-Modal Learning

0

0

0

0

4:33

14/06/2020

Belief Propagation Reloaded: Learning BP-Layers for Labeling Problems

Patrick Knöbelreiter, Christian Sormann, Alexander Shekhovtsov and
Friedrich Fraundorfer, Thomas Pock

Keywords Paper

belief propagation, inference, conditional random fields, convolutional neural networks, deep learning, stereo, semantic segmentation, optical flow

0

0

0

0

1:01

06/12/2020

A Causal View on Robustness of Neural Networks

Cheng Zhang, Kun Zhang, Yingzhen Li

Keywords Paper

Data, Challenges, Implementations, and Software -> Virtual Environments; Deep Learning -> Memory-Augmented Neural Networks; Neu, Deep Learning

0

0

0

0

3:25

19/08/2021

Explaining Deep Neural Network Models with Adversarial Gradient Integration

Deng Pan, Xin Li, Dongxiao Zhu

Keywords Paper

Machine Learning, Adversarial Machine Learning, Explainable/Interpretable Machine Learning, Explainability

0

0

0

0

15:16

06/12/2021

LEADS: Learning Dynamical Systems that Generalize Across Environments

Yuan Yin, Ibrahim Ayed, Emmanuel de Bézenac and
Nicolas Baskiotis, Patrick Gallinari

Keywords Paper

theory, deep learning

0

0

0

0

11:07

26/04/2020

The Shape of Data: Intrinsic Distance for Data Distributions

Anton Tsitsulin, Marina Munkhoeva, Davide Mottin and
Panagiotis Karras, Alex Bronstein, Ivan Oseledets, Emmanuel Mueller

Keywords Paper

Deep Learning, Generative Models, Nonlinear Dimensionality Reduction, Manifold Learning, Similarity and Distance Learning, Spectral Methods

0

0

0

0

4:49

06/12/2021

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords Paper

deep learning, machine learning, robustness, vision, domain adaptation

0

0

0

0

8:21