Scaling Properties of Deep Residual Networks

18/07/2021

Alain-Sam Cohen, Rama Cont, Alain Rossier, Renyuan Xu

Keywords: Theory, Deep learning Theory

Abstract: Residual networks (ResNets) have displayed impressive results in pattern recognition and, recently, have garnered considerable theoretical interest due to a perceived link with neural ordinary differential equations (neural ODEs). This link relies on the convergence of network weights to a smooth function as the number of layers increases. We investigate the properties of weights trained by stochastic gradient descent and their scaling with network depth through detailed numerical experiments. We observe the existence of scaling regimes markedly different from those assumed in neural ODE literature. Depending on certain features of the network architecture, such as the smoothness of the activation function, one may obtain an alternative ODE limit, a stochastic differential equation or neither of these. These findings cast doubts on the validity of the neural ODE model as an adequate asymptotic description of deep ResNets and point to an alternative class of differential equations as a better description of the deep network limit.
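
As a rough illustration of the link the abstract refers to, the following sketch writes out the usual residual recursion and its ODE limit. The notation (hidden state $h_k$, residual map $f$, layer weights $\theta_k$, depth $L$) is an assumption made for this sketch and is not taken from this page or necessarily from the paper.

```latex
% Hypothetical notation: h_k = hidden state at layer k, f = residual map,
% \theta_k = weights of layer k, L = number of layers.
h_{k+1} = h_k + \frac{1}{L}\, f(h_k, \theta_k), \qquad k = 0, \dots, L-1.

% Assumption commonly made in the neural ODE literature:
% \theta_k = \theta(k/L) for a smooth function \theta on [0, 1].
% The recursion is then the explicit Euler scheme with step 1/L for
\frac{dH_t}{dt} = f\big(H_t, \theta(t)\big), \qquad H_0 = h_0,

% so that, under standard regularity conditions on f,
h_L \longrightarrow H_1 \quad \text{as } L \to \infty.
```

The abstract's point is that weights trained by stochastic gradient descent need not satisfy this smoothness assumption, in which case the deep-network limit may be a different ODE, a stochastic differential equation, or neither.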

The talk and the accompanying paper were published at the ICML 2021 virtual conference.

Similar Papers

26/04/2020

On Robustness of Neural Ordinary Differential Equations

Hanshu Yan, Jiawei Du, Vincent Tan, Jiashi Feng

Keywords: Neural ODE

5:09

12/07/2020

Interpolation between CNNs and ResNets

Zonghan Yang, Yang Liu, Chenglong Bao, Zuoqiang Shi

Keywords: Accountability, Transparency and Interpretability

14:04

06/12/2021

Grounding Representation Similarity Through Statistical Testing

Frances Ding, Jean-Stanislas Denain, Jacob Steinhardt

Keywords: deep learning, robustness, representation learning

9:02

12/07/2020

Implicit Euler Skip Connections: Enhancing Adversarial Robustness via Numerical Stability

Mingjie Li, Lingshen He, Zhouchen Lin

Keywords: Deep Learning - General

14:26

14/06/2020

Robust Design of Deep Neural Networks Against Adversarial Attacks Based on Lyapunov Theory

Arash Rahnama, Andre T. Nguyen, Edward Raff

Keywords: adversarial machine learning, robustness, control theory, Lyapunov theory, spectral norm regularization, stability and robustness analysis of DNNs, dissipativity and passivity theory, adversarial attacks, learning theory, mathematical analysis of DNNs

1:00

26/04/2020

Bridging Mode Connectivity in Loss Landscapes and Adversarial Robustness

Pu Zhao, Pin-Yu Chen, Payel Das, Karthikeyan Natesan Ramamurthy, Xue Lin

Keywords: mode connectivity, adversarial robustness, backdoor attack, error-injection attack, evasion attacks, loss landscapes

4:30

06/12/2021

Precise characterization of the prior predictive distribution of deep ReLU networks

Lorenzo Noci, Gregor Bachmann, Kevin Roth, Sebastian Nowozin, Thomas Hofmann

Keywords: deep learning

14:26

06/12/2020

SCOP: Scientific Control for Reliable Neural Network Pruning

Yehui Tang, Yunhe Wang, Yixing Xu, Dacheng Tao, Chunjing Xu, Chao Xu, Chang Xu

2:50

30/11/2020

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

10:07

03/05/2021

Estimating informativeness of samples with Smooth Unique Information

Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini, Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

Keywords: dataset summarization, NTK, stability theory, sample information, information theory

6:05

26/04/2020

Stable Rank Normalization for Improved Generalization in Neural Networks and GANs

Amartya Sanyal, Philip H. Torr, Puneet K. Dokania

Keywords: Generalization, regularization, empirical Lipschitz

5:25

03/05/2021

Neural Delay Differential Equations

Qunxi Zhu, Yao Guo, Wei Lin

Keywords: Delay differential equations, neural networks

4:57

03/05/2021

Deep Equals Shallow for ReLU Networks in Kernel Regimes

Alberto Bietti, Francis Bach

Keywords: approximation, neural tangent kernels, deep learning, kernels

5:27

06/12/2020

Adapting Neural Architectures Between Domains

Yanxi Li, Zhaohui Yang, Yunhe Wang, Chang Xu

3:20

18/07/2021

Towards Understanding Learning in Neural Networks with Linear Teachers

Roei Sarussi, Alon Brutzkus, Amir Globerson

Keywords: Probabilistic Methods, Theory, Probabilistic Methods, MCMC

5:22

06/12/2021

Stable Neural ODE with Lyapunov-Stable Equilibrium Points for Defending Against Adversarial Attacks

Qiyu Kang, Yang Song, Qinxu Ding, Wee Peng Tay

Keywords: deep learning, machine learning, adversarial robustness and security

14:21

06/12/2020

Differentiable Neural Architecture Search in Equivalent Space with Exploration Enhancement

Miao Zhang, Huiqi Li, Shirui Pan, Xiaojun Chang, Zongyuan Ge, Steven Su

3:22

18/07/2021

Classifying high-dimensional Gaussian mixtures: Where kernel methods fail and neural networks succeed

Maria Refinetti, Sebastian Goldt, Florent Krzakala, Lenka Zdeborova

Keywords: Theory, Models of Learning and Generalization

4:24

02/02/2021

Learning Interpretable Models for Coupled Networks Under Domain Constraints

Hongyuan You, Sikun Lin, Ambuj Singh

16:47

18/07/2021

Robust Learning for Data Poisoning Attacks

Yunjuan Wang, Poorya Mianjy, Raman Arora

Keywords: Deep Learning, Generative Models, Algorithms, Unsupervised Learning; Deep Learning, Adversarial Networks, Algorithms, Adversarial Examples

5:20

13/04/2021

Fast adaptation with linearized neural networks

Wesley Maddox, Shuai Tang, Pablo Moreno, Andrew Gordon Wilson, Andreas Damianou

3:13

26/04/2020

Beyond Linearization: On Quadratic and Higher-Order Approximation of Wide Neural Networks

Yu Bai, Jason D. Lee

Keywords: Neural Tangent Kernels, over-parametrized neural networks, deep learning theory

5:25

02/02/2021

Avoiding Kernel Fixed Points: Computing with ELU and GELU Infinite Networks

Russell Tsuchida, Tim Pearce, Chris van der Heide, Fred Roosta, Marcus Gallagher

13:47

09/07/2020

Kernel and Rich Regimes in Overparametrized Models

Blake E Woodworth, Suriya Gunasekar, Jason Lee, Edward Moroshko, Pedro Henrique Pamplona Savarese, Itay Golan, Daniel Soudry, Nathan Srebro

Keywords: Neural networks/deep learning

13:29

14/06/2020

How Does Noise Help Robustness? Explanation and Exploration under the Neural SDE Framework

Xuanqing Liu, Tesi Xiao, Si Si, Qin Cao, Sanjiv Kumar, Cho-Jui Hsieh

Keywords: adversarial, defense, neural ODE, neural SDE

4:59

06/12/2021

Node Dependent Local Smoothing for Scalable Graph Learning

Wentao Zhang, Mingyu Yang, Zeang Sheng, Yang Li, Wen Ouyang, Yangyu Tao, Zhi Yang, Bin Cui

Keywords: deep learning, machine learning, graph learning

4:18

14/06/2020

NeuralScale: Efficient Scaling of Neurons for Resource-Constrained Deep Neural Networks

Eugene Lee, Chen-Yi Lee

Keywords: neural architecture search, pruning, inductive bias, neural network, filters, neurons, optimization, hyperparameter selection, resource constraint, hardware

5:00

13/04/2021

A dynamical view on optimization algorithms of overparameterized neural networks

Zhiqi Bu, Shiyun Xu, Kan Chen

3:05

18/07/2021

Optimal Transport Kernels for Sequential and Parallel Neural Architecture Search

Vu Nguyen, Tam Le, Makoto Yamada, Michael A Osborne

Keywords: Algorithms, AutoML

5:22

26/04/2020

The Local Elasticity of Neural Networks

Hangfeng He, Weijie Su

5:34

06/12/2021

When Are Solutions Connected in Deep Networks?

Quynh Nguyen, Pierre Bréchet, Marco Mondelli

Keywords: theory, deep learning, optimization

14:44

06/12/2021

On the Role of Optimization in Double Descent: A Least Squares Study

Ilja Kuzborskij, Csaba Szepesvari, Omar Rivasplata, Amal Rannen-Triki, Razvan Pascanu

Keywords: theory, deep learning, optimization

13:48

18/07/2021

Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the Theoretical Perspectives

Da Xu, Chuanwei Ruan, Evren Korpeoglu, Sushant Kumar, Kannan Achan

Keywords: Algorithms, Algorithms, Structured Prediction, Algorithms, Collaborative Filtering

5:14

06/12/2021

Learning Transferable Adversarial Perturbations

Krishna kanth Nakka, Mathieu Salzmann

Keywords: deep learning, optimization, adversarial robustness and security

12:00

19/08/2021

Explaining Deep Neural Network Models with Adversarial Gradient Integration

Deng Pan, Xin Li, Dongxiao Zhu

Keywords: Machine Learning, Adversarial Machine Learning, Explainable/Interpretable Machine Learning, Explainability

15:16

12/07/2020

Efficient proximal mapping of the path-norm regularizer of shallow networks

Fabian Latorre, Paul Rolland, Shaul Nadav Hallak, Volkan Cevher

Keywords: Deep Learning - Algorithms

11:32

06/12/2020

Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks

Amir Rahimi, Amirreza Shaban, Ching-An Cheng, Richard I Hartley, Byron Boots

3:10

06/12/2021

Distributional Gradient Matching for Learning Uncertain Neural Dynamics Models

Lenart Treven, Philippe Wenk, Florian Dorfler, Andreas Krause

Keywords: deep learning, reinforcement learning and planning, kernel methods, active learning

14:46

26/08/2020

Understanding Generalization in Deep Learning via Tensor Methods

Jingling Li, Yanchao Sun, Jiahao Su, Taiji Suzuki, Furong Huang

11:35

06/12/2021

Periodic Activation Functions Induce Stationarity

Lassi Meronen, Martin Trapp, Arno Solin

Keywords: deep learning, kernel methods

12:24