Precise characterization of the prior predictive distribution of deep ReLU networks

06/12/2021

Precise characterization of the prior predictive distribution of deep ReLU networks

Lorenzo Noci, Gregor Bachmann, Kevin Roth, Sebastian Nowozin, Thomas Hofmann

Keywords: deep learning

Abstract Paper Similar Papers

Abstract: Recent works on Bayesian neural networks (BNNs) have highlighted the need to better understand the implications of using Gaussian priors in combination with the compositional structure of the network architecture. Similar in spirit to the kind of analysis that has been developed to devise better initialization schemes for neural networks (cf. He- or Xavier initialization), we derive a precise characterization of the prior predictive distribution of finite-width ReLU networks with Gaussian weights.While theoretical results have been obtained for their heavy-tailedness,the full characterization of the prior predictive distribution (i.e. its density, CDF and moments), remained unknown prior to this work. Our analysis, based on the Meijer-G function, allows us to quantify the influence of architectural choices such as the width or depth of the network on the resulting shape of the prior predictive distribution. We also formally connect our results to previous work in the infinite width setting, demonstrating that the moments of the distribution converge to those of a normal log-normal mixture in the infinite depth limit. Finally, our results provide valuable guidance on prior design: for instance, controlling the predictive variance with depth- and width-informed priors on the weights of the network.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Scaling Properties of Deep Residual Networks

Alain-Sam Cohen, Rama Cont, Alain Rossier, Renyuan Xu

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

5:20

12/07/2020

Approximation Capabilities of Neural ODEs and Invertible Residual Networks

Han Zhang, Xi Gao, Jacob Unterman, Tomasz Arodz

Keywords Paper

Learning Theory

0

0

0

0

15:10

14/06/2020

Dataless Model Selection With the Deep Frame Potential

Calvin Murdock, Simon Lucey

Keywords Paper

deep learning, sparse approximation theory, deep network architectures, model selection, sparsity, mutual coherence

0

0

0

1

5:00

26/08/2020

Understanding Generalization in Deep Learning via Tensor Methods

Jingling Li, Yanchao Sun, Jiahao Su and
Taiji Suzuki, Furong Huang

Keywords Paper

0

0

0

0

11:35

14/06/2020

GP-NAS: Gaussian Process Based Neural Architecture Search

Zhihang Li, Teng Xi, Jiankang Deng and
Gang Zhang, Shengzhao Wen, Ran He

Keywords Paper

neural architecture search, gaussian process, image classification, face recognition

0

0

0

0

0:59

18/07/2021

Towards Understanding Learning in Neural Networks with Linear Teachers

Roei Sarussi, Alon Brutzkus, Amir Globerson

Keywords Paper

Probabilistic Methods, Theory, Probabilistic Methods, MCMC

0

0

0

0

5:22

06/12/2021

Parametric Complexity Bounds for Approximating PDEs with Neural Networks

Tanya Marwah, Zachary Lipton, Andrej Risteski

Keywords Paper

theory, deep learning, optimization

0

0

0

0

12:32

03/05/2021

Neural Delay Differential Equations

Qunxi Zhu, Yao Guo, Wei Lin

Keywords Paper

Delay differential equations, neural networks

0

0

0

0

4:57

06/12/2020

Learning Global Transparent Models consistent with Local Contrastive Explanations

Tejaswini Pedapati, Avinash Balakrishnan, Karthikeyan Shanmugam, Amit Dhurandhar

Keywords Paper

0

0

0

0

3:20

18/07/2021

On the Implicit Bias of Initialization Shape: Beyond Infinitesimal Mirror Descent

Shahar Azulay, Edward Moroshko, Mor Shpigel Nacson and
Blake Woodworth, Nati Srebro, Amir Globerson, Daniel Soudry

Keywords Paper

, Probabilistic Methods, MCMC, Theory, Deep learning Theory

0

0

0

0

15:38

06/12/2020

Non-Euclidean Universal Approximation

Anastasis Kratsios, Eugene Bilokopytov

Keywords Paper

0

0

0

0

3:34

18/07/2021

Robust Learning for Data Poisoning Attacks

Yunjuan Wang, Poorya Mianjy, Raman Arora

Keywords Paper

Deep Learning, Generative Models, Algorithms, Unsupervised Learning; Deep Learning, Adversarial Networks, Algorithms, Adversarial Examples

0

0

0

0

5:20

12/07/2020

Training Linear Neural Networks: Non-Local Convergence and Complexity Results

Armin Eftekhari

Keywords Paper

Deep Learning - General

0

0

0

0

14:35

02/02/2021

Liquid Time-constant Networks

Ramin Hasani, Mathias Lechner, Alexander Amini and
Daniela Rus, Radu Grosu

Keywords Paper

0

0

0

0

16:02

06/12/2021

Exact marginal prior distributions of finite Bayesian neural networks

Jacob Zavatone-Veth, Cengiz Pehlevan

Keywords Paper

deep learning

0

0

0

0

14:42

03/05/2021

Estimating informativeness of samples with Smooth Unique Information

Hrayr Harutyunyan, Alessandro Achille, Giovanni Paolini and
Orchid Majumder, Avinash Ravichandran, Rahul Bhotika, Stefano Soatto

Keywords Paper

dataset summarization, ntk, stability theory, sample information, information theory

0

0

0

0

6:05

06/12/2020

Gaussian Gated Linear Networks

David Budden, Adam Marblestone, Eren Sezener and
Tor Lattimore, Greg Wayne, Joel Veness

Keywords Paper

0

0

0

0

3:28

26/04/2020

The Local Elasticity of Neural Networks

Hangfeng He, Weijie Su

Keywords Paper

0

0

0

0

5:34

06/12/2021

Asymptotics of representation learning in finite Bayesian neural networks

Jacob Zavatone-Veth, Abdulkadir Canatar, Ben Ruben, Cengiz Pehlevan

Keywords Paper

deep learning, representation learning

0

0

0

0

14:09

06/12/2020

Adapting Neural Architectures Between Domains

Yanxi Li, Zhaohui Yang, Yunhe Wang, Chang Xu

Keywords Paper

0

0

0

0

3:20

12/07/2020

Information-Theoretic Local Minima Characterization and Regularization

Zhiwei Jia, Hao Su

Keywords Paper

Deep Learning - Theory

0

0

0

0

14:11

26/08/2020

On Generalization Bounds of a Family of Recurrent Neural Networks

Minshuo Chen, Xingguo Li, Tuo Zhao

Keywords Paper

0

0

0

0

13:31

19/10/2020

NASE: Learning knowledge graph embedding for link prediction via neural architecture search

Xiaoyu Kou, Bingfeng Luo, Huang Hu, Yan Zhang

Keywords Paper

kg embedding, neural architecture search, knowledge graph

0

0

0

0

6:29

18/07/2021

Amortized Conditional Normalized Maximum Likelihood: Reliable Out of Distribution Uncertainty Estimation

Aurick Zhou, Sergey Levine

Keywords Paper

Deep Learning, Bayesian Deep Learning

0

0

0

0

5:05

09/07/2020

Kernel and Rich Regimes in Overparametrized Models

Blake E Woodworth, Suriya Gunasekar, Jason Lee and
Edward Moroshko, Pedro Henrique Pamplona Savarese, Itay Golan, Daniel Soudry, Nathan Srebro

Keywords Paper

Neural networks/deep learning,

0

0

0

0

13:29

13/04/2021

DebiNet: Debiasing linear models with nonlinear overparameterized neural networks

Shiyun Xu, Zhiqi Bu

Keywords Paper

0

0

0

0

2:56

14/06/2020

Improving One-Shot NAS by Suppressing the Posterior Fading

Xiang Li, Chen Lin, Chuming Li and
Ming Sun, Wei Wu, Junjie Yan, Wanli Ouyang

Keywords Paper

neural architecture search automl classification imagenet bayesian

0

0

0

0

0:58

06/12/2021

Non-asymptotic Error Bounds for Bidirectional GANs

Shiao Liu, Yunfei Yang, Jian Huang and
Yuling Jiao, Yang Wang

Keywords Paper

deep learning, generative model

0

0

0

0

13:23

06/12/2020

Intra Order-preserving Functions for Calibration of Multi-Class Neural Networks

Amir Rahimi, Amirreza Shaban, Ching-An Cheng and
Richard I Hartley, Byron Boots

Keywords Paper

0

0

0

0

3:10

30/11/2020

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

Keywords Paper

0

0

0

0

10:07

03/08/2020

An Interpretable and Sample Efficient Deep Kernel for Gaussian Process

Yijue Dai, Tianjian Zhang, Zhidi Lin and
Feng Yin, Sergios Theodoridis, Shuguang Cui

Keywords Paper

0

0

0

0

8:31

06/12/2020

Distributional Robustness with IPMs and links to Regularization and GANs

Hisham Husain

Keywords Paper

0

0

0

0

3:12

26/04/2020

On Universal Equivariant Set Networks

Nimrod Segol, Yaron Lipman

Keywords Paper

deep learning, universality, set functions, equivariance

0

0

0

0

5:02

14/06/2020

Robust Design of Deep Neural Networks Against Adversarial Attacks Based on Lyapunov Theory

Arash Rahnama, Andre T. Nguyen, Edward Raff

Keywords Paper

adversarial machine learning, robustness, control theory, lyapunov theory, spectral norm regularization, stability and robustness analysis of dnns, dissipativity and passivity theory, adversarial attacks, learning theory, mathematical analysis of dnns

0

0

0

0

1:00

19/08/2021

Explaining Deep Neural Network Models with Adversarial Gradient Integration

Deng Pan, Xin Li, Dongxiao Zhu

Keywords Paper

Machine Learning, Adversarial Machine Learning, Explainable/Interpretable Machine Learning, Explainability

0

0

0

0

15:16

20/07/2020

Gating creates slow modes and controls phase-space complexity in GRUs and LSTMs

Tankut Can, Kamesh Krishnamurthy, David J. Schwab

Keywords Paper

0

0

0

0

21:00

02/02/2021

Learning Interpretable Models for Coupled Networks Under Domain Constraints

Hongyuan You, Sikun Lin, Ambuj Singh

Keywords Paper

0

0

0

0

16:47

13/04/2021

Fast adaptation with linearized neural networks

Wesley Maddox, Shuai Tang, Pablo Moreno and
Andrew Gordon Wilson, Andreas Damianou

Keywords Paper

0

0

0

0

3:13

06/12/2021

Analytic Insights into Structure and Rank of Neural Network Hessian Maps

Sidak Pal Singh, Gregor Bachmann, Thomas Hofmann

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

15:08

26/04/2020

Understanding Generalization in Recurrent Neural Networks

Zhuozhuo Tu, Fengxiang He, Dacheng Tao

Keywords Paper

generalization, recurrent neural networks, learning theory

0

0

0

0

4:16