Improving predictions of bayesian neural nets via local linearization

13/04/2021

Improving predictions of bayesian neural nets via local linearization

Alexander Immer, Maciej Korzepa, Matthias Bauer

Keywords:

Abstract Paper Similar Papers

Abstract: The generalized Gauss-Newton (GGN) approximation is often used to make practical Bayesian deep learning approaches scalable by replacing a second order derivative with a product of first order derivatives. In this paper we argue that the GGN approximation should be understood as a local linearization of the underlying Bayesian neural network (BNN), which turns the BNN into a generalized linear model (GLM). Because we use this linearized model for posterior inference, we should also predict using this modified model instead of the original one. We refer to this modified predictive as "GLM predictive" and show that it effectively resolves common underfitting problems of the Laplace approximation. It extends previous results in this vein to general likelihoods and has an equivalent Gaussian process formulation, which enables alternative inference schemes for BNNs in function space. We demonstrate the effectiveness of our approach on several standard classification datasets as well as on out-of-distribution detection. We provide an implementation at https://github.com/AlexImmer/BNN-predictions.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Gaussian Gated Linear Networks

David Budden, Adam Marblestone, Eren Sezener and
Tor Lattimore, Greg Wayne, Joel Veness

Keywords Paper

0

0

0

0

3:28

18/07/2021

Asynchronous Decentralized Optimization With Implicit Stochastic Variance Reduction

Kenta Niwa, Guoqiang Zhang, W. Bastiaan Kleijn and
Noboru Harada, Hiroshi Sawada, Akinori Fujino

Keywords Paper

Optimization, Distributed and Parallel Optimization

0

0

0

0

5:41

06/12/2021

Model Selection for Bayesian Autoencoders

Ba-Hien Tran, Simone Rossi, Dimitrios Milios and
Pietro Michiardi, Edwin Bonilla, Maurizio Filippone

Keywords Paper

optimization, self-supervised learning, generative model, representation learning

0

0

0

0

10:49

18/07/2021

Data-driven Prediction of General Hamiltonian Dynamics via Learning Exactly-Symplectic Maps

Renyi Chen, Molei Tao

Keywords Paper

Algorithms, Time Series and Sequences

0

0

0

0

5:21

12/07/2020

dS^2LBI: Exploring Structural Sparsity on Deep Network via Differential Inclusion Paths

Yanwei Fu, Chen Liu, Donghao Li and
Xinwei Sun, Jinshan ZENG, Yuan Yao

Keywords Paper

Deep Learning - Algorithms

0

0

0

1

12:45

06/12/2020

Benchmarking Deep Inverse Models over time, and the Neural-Adjoint method

Ben Ren, Willie Padilla, Jordan Malof

Keywords Paper

0

0

0

0

3:17

13/04/2021

Latent derivative bayesian last layer networks

Joe Watson, Jihao Andreas Lin, Pascal Klink and
Joni Pajarinen, Jan Peters

Keywords Paper

0

0

0

0

3:05

12/07/2020

Learning Flat Latent Manifolds with VAEs

Nutan Chen, Alexej Klushyn, Francesco Ferroni and
Justin Bayer, Patrick van der Smagt

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

10:13

06/12/2021

Multiwavelet-based Operator Learning for Differential Equations

Gaurav Gupta, Xiongye Xiao, Paul Bogdan

Keywords Paper

0

0

0

0

12:15

18/07/2021

Bilinear Classes: A Structural Framework for Provable Generalization in RL

Simon Du, Sham Kakade, Jason Lee and
Shachar Lovett, Gaurav Mahajan, Wen Sun, Ruosong Wang

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

17:40

02/02/2021

Partial-Label and Structure-constrained Deep Coupled Factorization Network

Yan Zhang, Zhao Zhang, Yang Wang and
Zheng Zhang, Li Zhang, Shuicheng Yan, Meng Wang

Keywords Paper

0

0

0

0

13:39

12/07/2020

Better depth-width trade-offs for neural networks through the lens of dynamical systems

Evangelos Chatziafratis, Ioannis Panageas, Sai Ganesh Nagarajan

Keywords Paper

Deep Learning - Theory

0

0

0

0

16:21

06/12/2020

Practical Quasi-Newton Methods for Training Deep Neural Networks

Donald Goldfarb, Yi Ren, Achraf Bahamou

Keywords Paper

0

0

0

0

3:13

06/12/2020

A new convergent variant of Q-learning with linear function approximation

Diogo Carvalho, Francisco S. Melo, Pedro A. Santos

Keywords Paper

0

0

0

0

2:30

19/08/2021

Variational Model-based Policy Optimization

Yinlam Chow, Brandon Cui, Moonkyung Ryu, Mohammad Ghavamzadeh

Keywords Paper

Machine Learning, Reinforcement Learning

0

0

0

0

15:31

06/12/2021

Hessian Eigenspectra of More Realistic Nonlinear Models

Zhenyu Liao, Michael W Mahoney

Keywords Paper

theory, optimization, machine learning

0

0

0

0

15:49

12/07/2020

Differentiating through the Fréchet Mean

Aaron Lou, Isay Katsman, Qingxuan Jiang and
Serge Belongie, Ser Nam Lim, Christopher De Sa

Keywords Paper

Representation Learning

0

0

0

0

12:11

06/12/2021

Robust Regression Revisited: Acceleration and Improved Estimation Rates

Arun Jambulapati, Jerry Li, Tselil Schramm, Kevin Tian

Keywords Paper

theory, optimization

0

0

0

0

14:22

03/05/2021

On the mapping between Hopfield networks and Restricted Boltzmann Machines

Matthew Smart, Anton Zilman

Keywords Paper

Statistical Physics, Restricted Boltzmann Machines, Hopfield Networks

0

0

0

0

14:13

14/06/2020

Shape correspondence using anisotropic Chebyshev spectral CNNs

Qinsong Li, Shengjun Liu, Ling Hu, Xinru Liu

Keywords Paper

shape correspondence, geometric deep learning

0

0

0

0

1:02

18/07/2021

A Wasserstein Minimax Framework for Mixed Linear Regression

Theo Diamandis, Yonina Eldar, Alireza Fallah and
Farzan Farnia, Asuman Ozdaglar

Keywords Paper

Algorithms, Multimodal Learning

0

0

0

0

25:41

06/12/2021

Combining Recurrent, Convolutional, and Continuous-time Models with Linear State Space Layers

Albert Gu, Isys Johnson, Karan Goel and
Khaled Saab, Tri Dao, Atri Rudra, Christopher Ré

Keywords Paper

theory, deep learning, machine learning, vision

0

0

0

0

15:13

19/08/2021

On Guaranteed Optimal Robust Explanations for NLP Models

Emanuele La Malfa, Rhiannon Michelmore, Agnieszka M. Zbrzezny and
Nicola Paoletti, Marta Kwiatkowska

Keywords Paper

Machine Learning, Adversarial Machine Learning, Explainable/Interpretable Machine Learning, Sentiment Analysis and Text Mining

0

0

0

0

14:52

06/12/2020

Non-Euclidean Universal Approximation

Anastasis Kratsios, Eugene Bilokopytov

Keywords Paper

0

0

0

0

3:34

12/07/2020

State Space Expectation Propagation: Efficient Inference Schemes for Temporal Gaussian Processes

William Wilkinson, Paul Chang, Michael Andersen, Arno Solin

Keywords Paper

Gaussian Processes

0

0

0

0

13:31

06/12/2020

Deep Rao-Blackwellised Particle Filters for Time Series Forecasting

Richard Kurle, Syama Sundar Rangapuram, Emmanuel de Bézenac and
Stephan Günnemann, Jan Gasthaus

Keywords Paper

0

0

0

0

3:14

12/07/2020

Minimax Weight and Q-Function Learning for Off-Policy Evaluation

Masatoshi Uehara, Jiawei Huang, Nan Jiang

Keywords Paper

Reinforcement Learning - Theory

0

0

0

0

14:20

18/07/2021

Rethinking Neural vs. Matrix-Factorization Collaborative Filtering: the Theoretical Perspectives

Da Xu, Chuanwei Ruan, Evren Korpeoglu and
Sushant Kumar, Kannan Achan

Keywords Paper

Algorithms, Algorithms, Structured Prediction, Algorithms, Collaborative Filtering

0

0

0

0

5:14

02/02/2021

A Trace-restricted Kronecker-Factored Approximation to Natural Gradient

Kaixin Gao, Xiaolei Liu, Zhenghai Huang and
Min Wang, Zidong Wang, Dachuan Xu, Fan Yu

Keywords Paper

0

0

0

0

17:43

26/08/2020

Sparse Orthogonal Variational Inference for Gaussian Processes

Jiaxin Shi, Michalis Titsias, Andriy Mnih

Keywords Paper

0

0

0

0

13:53

12/07/2020

Feature Selection using Stochastic Gates

Yutaro Yamada, Ofir Lindenbaum, Sahand Negahban, Yuval Kluger

Keywords Paper

Supervised Learning

0

0

0

0

15:17

18/07/2021

Positive-Negative Momentum: Manipulating Stochastic Gradient Noise to Improve Generalization

Zeke Xie, Li Yuan, Zhanxing Zhu, Masashi Sugiyama

Keywords Paper

Optimization, Stochastic Optimization

0

0

0

0

5:17

18/07/2021

Neural Rough Differential Equations for Long Time Series

James Morrill, Cristopher Salvi, Patrick Kidger, James Foster

Keywords Paper

Algorithms, Time Series and Sequences

0

0

0

0

5:31

19/01/2020

Semantics of Higher-Order Probabilistic Programs with Conditioning

Fredrik Dahlqvist, Dexter Kozen

Keywords Paper

semantics, type system, Probabilistic programming

0

0

0

0

18:42

18/07/2021

Representation Subspace Distance for Domain Adaptation Regression

Xinyang Chen, Sinan Wang, Jianmin Wang, Mingsheng Long

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

4:32

06/12/2020

Correspondence learning via linearly-invariant embedding

Riccardo Marin, Marie-Julie Rakotosaona, Simone Melzi, Maks Ovsjanikov

Keywords Paper

0

0

0

0

3:18

18/07/2021

Active Learning of Continuous-time Bayesian Networks through Interventions

Dominik Linzner, Heinz Koeppl

Keywords Paper

Probabilistic Methods, Graphical Models

0

0

0

0

5:07

12/07/2020

Structured Policy Iteration for Linear Quadratic Regulator

Youngsuk Park, Ryan Rossi, Zheng Wen and
Gang Wu, Handong Zhao

Keywords Paper

Reinforcement Learning - General

0

0

0

0

16:08

18/07/2021

Leveraging Non-uniformity in First-order Non-convex Optimization

Jincheng Mei, Yue Gao, Bo Dai and
Csaba Szepesvari, Dale Schuurmans

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

4:49

06/12/2020

Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces

Guy Lorberbom, Chris J. Maddison, Nicolas Heess and
Tamir Hazan, Daniel Tarlow

Keywords Paper

0

0

0

0

3:16