Efficient and Scalable Bayesian Neural Nets with Rank-1 Factors

12/07/2020

Mike Dusenberry, Ghassen Jerfel, Yeming Wen, Yian Ma, Jasper Snoek, Katherine Heller, Balaji Lakshminarayanan, Dustin Tran

Keywords: Deep Learning - General

Abstract: Bayesian neural networks (BNNs) demonstrate promising success in improving the robustness and uncertainty quantification of modern neural networks. However, they generally struggle with underfitting at scale and parameter efficiency. On the other hand, deep ensembles have emerged as an alternative for uncertainty quantification that, while outperforming BNNs on certain problems, also suffers from efficiency issues. It remains unclear how to combine the strengths of these two approaches and remediate their common issues. To tackle this challenge, we propose a rank-1 parameterization of BNNs, where each weight matrix involves only a distribution on a rank-1 subspace. We also revisit the use of mixture approximate posteriors to capture multiple modes, where, unlike typical mixtures, this approach admits a significantly smaller memory increase (e.g., only a 0.4% increase for a ResNet-50 mixture of size 10). We perform a systematic empirical study on the choices of prior, variational posterior, and methods to improve training. For ResNet-50 on ImageNet and Wide ResNet 28-10 on CIFAR-10/100, rank-1 BNNs demonstrate improved performance across log-likelihood, accuracy, and calibration on the test set and out-of-distribution variants.
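
The rank-1 idea in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. It assumes that the rank-1 factors are two vectors, r and s, that multiply a shared deterministic weight matrix elementwise through their outer product, and that each mixture component keeps a Gaussian mean and standard deviation for both vectors. The function names, layer sizes, and initial values are hypothetical.

```python
import numpy as np

def sample_rank1_factors(rng, mean_r, std_r, mean_s, std_s):
    """Draw one Gaussian sample of r (input side) and s (output side). Gaussian is an assumption."""
    r = mean_r + std_r * rng.standard_normal(mean_r.shape)
    s = mean_s + std_s * rng.standard_normal(mean_s.shape)
    return r, s

def rank1_dense(x, w, r, s):
    """Compute x @ (w * outer(r, s)) without materializing the perturbed matrix."""
    return ((x * r) @ w) * s

def rank1_overhead(d_in, d_out, k):
    """Extra parameters relative to w: k components, each with a mean and std for r and s."""
    return 2 * k * (d_in + d_out) / (d_in * d_out)

rng = np.random.default_rng(0)
d_in, d_out, k = 64, 32, 10              # hypothetical layer size and mixture size
w = rng.standard_normal((d_in, d_out))   # shared deterministic weights
x = rng.standard_normal((4, d_in))       # batch of 4 inputs

# One set of variational parameters per mixture component (hypothetical init).
mean_r, std_r = np.ones((k, d_in)), np.full((k, d_in), 0.1)
mean_s, std_s = np.ones((k, d_out)), np.full((k, d_out), 0.1)

# Average the layer output over mixture components, one sample each.
outputs = []
for i in range(k):
    r, s = sample_rank1_factors(rng, mean_r[i], std_r[i], mean_s[i], std_s[i])
    outputs.append(rank1_dense(x, w, r, s))
y_mean = np.mean(outputs, axis=0)

# The cheap form matches the explicit rank-1-perturbed weight matrix.
assert np.allclose(outputs[-1], x @ (w * np.outer(r, s)))
```

Under these assumptions the extra storage grows with d_in + d_out per component rather than d_in * d_out, which is why a mixture adds few parameters. The exact overhead depends on the architecture, so this toy dense layer is not expected to reproduce the 0.4% figure quoted for ResNet-50.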

The talk and the respective paper were published at the ICML 2020 virtual conference.

Similar Papers

12/07/2020

Maximum-and-Concatenation Networks

Xingyu Xie, Hao Kong, Jianlong Wu, Wayne Zhang, Guangcan Liu, Zhouchen Lin

Keywords: Deep Learning - Theory

14:05

26/04/2020

Stable Rank Normalization for Improved Generalization in Neural Networks and GANs

Amartya Sanyal, Philip H. Torr, Puneet K. Dokania

Keywords: Generalization, regularization, empirical lipschitz

5:25

06/12/2021

Boost Neural Networks by Checkpoints

Feng Wang, Guoyizhe Wei, Qiao Liu, Jinxiang Ou, Xian Wei, Hairong Lv

Keywords: deep learning

4:45

06/12/2021

Dangers of Bayesian Model Averaging under Covariate Shift

Pavel Izmailov, Patrick Nicholson, Sanae Lotfi, Andrew Wilson

Keywords: deep learning, robustness

15:57

06/12/2021

An Infinite-Feature Extension for Bayesian ReLU Nets That Fixes Their Asymptotic Overconfidence

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords: deep learning, kernel methods

10:57

18/07/2021

What Are Bayesian Neural Network Posteriors Really Like?

Pavel Izmailov, Sharad Vikram, Matt Hoffman, Andrew Wilson

Keywords: Deep Learning, Bayesian Deep Learning

17:13

03/05/2021

Activation-level uncertainty in deep neural networks

Pablo Morales-Alvarez, Daniel Hernández-Lobato, Rafael Molina, José Miguel Hernández Lobato

Keywords: Gaussian Processes, Bayesian Neural Networks, Deep Gaussian Processes, Uncertainty estimation

6:53

03/05/2021

Reweighting Augmented Samples by Minimizing the Maximal Expected Loss

Mingyang Yi, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma

Keywords: sample reweighting, data augmentation

4:58

02/02/2021

Vector Quantized Bayesian Neural Network Inference for Data Streams

Namuk Park, Taekyu Lee, Songkuk Kim

19:20

06/12/2021

Collapsed Variational Bounds for Bayesian Neural Networks

Marcin Tomczak, Siddharth Swaroop, Andrew Foong, Richard Turner

Keywords: deep learning, optimization, generative model

5:44

03/05/2021

Evaluation of Neural Architectures Trained With Square Loss vs Cross-Entropy in Classification Tasks

Like Hui, Misha Belkin

Keywords: classification, experimental evaluation, square loss vs cross-entropy, large scale learning

5:09

12/07/2020

Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords: Deep Learning - General

15:02

16/11/2020

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning

Yuning Mao, Yanru Qu, Yiqing Xie, Xiang Ren, Jiawei Han

Keywords: single-document summarization, single-document sds, multi-document summarization, multi-document mds

10:58

30/11/2020

Accurate and Efficient Single Image Super-Resolution with Matrix Channel Attention Network

Hailong Ma, Xiangxiang Chu, Bo Zhang

9:18

26/04/2020

EMPIR: Ensembles of Mixed Precision Deep Networks for Increased Robustness Against Adversarial Attacks

Sanchari Sen, Balaraman Ravindran, Anand Raghunathan

Keywords: ensembles, mixed precision, robustness, adversarial attacks

4:25

05/01/2021

MeliusNet: An Improved Network Architecture for Binary Neural Networks

Joseph Bethge, Christian Bartz, Haojin Yang, Ying Chen, Christoph Meinel

5:00

06/12/2020

On the Expressiveness of Approximate Inference in Bayesian Neural Networks

Andrew Foong, David Burt, Yingzhen Li, Richard Turner

3:23

01/07/2020

Compressing Neural Machine Translation Models with 4-bit Precision

Alham Fikri Aji, Kenneth Heafield

9:35

06/12/2020

Constant-Expansion Suffices for Compressed Sensing with Generative Priors

Constantinos Daskalakis, Dhruv Rohatgi, Emmanouil Zampetakis

3:13

14/06/2020

Low-Rank Compression of Neural Nets: Learning the Rank of Each Layer

Yerlan Idelbayev, Miguel Á. Carreira-Perpiñán

Keywords: low-rank compression, rank selection, optimization, discrete-continuous optimization

1:00

14/06/2020

Fast Sparse ConvNets

Erich Elsen, Marat Dukhan, Trevor Gale, Karen Simonyan

Keywords: vision, convolutional networks, cnns, efficient inference, sparsity, mobile, edge, tensorflow, xnnpack

1:01

03/05/2021

Training independent subnetworks for robust prediction

Marton Havasi, Rodolphe Jenatton, Stanislav Fort, Jeremiah Zhe Liu, Jasper Snoek, Balaji Lakshminarayanan, Andrew Dai, Dustin Tran

Keywords: robustness, Efficient ensembles

4:10

14/06/2020

Overcoming Multi-Model Forgetting in One-Shot NAS With Diversity Maximization

Miao Zhang, Huiqi Li, Shirui Pan, Xiaojun Chang, Steven Su

Keywords: automl, neural architecture search, catastrophic forgetting, novelty search, continual learning

1:01

04/07/2020

Improved Natural Language Generation via Loss Truncation

Daniel Kang, Tatsunori Hashimoto

Keywords: Natural Generation, optimization, estimation, distinguishability

10:35

26/04/2020

Harnessing the Power of Infinitely Wide Deep Nets on Small-data Tasks

Sanjeev Arora, Simon S. Du, Zhiyuan Li, Ruslan Salakhutdinov, Ruosong Wang, Dingli Yu

Keywords: small data, neural tangent kernel, UCI database, few-shot learning, kernel SVMs, deep learning theory, kernel design

5:02

02/02/2021

SA-BNN: State-Aware Binary Neural Network

Chunlei Liu, Peng Chen, Bohan Zhuang, Chunhua Shen, Baochang Zhang, Wenrui Ding

17:42

26/04/2020

Data-Independent Neural Pruning via Coresets

Ben Mussay, Margarita Osadchy, Vladimir Braverman, Samson Zhou, Dan Feldman

Keywords: coresets, neural pruning, network compression

4:23

03/05/2021

AdamP: Slowing Down the Slowdown for Momentum Optimizers on Scale-invariant Weights

Byeongho Heo, Sanghyuk Chun, Seong Joon Oh, Dongyoon Han, Sangdoo Yun, Gyuwan Kim, Youngjung Uh, Jung-Woo Ha

Keywords: effective learning rate, normalize layer, scale-invariant weights, momentum optimizer

5:16

14/06/2020

Adaptive Loss-Aware Quantization for Multi-Bit Networks

Zhongnan Qu, Zimu Zhou, Yun Cheng, Lothar Thiele

Keywords: quantization, binary neural networks, adaptive bitwidth, loss-aware

1:01

22/11/2021

Parameter Efficient Dynamic Convolution via Tensor Decomposition

Zejiang Hou, Sun-Yuan Kung

Keywords: dynamic convolution, input-dependent reparameterization, parameter efficiency, tensor decomposition

3:58

06/12/2020

Bayesian Deep Learning and a Probabilistic Perspective of Generalization

Andrew Wilson, Pavel Izmailov

3:27

02/02/2021

Training Spiking Neural Networks with Accumulated Spiking Flow

Hao Wu, Yueyi Zhang, Wenming Weng, Yongting Zhang, Zhiwei Xiong, Zheng-Jun Zha, Xiaoyan Sun, Feng Wu

16:45

22/11/2021

UWC: Unit-wise Calibration Towards Rapid Network Compression

Chen Lin, Zheyang Li, Bo Peng, Wenming Tan, Ye Ren, Shiliang Pu

Keywords: post training quantization

4:16

02/02/2021

On the Softmax Bottleneck of Recurrent Language Models

Dwarak Govind Parthiban, Yongyi Mao, Diana Inkpen

19:58

12/07/2020

Up or Down? Adaptive Rounding for Post-Training Quantization

Markus Nagel, Rana Ali Amjad, Marinus van Baalen, Christos Louizos, Tijmen Blankevoort

Keywords: Deep Learning - Algorithms

15:08

12/07/2020

Normalized Loss Functions for Deep Learning with Noisy Labels

Xingjun Ma, Hanxun Huang, Yisen Wang, Simone Romano, Sarah Erfani, James Bailey

Keywords: Supervised Learning

16:00

14/06/2020

ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks

Qilong Wang, Banggu Wu, Pengfei Zhu, Peihua Li, Wangmeng Zuo, Qinghua Hu

Keywords: channel attention, efficient, adaptive 1d convolution, deep cnns, image classification, object detection, instance segmentation

0:57

16/11/2020

Discriminatively-Tuned Generative Classifiers for Robust Natural Language Inference

Xiaoan Ding, Tianyu Liu, Baobao Chang, Zhifang Sui, Kevin Gimpel

Keywords: natural inference, nli tasks, discriminative fine-tuning, discriminative classifiers

11:37

03/05/2021

Understanding Over-parameterization in Generative Adversarial Networks

Yogesh Balaji, Mohammadmahdi Sajedi, Neha Kalibhat, Mucong Ding, Dominik Stöger, Mahdi Soltanolkotabi, Soheil Feizi

Keywords: min-max optimization, Over-parameterization, GAN

5:04

26/08/2020

Fenchel Lifted Networks: A Lagrange Relaxation of Neural Network Training

Fangda Gu, Armin Askari, Laurent El Ghaoui

14:27