Combining Ensembles and Data Augmentation Can Harm Your Calibration

03/05/2021

Combining Ensembles and Data Augmentation Can Harm Your Calibration

Yeming Wen, Ghassen Jerfel, Rafael Müller, Michael W Dusenberry, Jasper Snoek, Balaji Lakshminarayanan, Dustin Tran

Keywords: Uncertainty estimates, Ensembles, Calibration

Abstract Paper Similar Papers

Abstract: Ensemble methods which average over multiple neural network predictions are a simple approach to improve a model’s calibration and robustness. Similarly, data augmentation techniques, which encode prior information in the form of invariant feature transformations, are effective for improving calibration and robustness. In this paper, we show a surprising pathology: combining ensembles and data augmentation can harm model calibration. This leads to a trade-off in practice, whereby improved accuracy by combining the two techniques comes at the expense of calibration. On the other hand, selecting only one of the techniques ensures good uncertainty estimates at the expense of accuracy. We investigate this pathology and identify a compounding under-confidence among methods which marginalize over sets of weights and data augmentation techniques which soften labels. Finally, we propose a simple correction, achieving the best of both worlds with significant accuracy and calibration gains over using only ensembles or data augmentation individually. Applying the correction produces new state-of-the art in uncertainty calibration and robustness across CIFAR-10, CIFAR-100, and ImageNet.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

26/04/2020

Ensemble Distribution Distillation

Andrey Malinin, Bruno Mlodozeniec, Mark Gales

Keywords Paper

Ensemble Distillation, Knowledge Distillation, Uncertainty Estimation, Density Estimation

0

0

0

1

4:56

03/05/2021

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang and
Michael Pfeiffer, Dan Zhang

Keywords Paper

deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

0

0

0

0

5:13

02/02/2021

Practical and Rigorous Uncertainty Bounds for Gaussian Process Regression

Christian Fiedler, Carsten W. Scherer, Sebastian Trimpe

Keywords Paper

0

0

0

0

18:49

12/07/2020

Doubly robust off-policy evaluation with shrinkage

Yi Su, Maria Dimakopoulou, Akshay Krishnamurthy, Miroslav Dudik

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

15:08

06/12/2021

SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Jongheon Jeong, Sejun Park, Minkyu Kim and
Heung-Chang Lee, Do-Guk Kim, Jinwoo Shin

Keywords Paper

deep learning, machine learning, robustness, adversarial robustness and security

0

0

0

0

12:23

06/12/2021

Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions

Huan Ma, Zongbo Han, Changqing Zhang and
Huazhu Fu, Joey Tianyi Zhou, Qinghua Hu

Keywords Paper

0

0

0

0

5:37

06/12/2021

Towards Deeper Deep Reinforcement Learning with Spectral Normalization

Nils Bjorck, Carla Gomes, Kilian Weinberger

Keywords Paper

reinforcement learning and planning, vision, language

0

0

0

0

9:28

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

Keywords Paper

0

0

0

0

19:54

04/07/2020

Perturbation Based Learning for Structured NLP tasks with Application to Dependency Parsing

Amichay Doitch, Ram Yazdi, Tamir Hazan, Roi Reichart

Keywords Paper

Structured tasks, Dependency Parsing, NLP, sampling

0

0

0

0

10:53

06/12/2021

Uncertainty Quantification and Deep Ensembles

Rahul Rahaman, alexandre thiery

Keywords Paper

deep learning, machine learning

0

0

0

0

14:40

13/04/2021

Improving adversarial robustness via unlabeled out-of-domain data

Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou

Keywords Paper

0

0

0

0

3:01

06/12/2021

Consistency Regularization for Variational Auto-Encoders

Samarth Sinha, Adji Bousso Dieng

Keywords Paper

deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

0

0

0

0

10:52

12/07/2020

Towards Understanding the Regularization of Adversarial Robustness on Neural Networks

Yuxin Wen, Shuai Li, Kui Jia

Keywords Paper

Adversarial Examples

0

0

0

0

15:53

06/12/2021

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Zhengzhuo Xu, Zenghao Chai, Chun Yuan

Keywords Paper

theory, machine learning

0

0

0

0

4:23

19/08/2021

Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness

Dazhong Shen, Chuan Qin, Chao Wang and
Hengshu Zhu, Enhong Chen, Hui Xiong

Keywords Paper

Machine Learning, Bayesian Learning, Probabilistic Machine Learning, Unsupervised Learning

0

0

0

0

13:04

18/07/2021

Demystifying Inductive Biases for (Beta-)VAE Based Architectures

Dominik Zietlow, Michal Rolinek, Georg Martius

Keywords Paper

Deep Learning, Adversarial Networks, Applications, Body Pose, Face, and Gesture Analysis; Applications, Computer Vision; Deep Learning, Generative Models, Deep Learning, Embedding and Representation learning

0

0

0

0

4:51

06/12/2020

Understanding Anomaly Detection with Deep Invertible Networks through Hierarchies of Distributions and Features

Robin Schirrmeister, Yuxuan Zhou, Tonio Ball, Dan Zhang

Keywords Paper

0

0

0

0

3:21

06/12/2021

Scaling Ensemble Distribution Distillation to Many Classes with Proxy Targets

Max Ryabinin, Andrey Malinin, Mark Gales

Keywords Paper

machine learning

0

0

0

0

12:36

06/12/2021

Multi-Label Learning with Pairwise Relevance Ordering

Ming-Kun Xie, Sheng-Jun Huang

Keywords Paper

machine learning

0

0

0

0

3:56

06/12/2021

Evaluating model performance under worst-case subpopulations

Mike Li, Hongseok Namkoong, Shangzhou Xia

Keywords Paper

robustness, fairness

0

0

0

0

5:45

18/07/2021

Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data

Sung Woo Park, Junseok Kwon

Keywords Paper

Deep Learning, Generative Models, Algorithms, Representation Learning; Optimization, Submodular Optimization, Probabilistic Methods, Robust statistics

0

0

0

0

5:20

06/12/2020

Autoencoders that don't overfit towards the Identity

Harald Steck

Keywords Paper

0

0

0

0

3:22

26/08/2020

Uncertainty in Neural Networks: Approximately Bayesian Ensembling

Tim Pearce, Felix Leibfried, Alexandra Brintrup

Keywords Paper

0

0

0

0

16:03

06/12/2021

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords Paper

deep learning, machine learning, robustness, vision, domain adaptation

0

0

0

0

8:21

06/12/2021

Beware of the Simulated DAG! Causal Discovery Benchmarks May Be Easy to Game

Alexander Reisach, Christof Seiler, Sebastian Weichwald

Keywords Paper

optimization, graph learning, causality

0

0

0

0

14:13

12/07/2020

Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords Paper

Deep Learning - General

0

0

0

0

15:02

13/04/2021

Improving classifier confidence using lossy label-invariant transformations

Sooyong Jang, Insup Lee, James Weimer

Keywords Paper

0

0

0

0

2:59

06/12/2020

Consistency Regularization for Certified Robustness of Smoothed Classifiers

Jongheon Jeong, Jinwoo Shin

Keywords Paper

0

0

0

0

3:16

13/04/2021

High-dimensional multi-task averaging and application to kernel mean embedding

Hannah Marienwald, Jean-Baptiste Fermanian, Gilles Blanchard

Keywords Paper

0

0

0

0

3:01

02/02/2021

Uncertainty-Aware Multi-View Representation Learning

Yu Geng, Zongbo Han, Changqing Zhang, Qinghua Hu

Keywords Paper

0

0

0

0

14:19

06/12/2021

Understanding the Limits of Unsupervised Domain Adaptation via Data Poisoning

Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm

Keywords Paper

robustness, domain adaptation

0

0

0

0

13:34

12/07/2020

Fast and Consistent Learning of Hidden Markov Models by Incorporating Non-Consecutive Correlations

Robert Mattila, Cristian Rojas, Eric Moulines and
Vikram Krishnamurthy, Bo Wahlberg

Keywords Paper

Sequential, Network, and Time-Series Modeling

0

0

0

0

13:37

06/12/2020

When Do Neural Networks Outperform Kernel Methods?

Behrooz Ghorbani, Song Mei, Theodor Misiakiewicz, Andrea Montanari

Keywords Paper

0

0

0

0

3:16

06/12/2020

Rescuing neural spike train models from bad MLE

Diego Arribas, Yuan Zhao, Memming Park

Keywords Paper

0

0

0

0

3:23

16/11/2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Jack Hessel, Lillian Lee

Keywords Paper

modeling interactions, multimodal tasks, visual answering, multimodal learning

0

0

0

0

12:02

26/08/2020

Mitigating Overfitting in Supervised Classification from Two Unlabeled Datasets: A Consistent Risk Correction Approach

Nan Lu, Tianyi Zhang, Gang Niu, Masashi Sugiyama

Keywords Paper

0

0

0

0

10:16

06/12/2021

OpenMatch: Open-Set Semi-supervised Learning with Open-set Consistency Regularization

Kuniaki Saito, Donghyun Kim, Kate Saenko

Keywords Paper

semi-supervised learning

0

0

0

0

11:12

26/08/2020

A Unified Statistically Efficient Estimation Framework for Unnormalized Models

Masatoshi Uehara, Takafumi Kanamori, Takashi Takenouchi, Takeru Matsuda

Keywords Paper

0

0

0

0

13:58

26/04/2020

Understanding the Limitations of Conditional Generative Models

Ethan Fetaya, Joern-Henrik Jacobsen, Will Grathwohl, Richard Zemel

Keywords Paper

Conditional Generative Models, Generative Classifiers, Robustness, Adversarial Examples

0

0

0

0

4:46

12/07/2020

Fundamental Tradeoffs between Invariance and Sensitivity to Adversarial Perturbations

Florian Tramer, Jens Behrmann, Nicholas Carlini and
Nicolas Papernot, Joern-Henrik Jacobsen

Keywords Paper

Adversarial Examples

0

0

0

0

15:22