Revisiting the Calibration of Modern Neural Networks

Abstract: Accurate estimation of predictive uncertainty (model calibration) is essential for the safe application of neural networks. Many instances of miscalibration in modern neural networks have been reported, suggesting a trend that newer, more accurate models produce poorly calibrated predictions. Here, we revisit this question for recent state-of-the-art image classification models. We systematically relate model calibration and accuracy, and find that the most recent models, notably those not using convolutions, are among the best calibrated. Trends observed in prior model generations, such as decay of calibration with distribution shift or model size, are less pronounced in recent architectures. We also show that model size and amount of pretraining do not fully explain these differences, suggesting that architecture is a major determinant of calibration properties.

06/12/2021

single image 3d human reconstruction high-resolution implicit function clothed human coarse-to-fine multi-level pifu geometry clothing fashion

5:01

13/04/2021

Revisiting the Calibration of Modern Neural Networks

Matthias Minderer, Josip Djolonga, Rob Romijnders, Frances Hubis, Xiaohua Zhai, Neil Houlsby, Dustin Tran, Mario Lucic

Comments

Similar Papers

Rethinking Calibration of Deep Neural Networks: Do Not Be Afraid of Overconfidence

Deng-Bao Wang, Lei Feng, Min-Ling Zhang

Keywords Abstract Paper

deep learning, machine learning

Improving Calibration through the Relationship with Adversarial Robustness

Yao Qin, Xuezhi Wang, Alex Beutel, Ed Chi

Keywords Abstract Paper

deep learning, machine learning, robustness, adversarial robustness and security

Combining Ensembles and Data Augmentation Can Harm Your Calibration

Yeming Wen, Ghassen Jerfel, Rafael Müller and Michael W Dusenberry, Jasper Snoek, Balaji Lakshminarayanan, Dustin Tran

Keywords Abstract Paper

Uncertainty estimates, Ensembles, Calibration

Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift

Alexander Chan, Ahmed Alaa, Zhaozhi Qian, Mihaela van der Schaar

Keywords Abstract Paper

Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords Abstract Paper

Meta-Cal: Well-controlled Post-hoc Calibration by Ranking

Xingchen Ma, Matthew B Blaschko

Keywords Abstract Paper

Algorithms, Supervised Learning

Implicit Bias of Linear RNNs

Melika Emami, Moji Sahraee-Ardakan, Parthe Pandit and Sundeep Rangan, Alyson Fletcher

Keywords Abstract Paper

Theory, Deep learning Theory

Towards Understanding the Regularization of Adversarial Robustness on Neural Networks

Yuxin Wen, Shuai Li, Kui Jia

Keywords Abstract Paper

Uncertainty Quantification and Deep Ensembles

Rahul Rahaman, alexandre thiery

Keywords Abstract Paper

deep learning, machine learning

Are Neural Rankers still Outperformed by Gradient Boosted Decision Trees?

Zhen Qin, Le Yan, Honglei Zhuang and Yi Tay, Rama Kumar Pasumarthi, Xuanhui Wang, Michael Bendersky, Marc Najork

Keywords Abstract Paper

gradient boosted decision trees, Learning to Rank, benchmark, neural network

Calibrated Reliable Regression using Maximum Mean Discrepancy

Peng Cui, Wenbo Hu, Jun Zhu

Keywords Abstract Paper

Regularizing Class-Wise Predictions via Self-Knowledge Distillation

Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

Keywords Abstract Paper

image classification, regularization, self-knowledge distillation, generalization, calibration

Overparameterisation and worst-case generalisation: friend or foe?

Aditya Krishna Menon, Ankit Singh Rawat, Sanjiv Kumar

Keywords Abstract Paper

worst-case generalisation, overparameterisation

Local Signal Adaptivity: Provable Feature Learning in Neural Networks Beyond Kernels

Stefani Karp, Ezra Winston, Yuanzhi Li, Aarti Singh

Keywords Abstract Paper

theory, deep learning, optimization, machine learning, vision, kernel methods

PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization

Shunsuke Saito, Tomas Simon, Jason Saragih, Hanbyul Joo

Keywords Abstract Paper

Improving classifier confidence using lossy label-invariant transformations

Sooyong Jang, Insup Lee, James Weimer

Keywords Abstract Paper

Training independent subnetworks for robust prediction

Marton Havasi, Rodolphe Jenatton, Stanislav Fort and Jeremiah Zhe Liu, Jasper Snoek, Balaji Lakshminarayanan, Andrew Dai, Dustin Tran

Keywords Abstract Paper

robustness, Efficient ensembles

Evaluation of Neural Architectures Trained With Square Loss vs Cross-Entropy in Classification Tasks

Like Hui, Misha Belkin

Keywords Abstract Paper

classification, experimental evaluation, square loss vs cross-entropy, large scale learning

Uncertainty in Neural Networks: Approximately Bayesian Ensembling

Tim Pearce, Felix Leibfried, Alexandra Brintrup

Keywords Abstract Paper

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords Abstract Paper

deep learning, machine learning, robustness, vision, domain adaptation

Ensemble Distillation for Structured Prediction: Calibrated, Accurate, Fast—Choose Three

Steven Reich, David Mueller, Nicholas Andrews

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Yeming Wen, Ghassen Jerfel, Rafael Müller and
Michael W Dusenberry, Jasper Snoek, Balaji Lakshminarayanan, Dustin Tran

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Melika Emami, Moji Sahraee-Ardakan, Parthe Pandit and
Sundeep Rangan, Alyson Fletcher

Keywords Paper

Keywords Paper

Keywords Paper

Zhen Qin, Le Yan, Honglei Zhuang and
Yi Tay, Rama Kumar Pasumarthi, Xuanhui Wang, Michael Bendersky, Marc Najork

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Marton Havasi, Rodolphe Jenatton, Stanislav Fort and
Jeremiah Zhe Liu, Jasper Snoek, Balaji Lakshminarayanan, Andrew Dai, Dustin Tran

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Muhammad Asim, Max Daniels, Oscar Leong and
Paul Hand, Ali Ahmed

Keywords Paper

Shen Yan, Yu Zheng, Wei Ao and
Xiao Zeng, Mi Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Xingjun Ma, Hanxun Huang, Yisen Wang and
Simone Romano, Sarah Erfani, James Bailey

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jan-Matthis Lueckmann, Jan Boelts, David Greenberg and
Pedro Goncalves, Jakob Macke

Keywords Paper

Keywords Paper