Locally Adaptive Label Smoothing Improves Predictive Churn

Abstract: Training modern neural networks is an inherently noisy process that can lead to high \emph{prediction churn}-- disagreements between re-trainings of the same model due to factors such as randomization in the parameter initialization and mini-batches-- even when the trained models all attain similar accuracies. Such prediction churn can be very undesirable in practice. In this paper, we present several baselines for reducing churn and show that training on soft labels obtained by adaptively smoothing each example's label based on the example's neighboring labels often outperforms the baselines on churn while improving accuracy on a variety of benchmark classification tasks and model architectures.

13/04/2021

Algorithms -> Active Learning; Algorithms -> Classification; Algorithms -> Ranking and Preference Learning, Theory -> Learning Theory

3:28

12/07/2020

Locally Adaptive Label Smoothing Improves Predictive Churn

Dara Bahri, Heinrich Jiang

Comments

Similar Papers

Hidden cost of randomized smoothing

Jeet Mohapatra, Ching-Yun Ko, Lily Weng and Pin-Yu Chen, Sijia Liu, Luca Daniel

Keywords Abstract Paper

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

Qizhou Wang, Bo Han, Tongliang Liu and Gang Niu, Jian Yang, Chen Gong

Keywords Abstract Paper

DIBS: Diversity Inducing Information Bottleneck in Model Ensembles

Samarth Sinha, Homanga Bharadhwaj, Anirudh Goyal and Hugo Larochelle, Animesh Garg, Florian Shkurti

Keywords Abstract Paper

Uncertainty-Aware CNNs for Depth Completion: Uncertainty from Beginning to End

Abdelrahman Eldesokey, Michael Felsberg, Karl Holmquist, Michael Persson

Keywords Abstract Paper

uncertainty, sparsity, depth completion, bayesian deep learning, normalized convolution, real-time

Regularizing Class-Wise Predictions via Self-Knowledge Distillation

Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

Keywords Abstract Paper

image classification, regularization, self-knowledge distillation, generalization, calibration

Non-Parametric Calibration for Classification

Jonathan Wenger, Hedvig Kjellström, Rudolph Triebel )

Keywords Abstract Paper

Nondeterminism and Instability in Neural Network Optimization

Cecilia Summers, Michael J Dinneen

Keywords Abstract Paper

Deep Learning, Optimization for Deep Networks

On the Generalization Benefit of Noise in Stochastic Gradient Descent

Samuel Smith, Erich Elsen, Soham De

Keywords Abstract Paper

Deep Learning - General

Triple descent and the two kinds of overfitting: where & why do they appear?

Stéphane d'Ascoli, Levent Sagun, Giulio Biroli

Keywords Abstract Paper

Algorithms -> Active Learning; Algorithms -> Classification; Algorithms -> Ranking and Preference Learning, Theory -> Learning Theory

Puzzle Mix: Exploiting Saliency and Local Statistics for Optimal Mixup

Jang-Hyun Kim, Wonho Choo, Hyun Oh Song

Keywords Abstract Paper

Deep Learning - Algorithms

Risk Bounds for Over-parameterized Maximum Margin Classification on Sub-Gaussian Mixtures

Yuan Cao, Quanquan Gu, Mikhail Belkin

Keywords Abstract Paper

deep learning, machine learning

Instance-dependent Label-noise Learning under a Structural Causal Model

Yu Yao, Tongliang Liu, Mingming Gong and Bo Han, Gang Niu, Kun Zhang

Keywords Abstract Paper

deep learning, causality

Distilling Multiple Domains for Neural Machine Translation

Anna Currey, Prashant Mathur, Georgiana Dinu

Keywords Abstract Paper

translation, neural translation, multi-domain model, high-resource conditions

Improving generalization by controlling label-noise information in neural network weights

Hrayr Harutyunyan, Kyle Reing, Greg Ver Steeg, Aram Galstyan

Keywords Abstract Paper

Supervised Learning

Robust training with ensemble consensus

Jisoo Lee, Sae-Young Chung

Keywords Abstract Paper

Annotation noise, Noisy label, Robustness, Ensemble, Perturbation

Gradient Descent with Early Stopping is Provably Robust to Label Noise for Overparameterized Neural Networks

Mingchen Li, Mahdi Soltanolkotabi, Samet Oymak

Keywords Abstract Paper

What can linearized neural networks actually say about generalization?

Guillermo Ortiz-Jimenez, Seyed-Mohsen Moosavi-Dezfooli, Pascal Frossard

Keywords Abstract Paper

theory, deep learning

The Neural Tangent Kernel in High Dimensions: Triple Descent and a Multi-Scale Theory of Generalization

Ben Adlam, Jeffrey Pennington

Keywords Abstract Paper

Deep Learning - Theory

Rethinking Calibration of Deep Neural Networks: Do Not Be Afraid of Overconfidence

Deng-Bao Wang, Lei Feng, Min-Ling Zhang

Keywords Abstract Paper

deep learning, machine learning

Influence Functions in Deep Learning Are Fragile

Samyadeep Basu, Phil Pope, Soheil Feizi

Keywords Abstract Paper

Influence Functions, Interpretability

AugMix: A Simple Data Processing Method to Improve Robustness and Uncertainty

Jeet Mohapatra, Ching-Yun Ko, Lily Weng and
Pin-Yu Chen, Sijia Liu, Luca Daniel

Keywords Paper

Qizhou Wang, Bo Han, Tongliang Liu and
Gang Niu, Jian Yang, Chen Gong

Keywords Paper

Samarth Sinha, Homanga Bharadhwaj, Anirudh Goyal and
Hugo Larochelle, Animesh Garg, Florian Shkurti

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yu Yao, Tongliang Liu, Mingming Gong and
Bo Han, Gang Niu, Kun Zhang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Dan Hendrycks, Norman Mu, Ekin Dogus Cubuk and
Barret Zoph, Justin Gilmer, Balaji Lakshminarayanan

Keywords Paper

Setareh Ariafar, Zelda Mariet, Dana Brooks and
Jennifer Dy, Jasper Snoek

Keywords Paper

Keywords Paper

Ahmadreza Jeddi, Mohammad Javad Shafiee, Michelle Karg and
Christian Scharfenberger, Alexander Wong

Keywords Paper

Anne Draelos, Pranjal Gupta, Na Young Jun and
Chaichontat Sriworarat, John Pearson

Keywords Paper

Keywords Paper

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

Shufei Zhang, Zhuang Qian, Kaizhu Huang and
Qiufeng Wang, Rui Zhang, Xinping Yi

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Bo Han, Gang Niu, Xingrui Yu and
QUANMING YAO, Miao Xu, Ivor Tsang, Masashi Sugiyama

Keywords Paper