Does label smoothing mitigate label noise?

12/07/2020

Does label smoothing mitigate label noise?

Michal Lukasik, Srinadh Bhojanapalli, Aditya Menon, Sanjiv Kumar

Keywords: Deep Learning - Algorithms

Abstract Paper Similar Papers

Abstract: Label smoothing is commonly used in training deep learning models, wherein one-hot training labels are mixed with uniform label vectors. Empirically, smoothing has been shown to improve both predictive performance and model calibration. In this paper, we study whether label smoothing is also effective as a means of coping with label noise. While label smoothing apparently amplifies this problem --- being equivalent to injecting symmetric noise to the labels --- we show how it relates to a general family of loss-correction techniques from the label noise literature. Building on this connection, we show that label smoothing can be competitive with loss-correction techniques under label noise. Further, we show that when performing distillation under label noise, label smoothing of the teacher can be beneficial; this is in contrast to recent findings for noise-free problems, and sheds further light on settings where label smoothing is beneficial.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

03/05/2021

Rethinking Soft Labels for Knowledge Distillation: A Bias–Variance Tradeoff Perspective

Helong Zhou, Liangchen Song, Jiajie Chen and
Ye Zhou, Guoli Wang, Junsong Yuan, Qian Zhang

Keywords Paper

teacher-student model, soft labels, Knowledge distillation

0

0

0

0

2:20

03/05/2021

Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing

Asish Ghoshal, Xilun Chen, Sonal Gupta and
Luke Zettlemoyer, Yashar Mehdad

Keywords Paper

calibration, semantic parsing, structured prediction, label smoothing

0

0

0

0

5:37

12/07/2020

Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates

Yang Liu, Hongyi Guo

Keywords Paper

Supervised Learning

0

0

0

0

15:57

26/08/2020

Regularization via Structural Label Smoothing

Weizhi Li, Gautam Dasarathy, Visar Berisha

Keywords Paper

0

0

0

0

13:36

06/12/2020

Self-Distillation as Instance-Specific Label Smoothing

Zhilu Zhang, Mert Sabuncu

Keywords Paper

0

0

0

0

3:09

06/12/2021

Training Over-parameterized Models with Non-decomposable Objectives

Harikrishna Narasimhan, Aditya Menon

Keywords Paper

optimization, machine learning, fairness

0

0

0

0

8:28

18/07/2021

A statistical perspective on distillation

Aditya Menon, Ankit Singh Rawat, Sashank Jakkam Reddi and
Seungyeon Kim, Sanjiv Kumar

Keywords Paper

Theory, Deep learning Theory

0

0

0

0

4:56

03/05/2021

When Optimizing $f$-Divergence is Robust with Label Noise

Jiaheng Wei, Yang Liu

Keywords Paper

robustness, learning with noisy labels, $f-$divergence

0

0

0

0

5:36

02/02/2021

Label Confusion Learning to Enhance Text Classification Models

Biyang Guo, Songqiao Han, Xiao Han and
Hailiang Huang, Ting Lu

Keywords Paper

0

0

0

0

15:17

11/10/2020

Data Cleansing with Contrastive Learning for Vocal Note Event Annotations

Gabriel Meseguer Brocal, Rachel Bittner, Simon Durand, Brian Brost

Keywords Paper

Domain knowledge, Machine learning/Artificial intelligence for music, Evaluation, datasets, and reproducibility, Novel datasets and use cases, MIR tasks, Music transcription and annotation

0

0

0

0

3:51

22/11/2021

Training Better Deep Neural Networks with Uncertainty Mining Net

Yang Sun, Abhishek Kolagunda, Steven Eliuk, Xiaolong Wang

Keywords Paper

label noise, label uncertainty, learning with noisy labels

0

0

0

0

3:00

18/07/2021

Class2Simi: A Noise Reduction Perspective on Learning with Noisy Labels

Songhua Wu, Xiaobo Xia, Tongliang Liu and
Bo Han, Mingming Gong, Nannan Wang, Haifeng Liu, Gang Niu

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

4:54

19/08/2021

Comparing Kullback-Leibler Divergence and Mean Squared Error Loss in Knowledge Distillation

Taehyeon Kim, Jaehoon Oh, Nak Yil Kim and
Sangwook Cho, Se-Young Yun

Keywords Paper

Machine Learning, Classification, Deep Learning

0

0

0

0

12:43

02/02/2021

Learning to Purify Noisy Labels via Meta Soft Label Corrector

Yichen Wu, Jun Shu, Qi Xie and
Qian Zhao, Deyu Meng

Keywords Paper

0

0

0

0

13:01

02/02/2021

From Label Smoothing to Label Relaxation

Julian Lienen, Eyke Hüllermeier

Keywords Paper

0

0

0

0

20:04

02/02/2021

Learning from Noisy Labels with Complementary Loss Functions

Deng-Bao Wang, Yong Wen, Lujia Pan, Min-Ling Zhang

Keywords Paper

0

0

0

0

14:00

03/05/2021

Understanding and Improving Lexical Choice in Non-Autoregressive Translation

Liam Ding, Longyue Wang, Xuebo Liu and
Derek Wong, Dacheng Tao, Zhaopeng Tu

Keywords Paper

0

0

0

0

11:37

02/02/2021

PULNS: Positive-Unlabeled Learning with Effective Negative Sample Selector

Chuan Luo, Pu Zhao, Chen Chen and
Bo Qiao, Chao Du, Hongyu Zhang, Wei Wu, Shaowei Cai, Bing He, Saravanakumar Rajmohan, Qingwei Lin

Keywords Paper

0

0

0

0

18:08

02/02/2021

Curriculum Labeling: Revisiting Pseudo-Labeling for Semi-Supervised Learning

Paola Cascante-Bonilla, Fuwen Tan, Yanjun Qi, Vicente Ordonez

Keywords Paper

0

0

0

0

19:45

03/05/2021

Noise against noise: stochastic label noise helps combat inherent label noise

Pengfei Chen, Guangyong Chen, Junjie Ye and
jingwei zhao, Pheng-Ann Heng

Keywords Paper

Regularization, SGD noise, Robust Learning, Noisy Labels

0

0

0

0

9:42

06/12/2021

Revealing and Protecting Labels in Distributed Training

Trung Dang, Om Thakkar, Swaroop Ramaswamy and
Rajiv Mathews, Peter Chin, Françoise Beaufays

Keywords Paper

machine learning, vision, privacy, federated learning

0

0

0

0

13:06

12/07/2020

Variational Label Enhancement

Ning Xu, Yun-Peng Liu, Jun Shu, Xin Geng

Keywords Paper

Supervised Learning

0

0

0

0

15:37

02/02/2021

Exploiting Unlabeled Data via Partial Label Assignment for Multi-Class Semi-Supervised Learning

Zhen-Ru Zhang, Qian-Wen Zhang, Yunbo Cao, Min-Ling Zhang

Keywords Paper

0

0

0

0

15:05

12/07/2020

Learning with Bounded Instance- and Label-dependent Label Noise

Jiacheng Cheng, Tongliang Liu, Kotagiri Ramamohanarao, Dacheng Tao

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

12:40

03/05/2021

Robust Curriculum Learning: from clean label detection to noisy label self-correction

Tianyi Zhou, Shengjie Wang, Jeff Bilmes

Keywords Paper

neural networks, curriculum learning, training dynamics, robust learning, noisy label

0

0

0

0

5:02

03/05/2021

Robust early-learning: Hindering the memorization of noisy labels

Xiaobo Xia, Tongliang Liu, Bo Han and
Chen Gong, Nannan Wang, Zongyuan Ge, Yi Chang

Keywords Paper

0

0

0

0

4:39

19/08/2021

Perturb, Predict & Paraphrase: Semi-Supervised Learning using Noisy Student for Image Captioning

Arjit Jain, Pranay Reddy Samala, Preethi Jyothi and
Deepak Mittal, Maneesh Singh

Keywords Paper

Computer Vision, Language and Vision, Semi-Supervised Learning

0

0

0

0

10:06

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

deep neural networks., instance-based label noise, Learning with noisy labels

0

0

0

0

5:18

06/12/2021

Open-set Label Noise Can Improve Robustness Against Inherent Label Noise

Hongxin Wei, Lue Tao, RENCHUNZI XIE, Bo An

Keywords Paper

deep learning, robustness

0

0

0

0

2:46

06/12/2021

End-to-End Weak Supervision

Salva Rühling Cachay, Benedikt Boecking, Artur Dubrawski

Keywords Paper

deep learning, machine learning, robustness

0

0

0

0

14:43

06/12/2021

A Theory-Driven Self-Labeling Refinement Method for Contrastive Representation Learning

Pan Zhou, Caiming Xiong, Xiaotong Yuan, Steven Chu Hong Hoi

Keywords Paper

theory, machine learning, self-supervised learning, contrastive learning, representation learning

0

0

0

0

14:12

18/07/2021

Active Testing: Sample-Efficient Model Evaluation

Jannik Kossen, Sebastian Farquhar, Yarin Gal, Tom Rainforth

Keywords Paper

Algorithms, Active Learning

0

0

0

0

5:19

03/05/2021

Is Label Smoothing Truly Incompatible with Knowledge Distillation: An Empirical Study

Zhiqiang Shen, Zhiqiang Shen, Dejia Xu and
Zitian Chen, Kwang-Ting Cheng, Marios Savvides

Keywords Paper

binary neural networks, knowledge distillation, label smoothing, image classification, neural machine translation

0

0

0

0

4:46

18/07/2021

Leveraged Weighted Loss for Partial Label Learning

Hongwei Wen, Jingyi Cui, Hanyuan Hang and
Jiabin Liu, Yisen Wang, Zhouchen Lin

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

1

13:26

22/11/2021

PropMix: Hard Sample Filtering and Proportional MixUp for Learning with Noisy Labels

Filipe Rolim Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

Keywords Paper

noisy labels, noisy annotation, Mixup, hard samples, noisy samples, noisy training

0

0

0

0

3:01

03/05/2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve, Kevin Duarte, Yogesh S Rawat, Mubarak Shah

Keywords Paper

Deep Learning, Calibration, Uncertainty, Pseudo-Labeling, Semi-Supervised Learning

0

0

0

0

5:06

22/11/2021

Elsa: Energy-based Learning for Semi-supervised Anomaly Detection

Sungwon Han, HyeonHo Song, Seung Eon Lee and
Sungwon Park, Meeyoung Cha

Keywords Paper

contrastive learning, energy-based learning, semi-supervised learning, anomaly detection

0

0

0

0

2:48

18/07/2021

Contrastive Learning Inverts the Data Generating Process

Roland S. Zimmermann, Yash Sharma, Steffen Schneider and
Matthias Bethge, Wieland Brendel

Keywords Paper

Theory, Deep learning Theory

1

1

0

1

5:17

03/05/2021

Knowledge Distillation as Semiparametric Inference

Tri Dao, Govinda Kamath, Vasilis Syrgkanis, Lester Mackey

Keywords Paper

generalization bounds, knowledge distillation, model compression, loss correction, orthogonal machine learning, cross-fitting, semiparametric inference

0

0

0

0

5:10

12/07/2020

Time-Consistent Self-Supervision for Semi-Supervised Learning

Tianyi Zhou, Shengjie Wang, Jeff Bilmes

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

14:37