From Label Smoothing to Label Relaxation

02/02/2021

From Label Smoothing to Label Relaxation

Julian Lienen, Eyke Hüllermeier

Keywords:

Abstract Paper Similar Papers

Abstract: Regularization of (deep) learning models can be realized at the model, loss, or data level. As a technique somewhere in-between loss and data, label smoothing turns deterministic class labels into probability distributions, for example by uniformly distributing a certain part of the probability mass over all classes. A predictive model is then trained on these distributions as targets, using cross-entropy as loss function. While this method has shown improved performance compared to non-smoothed cross-entropy, we argue that the use of a smoothed though still precise probability distribution as a target can be questioned from a theoretical perspective. As an alternative, we propose a generalized technique called label relaxation, in which the target is a set of probabilities represented in terms of an upper probability distribution. This leads to a genuine relaxation of the target instead of a distortion, thereby reducing the risk of incorporating an undesirable bias in the learning process. Methodically, label relaxation leads to the minimization of a novel type of loss function, for which we propose a suitable closed-form expression for model optimization. The effectiveness of the approach is demonstrated in an empirical study on image data.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948094

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Self-Supervised Learning with Kernel Dependence Maximization

Yazhe Li, Roman Pogodin, [deadname] J Sutherland, Arthur Gretton

Keywords Paper

machine learning, self-supervised learning, vision, representation learning, kernel methods, semi-supervised learning

0

0

0

0

11:48

03/05/2021

Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing

Asish Ghoshal, Xilun Chen, Sonal Gupta and
Luke Zettlemoyer, Yashar Mehdad

Keywords Paper

calibration, semantic parsing, structured prediction, label smoothing

0

0

0

0

5:37

19/08/2021

Deep Residual Reinforcement Learning (Extended Abstract)

Shangtong Zhang, Wendelin Boehmer, Shimon Whiteson

Keywords Paper

Machine Learning, Reinforcement Learning

0

0

0

0

12:34

03/05/2021

Separation and Concentration in Deep Networks

John Zarka, Florentin Guth, Stéphane Mallat

Keywords Paper

concentration, mean separation, neural collapse, fisher ratio, image classification, variance reduction, deep learning

0

0

0

0

5:11

06/12/2020

Stochastic Normalizing Flows

Hao Wu, Jonas Köhler, Frank Noe

Keywords Paper

0

0

0

0

3:19

18/07/2021

Learning Randomly Perturbed Structured Predictors for Direct Loss Minimization

Hedda Cohen Indelman, Tamir Hazan

Keywords Paper

Algorithms, Structured Prediction, Algorithms, Collaborative Filtering, Applications, Recommender Systems

0

0

0

0

5:11

03/05/2021

Noise against noise: stochastic label noise helps combat inherent label noise

Pengfei Chen, Guangyong Chen, Junjie Ye and
jingwei zhao, Pheng-Ann Heng

Keywords Paper

Regularization, SGD noise, Robust Learning, Noisy Labels

0

0

0

0

9:42

03/05/2021

Adversarial score matching and improved sampling for image generation

Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Ioannis Mitliagkas, Remi Combes

Keywords Paper

score matching, adversarial, generative model, GAN, Langevin dynamics

0

0

0

0

4:56

12/07/2020

Meta-learning with Stochastic Linear Bandits

Leonardo Cella, Alessandro Lazaric, Massimiliano Pontil

Keywords Paper

Transfer, Multitask and Meta-learning

1

1

0

0

13:17

06/12/2021

Diffusion Normalizing Flow

Qinsheng Zhang, Yongxin Chen

Keywords Paper

generative model

0

0

0

0

9:09

26/04/2020

Adaptive Correlated Monte Carlo for Contextual Categorical Sequence Generation

Xinjie Fan, Yizhe Zhang, Zhendong Wang, Mingyuan Zhou

Keywords Paper

binary softmax, discrete variables, policy gradient, pseudo actions, reinforcement learning, variance reduction

0

0

0

0

4:59

03/08/2020

Relaxed Multivariate Bernoulli Distribution and Its Applications to Deep Generative Models

Xi Wang, Junming Yin

Keywords Paper

0

0

0

0

7:56

06/12/2020

Understanding Double Descent Requires A Fine-Grained Bias-Variance Decomposition

Ben Adlam, Jeffrey Pennington

Keywords Paper

0

0

0

0

3:30

06/12/2020

Deep Diffusion-Invariant Wasserstein Distributional Classification

Sung Woo Park, Dong Wook Shu, Junseok Kwon

Keywords Paper

0

0

0

0

3:06

03/05/2021

Bayesian Few-Shot Classification with One-vs-Each Pólya-Gamma Augmented Gaussian Processes

Jake Snell, Richard Zemel

Keywords Paper

uncertainty estimation, few-shot learning, bayesian deep learning, gaussian processes

0

0

0

0

4:58

06/12/2020

Deep Rao-Blackwellised Particle Filters for Time Series Forecasting

Richard Kurle, Syama Sundar Rangapuram, Emmanuel de Bézenac and
Stephan Günnemann, Jan Gasthaus

Keywords Paper

0

0

0

0

3:14

19/08/2021

Variational Model-based Policy Optimization

Yinlam Chow, Brandon Cui, Moonkyung Ryu, Mohammad Ghavamzadeh

Keywords Paper

Machine Learning, Reinforcement Learning

0

0

0

0

15:31

18/07/2021

DriftSurf: Stable-State / Reactive-State Learning under Concept Drift

Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phil Gibbons

Keywords Paper

Algorithms, Online Learning Algorithms

0

0

0

0

5:07

06/12/2021

Repulsive Deep Ensembles are Bayesian

Francesco D'Angelo, Vincent Fortuin

Keywords Paper

deep learning, optimization

0

0

0

0

11:53

12/07/2020

Double Trouble in Double Descent: Bias and Variance(s) in the Lazy Regime

Stéphane d'Ascoli, Maria Refinetti, Giulio Biroli, Florent Krzakala

Keywords Paper

Deep Learning - Theory

0

0

0

0

15:11

06/12/2020

On the Convergence of Smooth Regularized Approximate Value Iteration Schemes

Elena Smirnova, Elvis Dohmatob

Keywords Paper

0

0

0

0

3:22

06/12/2020

Can Temporal-Diﬀerence and Q-Learning Learn Representation? A Mean-Field Theory

Yufeng Zhang, Qi Cai, Zhuoran Yang and
Yongxin Chen, Zhaoran Wang

Keywords Paper

0

0

0

0

3:02

18/07/2021

Tighter Bounds on the Log Marginal Likelihood of Gaussian Process Regression Using Conjugate Gradients

Artem Artemev, David Burt, Mark van der Wilk

Keywords Paper

Probabilistic Methods, Gaussian Processes and Bayesian non-parametrics

0

0

0

0

17:13

03/05/2021

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang and
Michael Pfeiffer, Dan Zhang

Keywords Paper

deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

0

0

0

0

5:13

26/08/2020

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification

Han Bao, Masashi Sugiyama

Keywords Paper

0

0

0

0

15:01

19/08/2021

Regularizing Variational Autoencoder with Diversity and Uncertainty Awareness

Dazhong Shen, Chuan Qin, Chao Wang and
Hengshu Zhu, Enhong Chen, Hui Xiong

Keywords Paper

Machine Learning, Bayesian Learning, Probabilistic Machine Learning, Unsupervised Learning

0

0

0

0

13:04

19/08/2021

Differentially Private Correlation Alignment for Domain Adaptation

Kaizhong Jin, Xiang Cheng, Jiaxi Yang, Kaiyuan Shen

Keywords Paper

Multidisciplinary Topics and Applications, Security and Privacy, Transfer, Adaptation, Multi-task Learning

0

0

0

0

8:03

06/12/2021

Leveraging Recursive Gumbel-Max Trick for Approximate Inference in Combinatorial Spaces

Kirill Struminsky, Artyom Gadetsky, Denis Rakitin and
Danil Karpushkin, Dmitry Vetrov

Keywords Paper

deep learning, optimization

0

0

0

0

14:26

12/07/2020

Supervised learning: no loss no cry

Richard Nock, Aditya Menon

Keywords Paper

Learning Theory

0

0

0

0

15:18

06/12/2021

Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning

Yifan Zhang, Bryan Hooi, Dapeng Hu and
Jian Liang, Jiashi Feng

Keywords Paper

optimization, machine learning, self-supervised learning, vision, contrastive learning, representation learning, transfer learning

0

0

0

0

14:34

20/07/2020

DP-LSSGD: A Stochastic Optimization Method to Lift the Utility in Privacy-Preserving ERM

Bao Wang, Quanquan Gu, March Boedihardjo and
Lingxiao Wang, Farzin Barekat, Stanley J. Osher

Keywords Paper

0

0

0

0

17:42

03/05/2021

CPR: Classifier-Projection Regularization for Continual Learning

Sungmin Cha, Hsiang Hsu, Taebaek Hwang and
Flavio Calmon, Taesup Moon

Keywords Paper

regularization, wide local minima, continual learning

0

0

0

1

5:21

04/08/2021

Nonparametric Regression with Shallow Overparametrized Neural Networks Trained by GD with Early Stopping

Ilja Kuzborskij , Csaba Szepesvari

Keywords Paper

0

0

0

0

15:14

18/07/2021

Understanding self-supervised learning dynamics without contrastive pairs

Yuandong Tian, Xinlei Chen, Surya Ganguli

Keywords Paper

Deep Learning, Optimization for Deep Networks

0

0

0

0

18:16

06/12/2020

Robust Federated Learning: The Case of Affine Distribution Shifts

Amirhossein Reisizadeh, Farzan Farnia, Ramtin Pedarsani, Ali Jadbabaie

Keywords Paper

0

0

0

0

3:16

18/07/2021

Learning from Similarity-Confidence Data

Yuzhou Cao, Lei Feng, Yitian Xu and
Bo An, Gang Niu, Masashi Sugiyama

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

4:05

06/12/2020

Beyond the Mean-Field: Structured Deep Gaussian Processes Improve the Predictive Uncertainties

Jakob Lindinger, David Reeb, Christoph Lippert, Barbara Rakitsch

Keywords Paper

0

0

0

0

3:21

13/04/2021

Learning with risk-averse feedback under potentially heavy tails

Matthew Holland, El Mehdi Haress

Keywords Paper

0

0

0

0

2:44

26/08/2020

Regularization via Structural Label Smoothing

Weizhi Li, Gautam Dasarathy, Visar Berisha

Keywords Paper

0

0

0

0

13:36

06/12/2020

Bayesian Deep Learning and a Probabilistic Perspective of Generalization

Andrew Wilson, Pavel Izmailov

Keywords Paper

0

0

0

0

3:27