Simple and Effective Regularization Methods for Training on Noisily Labeled Data with Generalization Guarantee

26/04/2020

Wei Hu, Zhiyuan Li, Dingli Yu

Keywords: deep learning theory, regularization, noisy labels

Abstract: Over-parameterized deep neural networks trained by simple first-order methods are known to be able to fit any labeling of data. Such over-fitting ability hinders generalization when mislabeled training examples are present. On the other hand, simple regularization methods like early stopping can often achieve highly nontrivial performance on clean test data in these scenarios, a phenomenon not theoretically understood. This paper proposes and analyzes two simple and intuitive regularization methods: (i) regularization by the distance from the network parameters to their initialization, and (ii) adding a trainable auxiliary variable to the network output for each training example. Theoretically, we prove that gradient descent training with either of these two methods leads to a generalization guarantee on the clean data distribution despite being trained using noisy labels. Our generalization analysis relies on the connection between wide neural networks and the neural tangent kernel (NTK). The generalization bound is independent of the network size and is comparable to the bound one can get when there is no label noise. Experimental results verify the effectiveness of these methods on noisily labeled datasets.
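
The two regularizers named in the abstract can be illustrated with a small sketch. It is not the authors' code: it assumes a linear model and squared loss in place of a wide neural network, and the function names, the strength parameter `lam`, the way `lam` scales each term, and the step-size choice are all illustrative assumptions.

```python
import numpy as np


def rdi_loss_and_grad(theta, theta0, X, y, lam):
    # Method (i): squared loss plus a penalty on the distance to initialization.
    resid = X @ theta - y
    loss = 0.5 * resid @ resid + 0.5 * lam**2 * np.sum((theta - theta0) ** 2)
    grad = X.T @ resid + lam**2 * (theta - theta0)
    return loss, grad


def aux_loss_and_grads(theta, b, X, y, lam):
    # Method (ii): each training example i gets its own trainable offset b[i],
    # scaled by lam and added to the model output before the loss is taken.
    resid = X @ theta + lam * b - y
    loss = 0.5 * resid @ resid
    return loss, X.T @ resid, lam * resid


def train(X, y, lam, method, steps=200, seed=0):
    # Plain gradient descent; method is "rdi" or "aux" (hypothetical labels).
    rng = np.random.default_rng(seed)
    theta0 = 0.1 * rng.normal(size=X.shape[1])
    theta, b = theta0.copy(), np.zeros(X.shape[0])
    # Step size 1/L, where L bounds the curvature of either quadratic objective.
    lr = 1.0 / (np.linalg.norm(X, 2) ** 2 + lam**2)
    losses = []
    for _ in range(steps):
        if method == "rdi":
            loss, g_theta = rdi_loss_and_grad(theta, theta0, X, y, lam)
        else:
            loss, g_theta, g_b = aux_loss_and_grads(theta, b, X, y, lam)
            b -= lr * g_b
        theta -= lr * g_theta
        losses.append(loss)
    return theta, losses
```

In this sketch the auxiliary variables `b` exist only during training; predictions on new inputs would use `X_new @ theta` alone, so the per-example offsets can absorb part of the label noise without entering the final predictor.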

The talk and the corresponding paper were published at the ICLR 2020 virtual conference.

Similar Papers

26/08/2020

Gradient Descent with Early Stopping is Provably Robust to Label Noise for Overparameterized Neural Networks

Mingchen Li, Mahdi Soltanolkotabi, Samet Oymak

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li, Yifei Gong, Xing Sun, Yang Liu

Keywords: deep neural networks, instance-based label noise, learning with noisy labels

26/08/2020

Adaptive, Distribution-Free Prediction Intervals for Deep Networks

Danijel Kivaranovic, Kory D. Johnson, Hannes Leeb

18/07/2021

On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks

Hancheng Min, Salma Tarmoun, Rene Vidal, Enrique Mallada

Keywords: Theory

26/04/2020

Effect of Activation Functions on the Training of Overparametrized Neural Nets

Abhishek Panigrahi, Abhishek Shetty, Navin Goyal

Keywords: activation functions, deep learning theory, neural networks

06/12/2020

The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks

Wei Hu, Lechao Xiao, Ben Adlam, Jeffrey Pennington

12/07/2020

Confidence-Aware Learning for Deep Neural Networks

Sangheum Hwang, Jooyoung Moon, Jihyo Kim, Younghak Shin

Keywords: Deep Learning - Algorithms

12/07/2020

Concise Explanations of Neural Networks using Adversarial Training

Prasad Chalasani, Jiefeng Chen, Amrita Roy Chowdhury, Xi Wu, Somesh Jha

Keywords: Accountability, Transparency and Interpretability

14/06/2020

Watch Your Up-Convolution: CNN Based Generative Deep Neural Networks Are Failing to Reproduce Spectral Distributions

Ricard Durall, Margret Keuper, Janis Keuper

Keywords: spectral regularization, gan, deepfake, up-convolution, generative models, frequency spectrum

03/05/2021

Overparameterisation and worst-case generalisation: friend or foe?

Aditya Krishna Menon, Ankit Singh Rawat, Sanjiv Kumar

Keywords: worst-case generalisation, overparameterisation

06/12/2021

Bayesian Adaptation for Covariate Shift

Aurick Zhou, Sergey Levine

Keywords: deep learning, machine learning, robustness, vision, domain adaptation

03/05/2021

How Benign is Benign Overfitting?

Amartya Sanyal, Puneet Dokania, Varun Kanade, Philip Torr

Keywords: generalization, memorization, benign overfitting, adversarial robustness

12/07/2020

Efficient proximal mapping of the path-norm regularizer of shallow networks

Fabian Latorre, Paul Rolland, Shaul Nadav Hallak, Volkan Cevher

Keywords: Deep Learning - Algorithms

18/07/2021

Uniform Convergence, Adversarial Spheres and a Simple Remedy

Gregor Bachmann, Seyed Moosavi, Thomas Hofmann

Keywords: Theory, Deep learning Theory

03/05/2021

An Unsupervised Deep Learning Approach for Real-World Image Denoising

Dihan Zheng, Sia Huat Tan, Xiaowen Zhang, Zuoqiang Shi, Kaisheng Ma, Chenglong Bao

Keywords: Real-world image denoising, unsupervised image denoising

06/12/2020

Adversarial robustness via robust low rank representations

Pranjal Awasthi, Himanshu Jain, Ankit Singh Rawat, Aravindan Vijayaraghavan

06/12/2021

Integrated Latent Heterogeneity and Invariance Learning in Kernel Space

Jiashuo Liu, Zheyuan Hu, Peng Cui, Bo Li, Zheyan Shen

Keywords: deep learning, reinforcement learning and planning, machine learning

12/07/2020

Being Bayesian, Even Just a Bit, Fixes Overconfidence in ReLU Networks

Agustinus Kristiadi, Matthias Hein, Philipp Hennig

Keywords: Deep Learning - General

06/12/2021

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Kuan-Lin Chen, Ching-Hua Lee, Harinath Garudadri, Bhaskar D Rao

Keywords: optimization, vision

14/06/2020

Regularizing Class-Wise Predictions via Self-Knowledge Distillation

Sukmin Yun, Jongjin Park, Kimin Lee, Jinwoo Shin

Keywords: image classification, regularization, self-knowledge distillation, generalization, calibration

04/07/2020

Improved Natural Language Generation via Loss Truncation

Daniel Kang, Tatsunori Hashimoto

Keywords: Natural Generation, optimization, estimation, distinguishability

03/05/2021

Robust Curriculum Learning: from clean label detection to noisy label self-correction

Tianyi Zhou, Shengjie Wang, Jeff Bilmes

Keywords: neural networks, curriculum learning, training dynamics, robust learning, noisy label

12/07/2020

Detecting Out-of-Distribution Examples with Gram Matrices

Chandramouli Shama Sastry, Sageev Oore

Keywords: Fairness, Equity, Justice, and Safety

06/12/2020

Bad Global Minima Exist and SGD Can Reach Them

Shengchao Liu, Dimitrios Papailiopoulos, Dimitris Achlioptas

03/05/2021

Reweighting Augmented Samples by Minimizing the Maximal Expected Loss

Mingyang Yi, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma

Keywords: sample reweighting, data augmentation

06/12/2021

Revisiting Hilbert-Schmidt Information Bottleneck for Adversarial Robustness

Zifeng Wang, Tong Jian, Aria Masoomi, Stratis Ioannidis, Jennifer Dy

Keywords: deep learning, robustness, adversarial robustness and security

12/07/2020

Adversarial Neural Pruning with Latent Vulnerability Suppression

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords: Adversarial Examples

18/07/2021

RATT: Leveraging Unlabeled Data to Guarantee Generalization

Saurabh Garg, Sivaraman Balakrishnan, Zico Kolter, Zachary Lipton

Keywords: Probabilistic Methods, Graphical Models, Theory, Computational Complexity, Theory, Models of Learning and Generalization

26/04/2020

Improved Sample Complexities for Deep Neural Networks and Robust Classification via an All-Layer Margin

Colin Wei, Tengyu Ma

Keywords: deep learning theory, generalization bounds, adversarially robust generalization, data-dependent generalization bounds

12/07/2020

Training Binary Neural Networks through Learning with Noisy Supervision

Kai Han, Yunhe Wang, Yixing Xu, Chunjing Xu, Enhua Wu, Chang Xu

Keywords: Unsupervised and Semi-Supervised Learning

30/11/2020

Bridging Adversarial and Statistical Domain Transfer via Spectral Adaptation Networks

Christoph Raab, Philipp Väth, Peter Meier, Frank-Michael Schleif

26/08/2020

Adversarial Risk Bounds through Sparsity based Compression

Emilio Balda, Niklas Koep, Arash Behboodi, Rudolf Mathar

03/05/2021

Implicit Gradient Regularization

David Barrett, Benoit Dherin

Keywords: regularization, theory, deep learning, implicit regularization, deep learning theory, theoretical issues in deep learning

02/02/2021

A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data

Xianfeng Li, Weijie Chen, Di Xie, Shicai Yang, Peng Yuan, Shiliang Pu, Yueting Zhuang

05/01/2021

Do We Really Need Gold Samples for Sample Weighting Under Label Noise?

Aritra Ghosh, Andrew Lan

12/07/2020

Unlabelled Data Improves Bayesian Uncertainty Calibration under Covariate Shift

Alexander Chan, Ahmed Alaa, Zhaozhi Qian, Mihaela van der Schaar

Keywords: Trustworthy Machine Learning

14/06/2020

Towards Achieving Adversarial Robustness by Enforcing Feature Consistency Across Bit Planes

Sravanti Addepalli, Vivek B.S., Arya Baburaj, Gaurang Sriramanan, R. Venkatesh Babu

Keywords: adversarial robustness, adversarial defense, adversarial training, fast adversarial training, adversary-free training, adversarial attacks, efficient adversarial training, generalization, feature consistency, deep neural networks

14/06/2020

HRank: Filter Pruning Using High-Rank Feature Map

Mingbao Lin, Rongrong Ji, Yan Wang, Yichen Zhang, Baochang Zhang, Yonghong Tian, Ling Shao

Keywords: network pruning, neural network compression and acceleration, high-rank feature map, efficient deep learning computing

22/11/2021

Unsupervised Image Denoising with Frequency Domain Knowledge

Nahyun Kim, Donggon Jang, Sunhyeok Lee, Bomi Kim, Daeshik Kim

Keywords: unsupervised image denoising, Fourier transform, frequency knowledge, GAN-based denoising

03/05/2021

Towards Robust Neural Networks via Close-loop Control

Zhuotong Chen, Qianxiao Li, Zheng Zhang

Keywords: dynamical system, neural network robustness, optimal control