Analysing the Noise Model Error for Realistic Noisy Label Data

02/02/2021

Analysing the Noise Model Error for Realistic Noisy Label Data

Michael A. Hedderich, Dawei Zhu, Dietrich Klakow

Keywords:

Abstract Paper Similar Papers

Abstract: Distant and weak supervision allow to obtain large amounts of labeled training data quickly and cheaply, but these automatic annotations tend to contain a high amount of errors. A popular technique to overcome the negative effects of these noisy labels is noise modelling where the underlying noise process is modelled. In this work, we study the quality of these estimated noise models from the theoretical side by deriving the expected error of the noise model. Apart from evaluating the theoretical results on commonly used synthetic noise, we also publish NoisyNER, a new noisy label dataset from the NLP domain that was obtained through a realistic distant supervision technique. It provides seven sets of labels with differing noise patterns to evaluate different noise levels on the same instances. Parallel, clean labels are available making it possible to study scenarios where a small amount of gold-standard data can be leveraged. Our theoretical results and the corresponding experiments give insights into the factors that influence the noise model estimation like the noise distribution and the sampling technique.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948181

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

03/05/2021

When Optimizing $f$-Divergence is Robust with Label Noise

Jiaheng Wei, Yang Liu

Keywords Paper

robustness, learning with noisy labels, $f-$divergence

0

0

0

0

5:36

18/07/2021

Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization

Yivan Zhang, Gang Niu, Masashi Sugiyama

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

19:29

02/02/2021

Learning Precise Temporal Point Event Detection with Misaligned Labels

Julien Schroeter, Kirill Sidorov, David Marshall

Keywords Paper

0

0

0

0

21:24

06/12/2021

Open-set Label Noise Can Improve Robustness Against Inherent Label Noise

Hongxin Wei, Lue Tao, RENCHUNZI XIE, Bo An

Keywords Paper

deep learning, robustness

0

0

0

0

2:46

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

deep neural networks., instance-based label noise, Learning with noisy labels

0

0

0

0

5:18

03/05/2021

How Benign is Benign Overfitting ?

Amartya Sanyal, Puneet Dokania, Varun Kanade, Philip Torr

Keywords Paper

generalization, memorization, benign overfitting, adversarial robustness

0

0

0

0

10:56

06/12/2021

Improved Regularization and Robustness for Fine-tuning in Neural Networks

Dongyue Li, Hongyang Zhang

Keywords Paper

deep learning, machine learning, robustness, vision, transfer learning

0

0

0

0

12:03

18/07/2021

Wasserstein Distributional Normalization For Robust Distributional Certification of Noisy Labeled Data

Sung Woo Park, Junseok Kwon

Keywords Paper

Deep Learning, Generative Models, Algorithms, Representation Learning; Optimization, Submodular Optimization, Probabilistic Methods, Robust statistics

0

0

0

0

5:20

02/02/2021

Uncertainty-Aware Multi-View Representation Learning

Yu Geng, Zongbo Han, Changqing Zhang, Qinghua Hu

Keywords Paper

0

0

0

0

14:19

18/07/2021

Confidence Scores Make Instance-dependent Label-noise Learning Possible

Antonin Berthon, Bo Han, Gang Niu and
Tongliang Liu, Masashi Sugiyama

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

20:38

03/05/2021

Noise against noise: stochastic label noise helps combat inherent label noise

Pengfei Chen, Guangyong Chen, Junjie Ye and
jingwei zhao, Pheng-Ann Heng

Keywords Paper

Regularization, SGD noise, Robust Learning, Noisy Labels

0

0

0

0

9:42

06/12/2020

Dual T: Reducing Estimation Error for Transition Matrix in Label-noise Learning

Nick Yao, Tongliang Liu, Bo Han and
Mingming Gong, Jiankang Deng, Gang Niu, Masashi Sugiyama

Keywords Paper

, Optimization -> Non-Convex Optimization

0

0

0

0

3:15

06/12/2020

Distributionally Robust Local Non-parametric Conditional Estimation

Viet Anh Nguyen, Fan Zhang, Jose Blanchet and
Erick Delage, Yinyu Ye

Keywords Paper

0

0

0

0

3:22

06/12/2021

On the Algorithmic Stability of Adversarial Training

Yue Xing, Qifan Song, Guang Cheng

Keywords Paper

deep learning, adversarial robustness and security

0

0

0

0

13:00

04/07/2020

Label Noise in Context

Michael Desmond, Catherine Finegan-Dollak, Jeff Boston, Matt Arnold

Keywords Paper

Label Noise, manual remediation, correcting noise, noise remediation

0

0

0

0

11:43

06/12/2021

Can Less be More? When Increasing-to-Balancing Label Noise Rates Considered Beneficial

Yang Liu, Jialu Wang

Keywords Paper

machine learning, fairness

0

0

0

0

14:49

14/06/2020

Optical Flow in the Dark

Yinqiang Zheng, Mingfang Zhang, Feng Lu

Keywords Paper

low-light, optical flow, noise modeling, synthetic dataset, cnn

0

0

0

0

1:01

03/05/2021

Learning with Feature-Dependent Label Noise: A Progressive Approach

Yikai Zhang, Songzhu Zheng, Pengxiang Wu and
Mayank Goswami, Chao Chen

Keywords Paper

Noisy Label, Classification, Deep Learning

0

0

0

0

10:37

06/12/2021

On Density Estimation with Diffusion Models

Diederik Kingma, Tim Salimans, Ben Poole, Jonathan Ho

Keywords Paper

optimization, generative model

0

0

0

0

9:53

03/05/2021

Multiscale Score Matching for Out-of-Distribution Detection

Ahsan Mahmood, Junier Oliva, Martin A Styner

Keywords Paper

out-of-distribution detection, deep learning, score matching, outlier detection

0

0

0

0

5:13

14/06/2020

PointASNL: Robust Point Clouds Processing Using Nonlocal Neural Networks With Adaptive Sampling

Xu Yan, Chaoda Zheng, Zhen Li and
Sheng Wang, Shuguang Cui

Keywords Paper

adaptive sampling, non-local network, robust point clouds processing, point clouds segmentation

0

0

0

0

1:01

06/12/2021

Relative Uncertainty Learning for Facial Expression Recognition

Yuhang Zhang, Chengrui Wang, Weihong Deng

Keywords Paper

0

0

0

0

8:12

06/12/2020

Universally Quantized Neural Compression

Eirikur Agustsson, Lucas Theis

Keywords Paper

0

0

0

0

3:03

02/02/2021

Learning to Purify Noisy Labels via Meta Soft Label Corrector

Yichen Wu, Jun Shu, Qi Xie and
Qian Zhao, Deyu Meng

Keywords Paper

0

0

0

0

13:01

18/07/2021

Provably End-to-end Label-noise Learning without Anchor Points

Xuefeng Li, Tongliang Liu, Bo Han and
Gang Niu, Masashi Sugiyama

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

5:16

26/04/2020

Meta Dropout: Learning to Perturb Latent Features for Generalization

Hae Beom Lee, Taewook Nam, Eunho Yang, Sung Ju Hwang

Keywords Paper

0

1

0

0

4:46

14/06/2020

Transfer Learning From Synthetic to Real-Noise Denoising With Adaptive Instance Normalization

Yoonsik Kim, Jae Woong Soh, Gu Yong Park, Nam Ik Cho

Keywords Paper

real-noise, real-noise denoiser, real-noise denoising, transfer learning, adaptive denoiser, adaptive instance normalization, simulate noise denoiser, awgn denoiser

0

0

0

0

1:01

06/12/2021

Can we globally optimize cross-validation loss? Quasiconvexity in ridge regression

Will Stephenson, Zachary Frangella, Madeleine Udell, Tamara Broderick

Keywords Paper

theory, optimization, interpretability

0

0

0

0

14:48

05/01/2021

Do We Really Need Gold Samples for Sample Weighting Under Label Noise?

Aritra Ghosh, Andrew Lan

Keywords Paper

0

0

0

0

4:58

18/07/2021

Cumulants of Hawkes Processes are Robust to Observation Noise

William Trouleau, Jalal Etesami, Matt Grossglauser and
Negar Kiyavash, Patrick Thiran

Keywords Paper

Algorithms, Time Series and Sequences

0

0

0

0

5:01

06/12/2020

Noise2Same: Optimizing A Self-Supervised Bound for Image Denoising

Yaochen Xie, Zhengyang Wang, Shuiwang Ji

Keywords Paper

0

0

0

0

3:24

05/01/2021

Same Same but DifferNet: Semi-Supervised Defect Detection With Normalizing Flows

Marco Rudolph, Bastian Wandt, Bodo Rosenhahn

Keywords Paper

0

0

0

0

4:29

05/01/2021

Legacy Photo Editing With Learned Noise Prior

Yuzhi Zhao, Lai-Man Po, Tingyu Lin and
Xuehui Wang, Kangcheng Liu, Yujia Zhang, Wing-Yin Yu, Pengfei Xian, Jingjing Xiong

Keywords Paper

0

0

0

0

4:51

14/06/2020

Learning to Restore Low-Light Images via Decomposition-and-Enhancement

Ke Xu, Xin Yang, Baocai Yin, Rynson W.H. Lau

Keywords Paper

low-light, image restoration, image enhancement, convolutional neural networks

0

0

0

0

1:01

06/12/2020

Consistency Regularization for Certified Robustness of Smoothed Classifiers

Jongheon Jeong, Jinwoo Shin

Keywords Paper

0

0

0

0

3:16

19/08/2021

Federated Model Distillation with Noise-Free Differential Privacy

Lichao Sun, Lingjuan Lyu

Keywords Paper

Data Mining, Federated Learning, Privacy Preserving Data Mining, Multi-agent Learning, Trustable Learning

0

0

0

0

14:30

16/11/2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Jack Hessel, Lillian Lee

Keywords Paper

modeling interactions, multimodal tasks, visual answering, multimodal learning

0

0

0

0

12:02

02/02/2021

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

Qizhou Wang, Bo Han, Tongliang Liu and
Gang Niu, Jian Yang, Chen Gong

Keywords Paper

0

0

0

0

14:56

18/07/2021

Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

Zhaowei Zhu, Yiwen Song, Yang Liu

Keywords Paper

Deep Learning

0

0

0

0

5:24

03/05/2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve, Kevin Duarte, Yogesh S Rawat, Mubarak Shah

Keywords Paper

Deep Learning, Calibration, Uncertainty, Pseudo-Labeling, Semi-Supervised Learning

0

0

0

0

5:06