Label Noise in Context

04/07/2020

Label Noise in Context

Michael Desmond, Catherine Finegan-Dollak, Jeff Boston, Matt Arnold

Keywords: Label Noise, manual remediation, correcting noise, noise remediation

Abstract Paper Similar Papers

Abstract: Label noise—incorrectly or ambiguously labeled training examples—can negatively impact model performance. Although noise detection techniques have been around for decades, practitioners rarely apply them, as manual noise remediation is a tedious process. Examples incorrectly flagged as noise waste reviewers’ time, and correcting label noise without guidance can be difficult. We propose LNIC, a noise-detection method that uses an example’s neighborhood within the training set to (a) reduce false positives and (b) provide an explanation as to why the ex- ample was flagged as noise. We demonstrate on several short-text classification datasets that LNIC outperforms the state of the art on measures of precision and F0.5-score. We also show how LNIC’s training set context helps a reviewer to understand and correct label noise in a dataset. The LNIC tool lowers the barriers to label noise remediation, increasing its utility for NLP practitioners.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Analysing the Noise Model Error for Realistic Noisy Label Data

Michael A. Hedderich, Dawei Zhu, Dietrich Klakow

Keywords Paper

0

0

0

0

15:11

19/08/2021

Few-Shot Partial-Label Learning

Yunfeng Zhao, Guoxian Yu, Lei Liu and
Zhongmin Yan, Lizhen Cui, Carlotta Domeniconi

Keywords Paper

Machine Learning, Multi-instance; Multi-label; Multi-view learning, Transfer, Adaptation, Multi-task Learning, Weakly Supervised Learning

0

0

0

0

14:12

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

deep neural networks., instance-based label noise, Learning with noisy labels

0

0

0

0

5:18

03/05/2021

How Benign is Benign Overfitting ?

Amartya Sanyal, Puneet Dokania, Varun Kanade, Philip Torr

Keywords Paper

generalization, memorization, benign overfitting, adversarial robustness

0

0

0

0

10:56

14/06/2020

Optical Flow in the Dark

Yinqiang Zheng, Mingfang Zhang, Feng Lu

Keywords Paper

low-light, optical flow, noise modeling, synthetic dataset, cnn

0

0

0

0

1:01

14/06/2020

Learning to Restore Low-Light Images via Decomposition-and-Enhancement

Ke Xu, Xin Yang, Baocai Yin, Rynson W.H. Lau

Keywords Paper

low-light, image restoration, image enhancement, convolutional neural networks

0

0

0

0

1:01

26/04/2020

Meta Dropout: Learning to Perturb Latent Features for Generalization

Hae Beom Lee, Taewook Nam, Eunho Yang, Sung Ju Hwang

Keywords Paper

0

1

0

0

4:46

06/12/2021

On the Algorithmic Stability of Adversarial Training

Yue Xing, Qifan Song, Guang Cheng

Keywords Paper

deep learning, adversarial robustness and security

0

0

0

0

13:00

03/05/2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve, Kevin Duarte, Yogesh S Rawat, Mubarak Shah

Keywords Paper

Deep Learning, Calibration, Uncertainty, Pseudo-Labeling, Semi-Supervised Learning

0

0

0

0

5:06

06/12/2020

Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang, Chao Zhang, Hongyang Zhang

Keywords Paper

Deep Learning -> Generative Models, Algorithms -> Semi-Supervised Learning

0

0

0

0

3:23

02/02/2021

Learning Precise Temporal Point Event Detection with Misaligned Labels

Julien Schroeter, Kirill Sidorov, David Marshall

Keywords Paper

0

0

0

0

21:24

06/12/2021

Open-set Label Noise Can Improve Robustness Against Inherent Label Noise

Hongxin Wei, Lue Tao, RENCHUNZI XIE, Bo An

Keywords Paper

deep learning, robustness

0

0

0

0

2:46

19/04/2021

Neural data-to-text generation with LM-based text augmentation

Ernie Chang, Xiaoyu Shen, Dawei Zhu and
Vera Demberg, Hui Su

Keywords Paper

0

0

0

0

7:32

02/02/2021

Learning to Purify Noisy Labels via Meta Soft Label Corrector

Yichen Wu, Jun Shu, Qi Xie and
Qian Zhao, Deyu Meng

Keywords Paper

0

0

0

0

13:01

18/11/2020

Boosting-based reliable model reuse

Yao-Xiang Ding, Zhi-Hua Zhou

Keywords Paper

1

1

0

0

11:59

14/06/2020

Distilling Effective Supervision From Severe Label Noise

Zizhao Zhang, Han Zhang, Sercan Ö. Arik and
Honglak Lee, Tomas Pfister

Keywords Paper

robust training, meta learning, noise labels

0

0

0

0

1:00

06/12/2021

Can Less be More? When Increasing-to-Balancing Label Noise Rates Considered Beneficial

Yang Liu, Jialu Wang

Keywords Paper

machine learning, fairness

0

0

0

0

14:49

12/07/2020

Improving generalization by controlling label-noise information in neural network weights

Hrayr Harutyunyan, Kyle Reing, Greg Ver Steeg, Aram Galstyan

Keywords Paper

Supervised Learning

0

0

0

0

14:01

04/07/2020

Noise-Based Augmentation Techniques for Emotion Datasets: What do we Recommend?

Mimansa Jaiswal, Emily Mower Provost

Keywords Paper

mental monitoring, educational diagnosis, hate classification, targeted advertising

0

0

0

0

9:20

06/12/2020

Universally Quantized Neural Compression

Eirikur Agustsson, Lucas Theis

Keywords Paper

0

0

0

0

3:03

06/12/2021

Instance-dependent Label-noise Learning under a Structural Causal Model

Yu Yao, Tongliang Liu, Mingming Gong and
Bo Han, Gang Niu, Kun Zhang

Keywords Paper

deep learning, causality

0

0

0

0

11:12

14/06/2020

A Physics-Based Noise Formation Model for Extreme Low-Light Raw Denoising

Kaixuan Wei, Ying Fu, Jiaolong Yang, Hua Huang

Keywords Paper

extreme low-light imaging, physics-based noise modeling, extreme low-light denoising dataset

0

0

0

0

4:58

05/01/2021

Do We Really Need Gold Samples for Sample Weighting Under Label Noise?

Aritra Ghosh, Andrew Lan

Keywords Paper

0

0

0

0

4:58

02/02/2021

Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning

Elad Amrani, Rami Ben-Ari, Daniel Rotman, Alex Bronstein

Keywords Paper

0

0

0

0

14:04

02/02/2021

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

Qizhou Wang, Bo Han, Tongliang Liu and
Gang Niu, Jian Yang, Chen Gong

Keywords Paper

0

0

0

0

14:56

02/02/2021

Learning from Noisy Labels with Complementary Loss Functions

Deng-Bao Wang, Yong Wen, Lujia Pan, Min-Ling Zhang

Keywords Paper

0

0

0

0

14:00

05/01/2021

Legacy Photo Editing With Learned Noise Prior

Yuzhi Zhao, Lai-Man Po, Tingyu Lin and
Xuehui Wang, Kangcheng Liu, Yujia Zhang, Wing-Yin Yu, Pengfei Xian, Jingjing Xiong

Keywords Paper

0

0

0

0

4:51

22/11/2021

PropMix: Hard Sample Filtering and Proportional MixUp for Learning with Noisy Labels

Filipe Rolim Cordeiro, Vasileios Belagiannis, Ian Reid, Gustavo Carneiro

Keywords Paper

noisy labels, noisy annotation, Mixup, hard samples, noisy samples, noisy training

0

0

0

0

3:01

18/07/2021

Learning Noise Transition Matrix from Only Noisy Labels via Total Variation Regularization

Yivan Zhang, Gang Niu, Masashi Sugiyama

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

19:29

06/12/2020

Consistency Regularization for Certified Robustness of Smoothed Classifiers

Jongheon Jeong, Jinwoo Shin

Keywords Paper

0

0

0

0

3:16

19/10/2020

Representative negative instance generation for online ad targeting

Yuhan Quan, Jingtao Ding, Depeng Jin and
Jianbo Yang, Xing Zhou, Yong Li

Keywords Paper

feature matching, adversarial learning, ad targeting, negative sampling

0

0

0

0

6:30

18/07/2021

Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

Zhaowei Zhu, Yiwen Song, Yang Liu

Keywords Paper

Deep Learning

0

0

0

0

5:24

06/12/2021

Data-Efficient Instance Generation from Instance Discrimination

Ceyuan Yang, Yujun Shen, Yinghao Xu, Bolei Zhou

Keywords Paper

machine learning, generative model

0

0

0

0

6:53

22/11/2021

Alleviating Noisy-label Effects in Image Classification via Probability Transition Matrix

Ziqi Zhang, Yuexiang Li, Hongxin Wei and
Kai Ma, Tao Xu, Yefeng Zheng

Keywords Paper

noisy labels, image classification, instance selection, robust learning, inter-class correlation, soft label, medical image

0

0

0

0

2:52

18/07/2021

Learning to Generate Noise for Multi-Attack Robustness

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords Paper

Applications, Privacy, Anonymity, and Security, Probabilistic Methods, MCMC, Algorithms, Adversarial Examples

0

0

0

0

5:12

03/05/2021

Noise against noise: stochastic label noise helps combat inherent label noise

Pengfei Chen, Guangyong Chen, Junjie Ye and
jingwei zhao, Pheng-Ann Heng

Keywords Paper

Regularization, SGD noise, Robust Learning, Noisy Labels

0

0

0

0

9:42

06/12/2020

Trust the Model When It Is Confident: Masked Model-based Actor-Critic

Feiyang Pan, Jia He, Dandan Tu, Qing He

Keywords Paper

0

0

0

0

2:57

03/05/2021

Multiscale Score Matching for Out-of-Distribution Detection

Ahsan Mahmood, Junier Oliva, Martin A Styner

Keywords Paper

out-of-distribution detection, deep learning, score matching, outlier detection

0

0

0

0

5:13

04/07/2020

Posterior Calibrated Training on Sentence Classification Tasks

Taehee Jung, Dongyeop Kang, Hua Cheng and
Lucas Mentch, Thomas Schaaf

Keywords Paper

Sentence Tasks, classifications, xSLUE, classification tasks

0

0

0

0

7:00

18/07/2021

Just Train Twice: Improving Group Robustness without Training Group Information

Evan Liu, Behzad Haghgoo, Annie Chen and
Aditi Raghunathan, Pang Wei Koh, Shiori Sagawa, Percy Liang, Chelsea Finn

Keywords Paper

Deep Learning

0

0

0

0

20:58