NAT: Noise-Aware Training for Robust Neural Sequence Labeling

04/07/2020

NAT: Noise-Aware Training for Robust Neural Sequence Labeling

Marcin Namysl, Sven Behnke, Joachim Köhler

Keywords: Noise-Aware Training, Robust Labeling, Sequence systems, noisy problem

Abstract Paper Similar Papers

Abstract: Sequence labeling systems should perform reliably not only under ideal conditions but also with corrupted inputs---as these systems often process user-generated text or follow an error-prone upstream component. To this end, we formulate the noisy sequence labeling problem, where the input may undergo an unknown noising process and propose two Noise-Aware Training (NAT) objectives that improve robustness of sequence labeling performed on perturbed input: Our data augmentation method trains a neural model using a mixture of clean and noisy samples, whereas our stability training algorithm encourages the model to create a noise-invariant latent representation. We employ a vanilla noise model at training time. For evaluation, we use both the original data and its variants perturbed with real OCR errors and misspellings. Extensive experiments on English and German named entity recognition benchmarks confirmed that NAT consistently improved robustness of popular sequence labeling models, preserving accuracy on the original input. We make our code and data publicly available for the research community.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Cunxiao Du, Zhaopeng Tu, Jing Jiang

Keywords Paper

Applications, Natural Language Processing

0

0

0

0

17:21

02/02/2021

Beyond Class-Conditional Assumption: A Primary Attempt to Combat Instance-Dependent Label Noise

Pengfei Chen, Junjie Ye, Guangyong Chen and
Jingwei Zhao, Pheng-Ann Heng

Keywords Paper

0

0

0

0

14:08

12/07/2020

Label-Noise Robust Domain Adaptation

Xiyu Yu, Tongliang Liu, Mingming Gong and
Kun Zhang, Kayhan Batmanghelich, Dacheng Tao

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

15:01

12/07/2020

Error-Bounded Correction of Noisy Labels

Songzhu Zheng, Pengxiang Wu, Aman Goswami and
Mayank Goswami, Dimitris Metaxas, Chao Chen

Keywords Paper

Deep Learning - Algorithms

1

1

1

1

12:34

05/12/2020

Towards a better understanding of label smoothing in neural machine translation

Yingbo Gao, Weiyue Wang, Christian Herold and
Zijian Yang, Hermann Ney

Keywords Paper

0

0

0

0

13:37

06/12/2021

Improving Deep Learning Interpretability by Saliency Guided Training

Aya Abdelsalam Ismail, Hector Corrada Bravo, Soheil Feizi

Keywords Paper

deep learning, transformers, vision, language, interpretability

0

0

0

0

10:45

06/12/2021

Revisiting Hilbert-Schmidt Information Bottleneck for Adversarial Robustness

Zifeng Wang, Tong Jian, Aria Masoomi and
Stratis Ioannidis, Jennifer Dy

Keywords Paper

deep learning, robustness, adversarial robustness and security

0

0

0

0

13:49

18/07/2021

A Unified Generative Adversarial Network Training via Self-Labeling and Self-Attention

Tomoki Watanabe, Paolo Favaro

Keywords Paper

Deep Learning, Generative Models, Applications, Matrix and Tensor Factorization, Algorithms, Collaborative Filtering; Algorithms, Large Scale Learning; Applications, Denoising

0

0

0

0

5:12

02/02/2021

Error-Correcting Output Codes with Ensemble Diversity for Robust Learning in Neural Networks

Yang Song, Qiyu Kang, Wee Peng Tay

Keywords Paper

0

0

0

0

15:04

16/11/2020

Local Additivity Based Data Augmentation for Semi-supervised NER

Jiaao Chen, Zhenghui Wang, Ran Tian and
Zichao Yang, Diyi Yang

Keywords Paper

named recognition, deep understanding, semi-supervised ner, entity learning

0

0

0

0

11:18

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

0

0

0

0

12:23

26/04/2020

Understanding Knowledge Distillation in Non-autoregressive Machine Translation

Chunting Zhou, Jiatao Gu, Graham Neubig

Keywords Paper

knowledge distillation, non-autoregressive neural machine translation

0

0

0

0

4:55

08/12/2020

Attentively Embracing Noise for Robust Latent Representation in BERT

Gwenaelle Cunha Sergio, Dennis Singh Moirangthem, Minho Lee

Keywords Paper

0

0

0

0

12:55

04/07/2020

Learning to Faithfully Rationalize by Construction

Sarthak Jain, Sarah Wiegreffe, Yuval Pinter, Byron C. Wallace

Keywords Paper

NLP, neural classification, training, automatic evaluations

0

0

0

0

11:55

06/12/2020

Posterior Re-calibration for Imbalanced Datasets

Junjiao Tian, Yen-Cheng Liu, Nathaniel Glaser and
Yen-Chang Hsu, Zsolt Kira

Keywords Paper

Algorithms -> Few-Shot Learning, Applications -> Computer Vision

0

0

0

0

3:23

05/01/2021

RNNP: A Robust Few-Shot Learning Approach

Pratik Mazumder, Pravendra Singh, Vinay P. Namboodiri

Keywords Paper

0

0

0

0

4:50

04/07/2020

Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling

Ouyu Lan, Xiao Huang, Bill Yuchen Lin and
He Jiang, Liyuan Liu, Xiang Ren

Keywords Paper

Sequence Labeling, natural problems, crowd annotation, multi-source learning

0

0

0

0

12:01

06/12/2020

Fourier-transform-based attribution priors improve the interpretability and stability of deep learning models for genomics

Alex Tseng, Avanti Shrikumar, Anshul Kundaje

Keywords Paper

0

0

0

0

3:21

16/11/2020

Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning

Tsvetomila Mihaylova, Vlad Niculae, André F. T. Martins

Keywords Paper

pipeline systems, ste, latent models, end-to-end training

0

0

0

0

11:50

18/07/2021

Non-Negative Bregman Divergence Minimization for Deep Direct Density Ratio Estimation

Masahiro Kato, Takeshi Teshima

Keywords Paper

Algorithms, Semi-Supervised Learning

0

0

0

0

5:17

26/04/2020

Improving Neural Language Generation with Spectrum Control

Lingxiao Wang, Jing Huang, Kevin Huang and
Ziniu Hu, Guangtao Wang, Quanquan Gu

Keywords Paper

0

0

0

0

4:58

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

13/04/2021

Feedback coding for active learning

Gregory Canal, Matthieu Bloch, Christopher Rozell

Keywords Paper

0

0

0

0

2:55

06/12/2021

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation

Tai-Yu Pan, Cheng Zhang, Yandong Li and
Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao

Keywords Paper

machine learning, vision

0

0

0

0

11:49

06/12/2021

A Framework to Learn with Interpretation

Jayneel Parekh, Pavlo Mozharovskyi, Florence d'Alché-Buc

Keywords Paper

deep learning, interpretability

0

0

0

0

14:05

06/12/2021

Learning with Labeling Induced Abstentions

Kareem Amin, Giulia DeSalvo, Afshin Rostamizadeh

Keywords Paper

machine learning, active learning

0

0

0

0

11:22

06/12/2020

Incorporating BERT into Parallel Sequence Decoding with Adapters

Junliang Guo, Zhirui Zhang, Linli Xu and
Hao-Ran Wei, Boxing Chen, Enhong Chen

Keywords Paper

0

0

0

0

3:17

26/04/2020

Learning Robust Representations via Multi-View Information Bottleneck

Marco Federici, Anjan Dutta, Patrick Forré and
Nate Kushman, Zeynep Akata

Keywords Paper

Information Bottleneck, Multi-View Learning, Representation Learning, Information Theory

0

0

0

0

4:56

18/07/2021

Composed Fine-Tuning: Freezing Pre-Trained Denoising Autoencoders for Improved Generalization

Sang Michael Xie, Tengyu Ma, Percy Liang

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

22:15

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan and
Bin Wang, Yinghui Xu, Rong Jin

Keywords Paper

0

0

0

0

15:15

14/06/2020

An Investigation Into the Stochasticity of Batch Whitening

Lei Huang, Lei Zhao, Yi Zhou and
Fan Zhu, Li Liu, Ling Shao

Keywords Paper

batch normalization, whitening, stochasticity analysis, conditioning, optimization, generalization, stochastic noise, deep learning, gans, classification

0

0

0

0

5:00

03/05/2021

Representation Learning for Sequence Data with Deep Autoencoding Predictive Components

Junwen Bai, Weiran Wang, Yingbo Zhou, Caiming Xiong

Keywords Paper

Unsupervised Learning, Mutual Information, Masked Reconstruction, Sequence Data

0

0

0

0

5:08

02/02/2021

Nested Named Entity Recognition with Partially-Observed TreeCRFs

Yao Fu, Chuanqi Tan, Mosha Chen and
Songfang Huang, Fei Huang

Keywords Paper

0

0

0

0

14:38

06/12/2021

Learning Causal Semantic Representation for Out-of-Distribution Prediction

Chang Liu, Xinwei Sun, Jindong Wang and
Haoyue Tang, Tao Li, Tao Qin, Wei Chen, Tie-Yan Liu

Keywords Paper

generative model, domain adaptation, representation learning

0

0

0

0

14:29

06/12/2020

Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning

Tao Han, Junyu Gao, Yuan Yuan, Qi Wang

Keywords Paper

0

0

0

0

3:22

06/12/2020

Unsupervised Data Augmentation for Consistency Training

Qizhe Xie, Zihang Dai, Eduard Hovy and
Thang Luong, Quoc V Le

Keywords Paper

0

0

0

0

3:29

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

04/07/2020

Knowledge Graph-Augmented Abstractive Summarization with Semantic-Driven Cloze Reward

Luyang Huang, Lingfei Wu, Lu Wang

Keywords Paper

Knowledge Summarization, abstractive summarization, semantic interpretation, generation summaries

0

0

0

0

12:01

06/12/2021

Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning

Hiroki Furuta, Tadashi Kozuno, Tatsuya Matsushima and
Yutaka Matsuo, Shixiang (Shane) Gu

Keywords Paper

reinforcement learning and planning

0

0

0

0

10:00

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

deep neural networks., instance-based label noise, Learning with noisy labels

0

0

0

0

5:18