Abstract:
Morphological inflection, like many sequence-to-sequence tasks, sees strong performance from recurrent neural architectures when data is plentiful, but performance falls off sharply in lower-data settings. We investigate one aspect of neural seq2seq models that we hypothesize contributes to overfitting: teacher forcing. Because teacher forcing trains the model under conditions that differ from those at test time, the resulting exposure bias increases the likelihood that a system models its training data too closely. Experiments show that teacher-forced models struggle to recover once their predictions diverge from the gold sequences seen in training. However, a simple modification to the training algorithm that more closely mimics test conditions yields models that generalize better to unseen inputs.
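The abstract does not spell out the modification; one standard way to make training mimic test conditions is scheduled-sampling-style decoding, in which the decoder is sometimes fed its own previous prediction instead of the gold token. The sketch below is illustrative only, assuming a hypothetical GRU decoder; Seq2SeqDecoder, training_loss, and sample_prob are invented names, not the authors' code.

```python
import torch
import torch.nn as nn

class Seq2SeqDecoder(nn.Module):
    """Minimal GRU decoder, included only to keep the sketch self-contained."""
    def __init__(self, vocab_size: int, hidden_size: int):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden_size)
        self.cell = nn.GRUCell(hidden_size, hidden_size)
        self.out = nn.Linear(hidden_size, vocab_size)

    def step(self, token_ids: torch.Tensor, hidden: torch.Tensor):
        hidden = self.cell(self.embed(token_ids), hidden)
        return self.out(hidden), hidden

def training_loss(decoder, hidden, gold, bos_id=0, sample_prob=0.25):
    """Per-step cross-entropy over a gold sequence of shape (batch, tgt_len).

    With probability sample_prob the decoder is conditioned on its own
    greedy prediction rather than the gold token, so that training more
    closely mimics test-time decoding."""
    loss_fn = nn.CrossEntropyLoss()
    batch, tgt_len = gold.shape
    prev = torch.full((batch,), bos_id, dtype=torch.long)
    loss = torch.zeros(())
    for t in range(tgt_len):
        logits, hidden = decoder.step(prev, hidden)
        loss = loss + loss_fn(logits, gold[:, t])
        if torch.rand(1).item() < sample_prob:
            prev = logits.argmax(dim=-1).detach()  # model's own prediction
        else:
            prev = gold[:, t]                      # standard teacher forcing
    return loss / tgt_len

if __name__ == "__main__":
    dec = Seq2SeqDecoder(vocab_size=40, hidden_size=64)
    gold = torch.randint(0, 40, (8, 12))           # 8 toy examples, length 12
    loss = training_loss(dec, torch.zeros(8, 64), gold)
    loss.backward()
```

Setting sample_prob to 0 recovers standard teacher forcing; gradually raising it over training is a common annealing choice in scheduled-sampling variants.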