Few-shot learning through contextual data augmentation

Abstract: Machine translation (MT) models used in industries with constantly changing topics, such as translation or news agencies, need to adapt to new data to maintain their performance over time. Our aim is to teach a pre-trained MT model to translate previously unseen words accurately, based on very few examples. We propose (i) an experimental setup allowing us to simulate novel vocabulary appearing in human-submitted translations, and (ii) corresponding evaluation metrics to compare our approaches. We extend a data augmentation approach using a pretrained language model to create training examples with similar contexts for novel words. We compare different fine-tuning and data augmentation approaches and show that adaptation on the scale of one to five examples is possible. Combining data augmentation with randomly selected training sentences leads to the highest BLEU score and accuracy improvements. Impressively, with only 1 to 5 examples, our model reports better accuracy scores than a reference system trained with on average 313 parallel examples.

16/11/2020

Few-shot learning through contextual data augmentation

Farid Arthaud, Rachel Bawden, Alexandra Birch

Comments

Similar Papers

Pronoun-Targeted Fine-tuning for NMT with Hybrid Losses

Prathyusha Jwalapuram, Shafiq Joty, Youlin Shen

Keywords Abstract Paper

pronoun translations, pronoun translation, neural training, backtranslation

English intermediate-task training improves zero-shot cross-lingual transfer too

Jason Phang, Iacer Calixto, Phu Mon Htut and Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, Samuel R. Bowman

Keywords Abstract Paper

Transliteration of Judeo-Arabic Texts into Arabic Script Using Recurrent Neural Networks

Ori Terner, Kfir Bar, Nachum Dershowitz

Keywords Abstract Paper

Data-Efficient Paraphrase Generation to Bootstrap Intent Classification and Slot Labeling for New Features in Task-Oriented Dialog Systems

Shailza Jolly, Tobias Falke, Caglar Tirkaz, Daniil Sorokin

Keywords Abstract Paper

Deberta: Decoding-Enhanced Bert With Disentangled Attention

Pengcheng He, Xiaodong Liu, Jianfeng Gao, Weizhu Chen

Keywords Abstract Paper

Position Encoding, Attention, Natural Language Processing, Language Model Pre-training, Transformer

Uncertainty-aware Self-training for Few-shot Text Classification

Subhabrata Mukherjee, Ahmed Awadallah

Keywords Abstract Paper

A pairwise probe for understanding BERT fine-tuning on machine reading comprehension

Jie Cai, Zhengzhou Zhu, Ping Nie, Qian Liu

Keywords Abstract Paper

machine reading comprehension, pairwise, fine-tune, BERT

Word alignment by fine-tuning embeddings on parallel corpora

Zi-Yi Dou, Graham Neubig

Keywords Abstract Paper

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Abstract Paper

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Jixuan Wang, Kuan-Chieh Wang, Frank Rudzicz, Michael Brudno

Keywords Abstract Paper

machine learning, transformers, meta learning, language, transfer learning

Sequence-Level Mixed Sample Data Augmentation

Demi Guo, Yoon Kim, Alexander Rush

Keywords Abstract Paper

sequence-to-sequence problems, scan, semantic parsing, neural networks

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

yaxing wang, Lu Yu, Joost van de Weijer

Keywords Abstract Paper

Algorithms -> Online Learning, Optimization -> Stochastic Optimization

Effects of pre- and post-processing on type-based embeddings in lexical semantic change detection

Jens Kaiser, Sinan Kurtyigit, Serge Kotchourko, Dominik Schlechtweg

Keywords Abstract Paper

Learning Which Features Matter: RoBERTa Acquires a Preference for Linguistic Generalizations (Eventually)

Alex Warstadt, Yian Zhang, Xiaocheng Li and Haokun Liu, Samuel R. Bowman

Keywords Abstract Paper

self-supervised tasks, language understanding, ambiguous tasks, finetuning

Logic-Guided Data Augmentation and Regularization for Consistent Question Answering

Akari Asai, Hannaneh Hajishirzi

Keywords Abstract Paper

Logic-Guided Augmentation, Regularization, Consistent Answering, natural questions

The Right Tool for the Job: Matching Model and Instance Complexities

Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta and Jesse Dodge, Noah A. Smith

Keywords Abstract Paper

inference, early decisions, costly retraining, Job Model

SALNet: Semi-supervised Few-Shot Text Classification with Attention-based Lexicon Construction

Ju-Hyoung Lee, Sang-Ki Ko, Yo-Sub Han

Keywords Abstract Paper

Knowledge-driven Data Construction for Zero-shot Evaluation in Commonsense Question Answering

Kaixin Ma, Filip Ilievski, Jonathan Francis and Yonatan Bisk, Eric Nyberg, Alessandro Oltramari

Keywords Abstract Paper

Contextual Embeddings: When Are They Worth It?

Simran Arora, Avner May, Jian Zhang, Christopher Ré

Keywords Abstract Paper

Contextual Embeddings, deep embeddings, pretrained embeddings, GloVe

Partially-Aligned Data-to-Text Generation with Distant Supervision

Zihao Fu, Bei Shi, Wai Lam and Lidong Bing, Zhiyuan Liu

Keywords Abstract Paper

data-to-text task, generation task, dataset problem, over-generation problem

Automatically Paraphrasing via Sentence Reconstruction and Round-trip Translation

Zilu Guo, Zhongqiang Huang, Kenny Q. Zhu and Guandan Chen, Kaibo Zhang, Boxing Chen, Fei Huang

Keywords Abstract Paper

Natural Language Processing, Machine Translation, Natural Language Generation, NLP Applications and Tools

Bayesian Methods for Semi-supervised Text Annotation

Keywords Paper

Jason Phang, Iacer Calixto, Phu Mon Htut and
Yada Pruksachatkun, Haokun Liu, Clara Vania, Katharina Kann, Samuel R. Bowman

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Alex Warstadt, Yian Zhang, Xiaocheng Li and
Haokun Liu, Samuel R. Bowman

Keywords Paper

Keywords Paper

Roy Schwartz, Gabriel Stanovsky, Swabha Swayamdipta and
Jesse Dodge, Noah A. Smith

Keywords Paper

Keywords Paper

Kaixin Ma, Filip Ilievski, Jonathan Francis and
Yonatan Bisk, Eric Nyberg, Alessandro Oltramari

Keywords Paper

Keywords Paper

Zihao Fu, Bei Shi, Wai Lam and
Lidong Bing, Zhiyuan Liu

Keywords Paper

Zilu Guo, Zhongqiang Huang, Kenny Q. Zhu and
Guandan Chen, Kaibo Zhang, Boxing Chen, Fei Huang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zaixiang Zheng, Hao Zhou, Shujian Huang and
Jiajun Chen, Jingjing Xu, Lei Li

Keywords Paper

Jathushan Rajasegaran, Salman Khan, Munawar Hayat and
Fahad Shahbaz Khan, Mubarak Shah

Keywords Paper

Keywords Paper

Shuohang Wang, Yuwei Fang, Siqi Sun and
Zhe Gan, Yu Cheng, Jingjing Liu, Jing Jiang

Keywords Paper

Genta Indra Winata, Samuel Cahyawijaya, Zhaojiang Lin and
Zihan Liu, Peng Xu, Pascale Fung

Keywords Paper

Keywords Paper

Shuo Ren, Yu Wu, Shujie Liu and
Ming Zhou, Shuai Ma

Keywords Paper

Michael Cogswell, Jiasen Lu, Rishabh Jain and
Stefan Lee, Devi Parikh, Dhruv Batra

Keywords Paper

Keywords Paper

Keywords Paper

Sanyuan Chen, Yutai Hou, Yiming Cui and
Wanxiang Che, Ting Liu, Xiangzhan Yu

Keywords Paper

Keywords Paper