Learning a Multi-Domain Curriculum for Neural Machine Translation

04/07/2020

Learning a Multi-Domain Curriculum for Neural Machine Translation

Wei Wang, Ye Tian, Jiquan Ngiam, Yinfei Yang, Isaac Caswell, Zarana Parekh

Keywords: Neural Translation, data selection, machine translation, multi-domain curriculum

Abstract Paper Similar Papers

Abstract: Most data selection research in machine translation focuses on improving a single domain. We perform data selection for multiple domains at once. This is achieved by carefully introducing instance-level domain-relevance features and automatically constructing a training curriculum to gradually concentrate on multi-domain relevant and noise-reduced data batches. Both the choice of features and the use of curriculum are crucial for balancing and improving all domains, including out-of-domain. In large-scale experiments, the multi-domain curriculum simultaneously reaches or outperforms the individual performance and brings solid gains over no-curriculum training.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

IMoJIE: Iterative Memory-Based Joint Open Information Extraction

Keshav Kolluru, Samarth Aggarwal, Vipul Rathore and
Mausam -, Soumen Chakrabarti

Keywords Paper

Iterative Extraction, Open Extraction, IMoJIE, Iterative

0

0

0

0

9:31

06/12/2021

PLUR: A Unifying, Graph-Based View of Program Learning, Understanding, and Repair

Zimin Chen, Vincent J Hellendoorn, Pascal Lamblin and
Petros Maniatis, Pierre-Antoine Manzagol, Daniel Tarlow, Subhodeep Moitra

Keywords Paper

deep learning, machine learning, transformers, graph learning

0

0

0

0

5:59

06/12/2021

Learning to Combine Per-Example Solutions for Neural Program Synthesis

Disha Shrivastava, Hugo Larochelle, Daniel Tarlow

Keywords Paper

deep learning

0

0

0

0

7:54

04/07/2020

Uncertainty-Aware Curriculum Learning for Neural Machine Translation

Yikai Zhou, Baosong Yang, Derek F. Wong and
Yu Wan, Lidia S. Chao

Keywords Paper

Neural Translation, assessment difficulty, translation tasks, Uncertainty-Aware Learning

0

0

0

0

8:20

04/07/2020

Deep Contextualized Self-training for Low Resource Dependency Parsing

Guy Rotman, Roi Reichart

Keywords Paper

Low Parsing, sequence tasks, Deep Self-training, Neural parsing

0

0

0

0

11:41

03/05/2021

MODALS: Modality-agnostic Automated Data Augmentation in the Latent Space

Tsz Him Cheung, Dit-Yan Yeung

Keywords Paper

automated data augmentation, deep learning, data augmentation, latent space

0

0

0

0

5:11

26/04/2020

Reducing Transformer Depth on Demand with Structured Dropout

Angela Fan, Edouard Grave, Armand Joulin

Keywords Paper

reduction, regularization, pruning, dropout, transformer

0

0

0

0

5:01

03/05/2021

$i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

Kibok Lee, Yian Zhu, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin, Honglak Lee

Keywords Paper

self-supervised learning, unsupervised representation learning, data augmentation, MixUp, contrastive representation learning

0

0

0

0

5:04

04/07/2020

Simple and Effective Retrieve-Edit-Rerank Text Generation

Nabil Hossain, Marjan Ghazvininejad, Luke Zettlemoyer

Keywords Paper

Retrieve-Edit-Rerank Generation, candidate selection, Retrieve-and-edit methods, post-generation approach

0

0

0

0

6:51

04/07/2020

Generalizing Natural Language Analysis through Span-relation Representations

Zhengbao Jiang, Wei Xu, Jun Araki, Graham Neubig

Keywords Paper

Natural Analysis, Natural processing, dependency parsing, semantic labeling

0

0

0

0

8:30

03/05/2021

BUSTLE: Bottom-Up Program Synthesis Through Learning-Guided Exploration

Augustus Odena, Kensen Shi, David Bieber and
Rishabh Singh, Charles Sutton, Hanjun Dai

Keywords Paper

Program Synthesis

0

0

0

0

10:26

16/11/2020

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong

Keywords Paper

document-level translation, translations, document-level model, selection module

0

0

0

0

11:36

04/07/2020

Good-Enough Compositional Data Augmentation

Jacob Andreas

Keywords Paper

Good-Enough Augmentation, diagnostic tasks, semantic task, data protocol

0

0

0

0

11:31

16/11/2020

Self-Supervised Knowledge Triplet Learning for Zero-Shot Question Answering

Pratyay Banerjee, Chitta Baral

Keywords Paper

data annotation, knowledge learning, knowledge, self-supervised task

0

0

0

0

11:16

12/07/2020

Efficient Domain Generalization via Common-Specific Low-Rank Decomposition

Vihari Piratla, Praneeth Netrapalli, Sunita Sarawagi

Keywords Paper

Supervised Learning

0

0

0

0

14:51

18/07/2021

Data Augmentation for Meta-Learning

Renkun Ni, Micah Goldblum, Amr Sharaf and
Kezhi Kong, Tom Goldstein

Keywords Paper

Deep Learning

0

0

0

0

5:09

26/04/2020

Domain Adaptive Multibranch Networks

Róger Bermúdez-Chacón, Mathieu Salzmann, Pascal Fua

Keywords Paper

Domain Adaptation, Computer Vision

0

0

0

0

5:26

13/04/2021

Learn to expect the unexpected: Probably approximately correct domain generalization

Vikas Garg, Adam Tauman Kalai, Katrina Ligett, Steven Wu

Keywords Paper

0

0

0

0

3:01

18/07/2021

Dataset Condensation with Differentiable Siamese Augmentation

Bo Zhao, Hakan Bilen

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

5:02

05/01/2021

Deep Unsupervised Anomaly Detection

Tangqing Li, Zheng Wang, Siying Liu, Wen-Yan Lin

Keywords Paper

0

0

0

0

5:00

06/12/2021

Adaptive Risk Minimization: Learning to Adapt to Domain Shift

Marvin Zhang, Henrik Marklund, Nikita Dhawan and
Abhishek Gupta, Sergey Levine, Chelsea Finn

Keywords Paper

machine learning, robustness, vision, domain adaptation

0

0

0

0

9:30

12/07/2020

Improving Transformer Optimization Through Better Initialization

Xiao Shi Huang, Felipe Perez, Jimmy Ba, Maksims Volkovs

Keywords Paper

Sequential, Network, and Time-Series Modeling

0

0

0

0

14:52

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

04/07/2020

Learning to Contextually Aggregate Multi-Source Supervision for Sequence Labeling

Ouyu Lan, Xiao Huang, Bill Yuchen Lin and
He Jiang, Liyuan Liu, Xiang Ren

Keywords Paper

Sequence Labeling, natural problems, crowd annotation, multi-source learning

0

0

0

0

12:01

02/02/2021

Finding Sparse Structures for Domain Specific Neural Machine Translation

Jianze Liang, Chengqi Zhao, Mingxuan Wang and
Xipeng Qiu, Lei Li

Keywords Paper

0

0

0

0

14:45

03/05/2021

Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral

Lucio Dery, Yann Dauphin, David Grangier

Keywords Paper

multitask learning, deeplearning, pre-training, gradient decomposition

0

0

0

0

5:22

06/12/2020

SuperLoss: A Generic Loss for Robust Curriculum Learning

Thibault Castells, Philippe Weinzaepfel, Jerome Revaud

Keywords Paper

, Probabilistic Methods -> MCMC

0

0

0

0

3:26

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

16/11/2020

Self-Paced Learning for Neural Machine Translation

Yu Wan, Baosong Yang, Derek F. Wong and
Yikai Zhou, Lidia S. Chao, Haibo Zhang, Boxing Chen

Keywords Paper

neural, curriculum learning, translation tasks, nmt

0

0

0

0

6:03

14/06/2020

OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold

Mohamed Yousef, Tom E. Bishop

Keywords Paper

text recognition, weakly supervised, handwriting recognition, convolutional neural network fully convolutional, ctc

0

0

0

0

1:00

08/12/2020

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

Chen Xu, Bojie Hu, Yufan Jiang and
Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

Keywords Paper

0

0

0

0

13:28

06/12/2021

Batch Active Learning at Scale

Gui Citovsky, Giulia DeSalvo, Claudio Gentile and
Lazaros Karydas, Anand Rajagopalan, Afshin Rostamizadeh, Sanjiv Kumar

Keywords Paper

active learning

0

0

0

0

12:19

13/04/2021

A theory of multiple-source adaptation with limited target labeled data

Yishay Mansour, Mehryar Mohri, Jae Ro and
Ananda Theertha Suresh, Ke Wu

Keywords Paper

0

0

0

0

2:39

06/12/2021

Neural Program Generation Modulo Static Analysis

Rohan Mukherjee, Yeming Wen, Dipak Chaudhari and
Thomas Reps, Swarat Chaudhuri, Christopher Jermaine

Keywords Paper

deep learning, transformers, generative model

0

0

0

0

14:58

05/01/2021

Few-Shot Learning via Feature Hallucination With Variational Inference

Qinxuan Luo, Lingfeng Wang, Jingguo Lv and
Shiming Xiang, Chunhong Pan

Keywords Paper

0

0

0

0

4:56

03/05/2021

Dataset Condensation with Gradient Matching

Bo ZHAO, Konda Reddy Mopuri, Hakan Bilen

Keywords Paper

dataset condensation, image generation, data-efficient learning

0

0

0

0

15:09

06/12/2020

Fourier-transform-based attribution priors improve the interpretability and stability of deep learning models for genomics

Alex Tseng, Avanti Shrikumar, Anshul Kundaje

Keywords Paper

0

0

0

0

3:21

02/02/2021

Copy That! Editing Sequences by Copying Spans

Sheena Panthaplackel, Miltiadis Allamanis, Marc Brockschmidt

Keywords Paper

0

0

0

0

19:25

06/12/2021

Referring Transformer: A One-step Approach to Multi-task Visual Grounding

Muchen Li, Leonid Sigal

Keywords Paper

transformers, vision

0

0

0

0

7:54

14/06/2020

SLV: Spatial Likelihood Voting for Weakly Supervised Object Detection

Ze Chen, Zhihang Fu, Rongxin Jiang and
Yaowu Chen, Xian-Sheng Hua

Keywords Paper

object detection, weakly supervised, spatial likelihood, multi-task learning

0

0

0

0

1:01