Training and Inference Methods for High-Coverage Neural Machine Translation

01/07/2020

Training and Inference Methods for High-Coverage Neural Machine Translation

Michael Yang, Yixin Liu, Rahul Mayuranath

Keywords:

Abstract Paper Similar Papers

Abstract: In this paper, we introduce a system built for the Duolingo Simultaneous Translation And Paraphrase for Language Education (STAPLE) shared task at the 4th Workshop on Neural Generation and Translation (WNGT 2020). We participated in the English-to-Japanese track with a Transformer model pretrained on the JParaCrawl corpus and fine-tuned in two steps on the JESC corpus and then the (smaller) Duolingo training corpus. First, during training, we find it is essential to deliberately expose the model to higher-quality translations more often during training for optimal translation performance. For inference, encouraging a small amount of diversity with Diverse Beam Search to improve translation coverage yielded marginal improvement over regular Beam Search. Finally, using an auxiliary filtering model to filter out unlikely candidates from Beam Search improves performance further. We achieve a weighted F1 score of 27.56% on our own test set, outperforming the STAPLE AWS translations baseline score of 4.31%.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL Workshops virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Future-Guided Incremental Transformer for Simultaneous Translation

Shaolei Zhang, Yang Feng, Liangyou Li

Keywords Paper

0

0

0

0

14:44

04/07/2020

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Keywords Paper

Neural training, NMT training, document-level training, NMT objective

0

0

0

0

6:48

16/11/2020

Pre-training Multilingual Neural Machine Translation by Leveraging Alignment Information

Zehui Lin, Xiao Pan, Mingxuan Wang and
Xipeng Qiu, Jiangtao Feng, Hao Zhou, Lei Li

Keywords Paper

machine mt, mt, rich mt, universal model

0

0

0

0

12:00

01/07/2020

Expand and Filter: CUNI and LMU Systems for the WNGT 2020 Duolingo Shared Task

Jindřich Libovický, Zdeněk Kasner, Jindřich Helcl, Ondřej Dušek

Keywords Paper

0

0

0

0

4:59

26/04/2020

Learning Hierarchical Discrete Linguistic Units from Visually-Grounded Speech

David Harwath, Wei-Ning Hsu, James Glass

Keywords Paper

visually-grounded speech, self-supervised learning, discrete representation learning, vision and language, vision and speech, hierarchical representation learning

0

0

0

0

13:42

01/07/2020

Exploring Model Consensus to Generate Translation Paraphrases

Zhenhao Li, Marina Fomicheva, Lucia Specia

Keywords Paper

0

0

0

0

8:19

01/07/2020

English-to-Japanese Diverse Translation by Combining Forward and Backward Outputs

Masahiro Kaneko, Aizhan Imankulova, Tosho Hirasawa, Mamoru Komachi

Keywords Paper

0

0

0

0

5:00

01/07/2020

The JHU Submission to the 2020 Duolingo Shared Task on Simultaneous Translation and Paraphrase for Language Education

Huda Khayrallah, Jacob Bremerman, Arya D. McCarthy and
Kenton Murray, Winston Wu, Matt Post

Keywords Paper

0

0

0

0

8:55

26/04/2020

Massively Multilingual Sparse Word Representations

Gábor Berend

Keywords Paper

sparse word representations, multilinguality, sparse coding

0

0

0

0

5:42

05/01/2021

Class-Wise Metric Scaling for Improved Few-Shot Classification

Ge Liu, Linglan Zhao, Wei Li and
Dashan Guo, Xiangzhong Fang

Keywords Paper

0

0

0

0

5:01

14/06/2020

JL-DCF: Joint Learning and Densely-Cooperative Fusion Framework for RGB-D Salient Object Detection

Keren Fu, Deng-Ping Fan, Ge-Peng Ji, Qijun Zhao

Keywords Paper

visual saliency, salient object detection, rgb-d, depth information, joint learning, dense connections, multi-modal features, feature fusion, deep learning, encoder-decoder

0

0

0

0

1:01

16/11/2020

Dynamic Data Selection and Weighting for Iterative Back-Translation

Zi-Yi Dou, Antonios Anastasopoulos, Graham Neubig

Keywords Paper

neural translation, neural nmt, nmt, domain adaptation

0

0

0

0

11:30

08/12/2020

Neural Machine Translation Models with Back-Translation for the Extremely Low-Resource Indigenous Language Bribri

Isaac Feldman, Rolando Coto-Solano

Keywords Paper

0

0

0

0

13:50

04/07/2020

Non-Linear Instance-Based Cross-Lingual Mapping for Non-Isomorphic Embedding Spaces

Goran Glavaš, Ivan Vulić

Keywords Paper

bilingual induction, Non-Linear Mapping, InstaMap, instance-based method

0

0

0

0

7:04

04/07/2020

A Retrieve-and-Rewrite Initialization Method for Unsupervised Machine Translation

Shuo Ren, Yu Wu, Shujie Liu and
Ming Zhou, Shuai Ma

Keywords Paper

Unsupervised Translation, translation, Retrieve-and-Rewrite Method, translation models

0

0

0

0

6:31

08/12/2020

Transliteration of Judeo-Arabic Texts into Arabic Script Using Recurrent Neural Networks

Ori Terner, Kfir Bar, Nachum Dershowitz

Keywords Paper

0

0

0

0

9:50

06/12/2020

DeepI2I: Enabling Deep Hierarchical Image-to-Image Translation by Transferring from GANs

yaxing wang, Lu Yu, Joost van de Weijer

Keywords Paper

Algorithms -> Online Learning, Optimization -> Stochastic Optimization

0

0

0

0

3:23

08/12/2020

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP

Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

Keywords Paper

0

0

0

0

14:39

16/11/2020

Plug and Play Autoencoders for Conditional Text Generation

Florian Mai, Nikolaos Pappas, Ivan Montero and
Noah A. Smith, James Henderson

Keywords Paper

conditional tasks, style transfer, style tasks, text autoencoders

0

0

0

0

9:23

18/07/2021

Self-supervised and Supervised Joint Training for Resource-rich Machine Translation

Yong Cheng, Wei Wang, Lu Jiang, Wolfgang Macherey

Keywords Paper

Applications, Natural Language Processing

0

0

0

0

5:21

05/01/2021

Few-Shot Learning via Feature Hallucination With Variational Inference

Qinxuan Luo, Lingfeng Wang, Jingguo Lv and
Shiming Xiang, Chunhong Pan

Keywords Paper

0

0

0

0

4:56

16/11/2020

Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement Learning

Yuncheng Hua, Yuan-Fang Li, Gholamreza Haffari and
Guilin Qi, Tongtong Wu

Keywords Paper

program induction, meta-training, cqa, neural approach

0

0

0

0

12:41

06/12/2020

Unsupervised Data Augmentation for Consistency Training

Qizhe Xie, Zihang Dai, Eduard Hovy and
Thang Luong, Quoc V Le

Keywords Paper

0

0

0

0

3:29

02/02/2021

Inferring Emotion from Large-scale Internet Voice Data: A Semi-supervised Curriculum Augmentation based Deep Learning Approach

Suping Zhou, Jia Jia, Zhiyong Wu and
Zhihan Yang, Yanfeng Wang, Wei Chen, Fanbo Meng, Shuo Huang, Jialie Shen, Xiaochuan Wang

Keywords Paper

0

0

0

0

17:24

04/07/2020

Classification-Based Self-Learning for Weakly Supervised Bilingual Lexicon Induction

Mladen Karan, Ivan Vulić, Anna Korhonen, Goran Glavaš

Keywords Paper

Weakly Induction, self-learning, induction CLWEs, bilingual induction

0

0

0

0

6:34

16/11/2020

On the Role of Supervision in Unsupervised Constituency Parsing

Haoyue Shi, Karen Livescu, Kevin Gimpel

Keywords Paper

few-shot parsing, unsupervised parsing, hyperparameter tuning, model selection

0

0

0

0

7:03

04/07/2020

AdvAug: Robust Adversarial Augmentation for Neural Machine Translation

Yong Cheng, Lu Jiang, Wolfgang Macherey, Jacob Eisenstein

Keywords Paper

Robust Augmentation, Neural Translation, Neural NMT, Neural

0

0

0

0

12:16

19/04/2021

The Gutenberg dialogue dataset

Richard Csaky, Gábor Recski

Keywords Paper

0

0

0

0

10:14

06/12/2020

Data Diversification: A Simple Strategy For Neural Machine Translation

Xuan-Phi Nguyen, Shafiq Joty, Kui Wu, Ai Ti Aw

Keywords Paper

0

0

0

0

3:23

02/02/2021

LRC-BERT: Latent-representation Contrastive Knowledge Distillation for Natural Language Understanding

Hao Fu, Shaojun Zhou, Qihong Yang and
Junjie Tang, Guiquan Liu, Kaikui Liu, Xiaolong Li

Keywords Paper

0

0

0

0

15:25

16/11/2020

LAReQA: Language-Agnostic Answer Retrieval from a Multilingual Pool

Uma Roy, Noah Constant, Rami Al-Rfou and
Aditya Barua, Aaron Phillips, Yinfei Yang

Keywords Paper

language-agnostic retrieval, cross-lingual tasks, cross-lingual retrieval, alignment

0

0

0

0

12:07

14/06/2020

EmotiCon: Context-Aware Multimodal Emotion Recognition Using Frege’s Principle

Trisha Mittal, Pooja Guhan, Uttaran Bhattacharya and
Rohan Chandra, Aniket Bera, Dinesh Manocha

Keywords Paper

affective computing, perceived emotions, context understanding, multimodal, inter-agent interactions, depth maps, deep learning, background, attention maps

0

0

0

0

1:00

13/04/2021

Feedback coding for active learning

Gregory Canal, Matthieu Bloch, Christopher Rozell

Keywords Paper

0

0

0

0

2:55

04/07/2020

From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains

Jan-Christoph Klie, Richard Eckart de Castilho, Iryna Gurevych

Keywords Paper

Human-In-The-Loop Linking, Entity linking, disambiguating mentions, annotation process

0

0

0

0

12:26

01/07/2020

Generating Diverse Translations via Weighted Fine-tuning and Hypotheses Filtering for the Duolingo STAPLE Task

Sweta Agrawal, Marine Carpuat

Keywords Paper

0

0

0

0

9:19

16/11/2020

MinTL: Minimalist Transfer Learning for Task-Oriented Dialogue Systems

Zhaojiang Lin, Andrea Madotto, Genta Indra Winata, Pascale Fung

Keywords Paper

dialogue tracking, dialogue generation, end-to-end generation, minimalist learning

0

0

0

0

7:48

30/11/2020

Meta-Learning with Context-Agnostic Initialisations

Toby Perrett, Alessandro Masullo, Tilo Burghardt and
Majid Mirmehdi, Dima Damen

Keywords Paper

0

0

0

0

7:42

03/05/2021

$i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

Kibok Lee, Yian Zhu, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin, Honglak Lee

Keywords Paper

self-supervised learning, unsupervised representation learning, data augmentation, MixUp, contrastive representation learning

0

0

0

0

5:04

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

0

0

0

0

12:23

23/06/2021

Web Question Answering with Neurosymbolic Program Synthesis

Qiaochu Chen, Aaron Lamoreaux, Xinyu Wang and
Greg Durrett, Osbert Bastani, Isil Dillig

Keywords Paper

Program Synthesis, Programming by Example, Web Information Extraction

0

0

0

0

15:15