Non-Autoregressive Machine Translation with Latent Alignments

Abstract: This paper presents two strong methods, CTC and Imputer, for non-autoregressive machine translation that model latent alignments with dynamic programming. We revisit CTC for machine translation and demonstrate that a simple CTC model can achieve state-of-the-art for single-step non-autoregressive machine translation, contrary to what prior work indicates. In addition, we adapt the Imputer model for non-autoregressive machine translation and demonstrate that Imputer with just 4 generation steps can match the performance of an autoregressive Transformer baseline. Our latent alignment models are simpler than many existing non-autoregressive translation baselines; for example, we do not require target length prediction or re-scoring with an autoregressive model. On the competitive WMT′14 En$i̊ghtarrow$De task, our CTC model achieves 25.7 BLEU with a single generation step, while Imputer achieves 27.5 BLEU with 2 generation steps, and 28.0 BLEU with 4 generation steps. This compares favourably to the autoregressive Transformer baseline at 27.8 BLEU.

05/12/2020

counting automata, regular expression matching, Antimirov’s derivatives, bounded repetition, determinization, counting-set automata, ReDos

14:18

04/07/2020

reference-based super-resolution, self-similarity super-resolution, deformable convolution, non-local block, single-image super-resolution, perceptual-oriented super-resolution

1:01

04/07/2020

Non-Autoregressive Machine Translation with Latent Alignments

Chitwan Saharia, William Chan, Saurabh Saxena, Mohammad Norouzi

Comments

Similar Papers

Mixed-lingual pre-training for cross-lingual summarization

Ruochen Xu, Chenguang Zhu, Yu Shi and Michael Zeng, Xuedong Huang

Keywords Abstract Paper

Stepwise Extractive Summarization and Planning with Structured Transformers

Shashi Narayan, Joshua Maynez, Jakub Adamek and Daniele Pighin, Blaz Bratanic, Ryan McDonald

Keywords Abstract Paper

extractive summarization, stepwise summarization, sentence filtering, rotowire generation

Enriching non-autoregressive transformer with syntactic and semantic structures for neural machine translation

Ye Liu, Yao Wan, Jianguo Zhang and Wenting Zhao, Philip Yu

Keywords Abstract Paper

Lexically Constrained Neural Machine Translation with Levenshtein Transformer

Raymond Hendy Susanto, Shamil Chollampatt, Liling Tan

Keywords Abstract Paper

Lexically Translation, neural translation, Levenshtein Transformer, beam decoding

A Probabilistic Formulation of Unsupervised Text Style Transfer

Junxian He, Xinyi Wang, Graham Neubig, Taylor Berg-Kirkpatrick

Keywords Abstract Paper

unsupervised text style transfer, deep latent sequence model

Iterative Refinement in the Continuous Space for Non-Autoregressive Neural Machine Translation

Jason Lee, Raphael Shu, Kyunghyun Cho

Keywords Abstract Paper

non-autoregressive translation, translation, machine translation, inference procedure

The Simple Essence of Algebraic Subtyping: Principal Type Inference with Subtyping Made Easy (Functional Pearl)

Lionel Parreaux

Keywords Abstract Paper

subtyping, principal types, type inference

CompCertM: CompCert with C-Assembly Linking and Lightweight Modular Verification

Youngju Song, Minki Cho, Dongjoo Kim and Yonghyun Kim, Jeehoon Kang, Chung-Kil Hur

Keywords Abstract Paper

Multi-Language Linking, Compositional Compiler Verification, CompCert

Incorporating BERT into Parallel Sequence Decoding with Adapters

Junliang Guo, Zhirui Zhang, Linli Xu and Hao-Ran Wei, Boxing Chen, Enhong Chen

Keywords Abstract Paper

Dual-decoder Transformer for Joint Automatic Speech Recognition and Multilingual Speech Translation

Hang Le, Juan Pino, Changhan Wang and Jiatao Gu, Didier Schwab, Laurent Besacier

Keywords Abstract Paper

Embarrassingly Simple Unsupervised Aspect Extraction

Stéphan Tulkens, Andreas van Cranenburgh

Keywords Abstract Paper

Embarrassingly Extraction, aspect identification, sentiment analysis, aspect extraction

Regex Matching with Counting-Set Automata

Lenka Turoňová, Lukáš Holík, Ondřej Lengál and Olli Saarikivi, Margus Veanes, Tomáš Vojnar

Keywords Abstract Paper

counting automata, regular expression matching, Antimirov’s derivatives, bounded repetition, determinization, counting-set automata, ReDos

Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation

Arya D. McCarthy, Xian Li, Jiatao Gu, Ning Dong

Keywords Abstract Paper

Variational Translation, posterior collapse, auxiliary task, uncertainty

Scalable validation of binary lifters

Sandeep Dasgupta, Sushant Dinesh, Deepan Venkatesh and Vikram S. Adve, Christopher W. Fletcher

Keywords Abstract Paper

Compiler Optimizations, Formal Semantics, Translation Validation, Graph Isomorphism, x86-64, LLVM IR

Abstract Extensionality: On the Properties of Incomplete Abstract Interpretations

Roberto Bruni, Roberto Giacobazzi, Roberta Gori and Isabel Garcia-Contreras, Dusko Pavlovic

Keywords Abstract Paper

Abstract Interpretation, Intensionality, Obfuscation, Extensionality

IOT: Instance-wise Layer Reordering for Transformer Structures

Jinhua Zhu, Lijun Wu, Yingce Xia and Shufang Xie, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

Keywords Abstract Paper

Transformers, Instance-wise Learning, Layer order

Aligned Cross Entropy for Non-Autoregressive Machine Translation

Marjan Ghazvininejad, Vladimir Karpukhin, Luke Zettlemoyer, Omer Levy

Keywords Abstract Paper

Applications - Language, Speech and Dialog

OrigamiNet: Weakly-Supervised, Segmentation-Free, One-Step, Full Page Text Recognition by learning to unfold

Mohamed Yousef, Tom E. Bishop

Keywords Abstract Paper

text recognition, weakly supervised, handwriting recognition, convolutional neural network fully convolutional, ctc

Smooth Bilevel Programming for Sparse Regularization

Clarice Poon, Gabriel Peyré

Keywords Abstract Paper

machine learning

Language-Agnostic Representation Learning of Source Code from Structure and Context

Daniel Zügner, Tobias Kirschstein, Michele Catasta and Jure Leskovec, Stephan Günnemann

Keywords Abstract Paper

code summarization, machine learning for code

Ruochen Xu, Chenguang Zhu, Yu Shi and
Michael Zeng, Xuedong Huang

Keywords Paper

Shashi Narayan, Joshua Maynez, Jakub Adamek and
Daniele Pighin, Blaz Bratanic, Ryan McDonald

Keywords Paper

Ye Liu, Yao Wan, Jianguo Zhang and
Wenting Zhao, Philip Yu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Youngju Song, Minki Cho, Dongjoo Kim and
Yonghyun Kim, Jeehoon Kang, Chung-Kil Hur

Keywords Paper

Junliang Guo, Zhirui Zhang, Linli Xu and
Hao-Ran Wei, Boxing Chen, Enhong Chen

Keywords Paper

Hang Le, Juan Pino, Changhan Wang and
Jiatao Gu, Didier Schwab, Laurent Besacier

Keywords Paper

Keywords Paper

Lenka Turoňová, Lukáš Holík, Ondřej Lengál and
Olli Saarikivi, Margus Veanes, Tomáš Vojnar

Keywords Paper

Keywords Paper

Sandeep Dasgupta, Sushant Dinesh, Deepan Venkatesh and
Vikram S. Adve, Christopher W. Fletcher

Keywords Paper

Roberto Bruni, Roberto Giacobazzi, Roberta Gori and
Isabel Garcia-Contreras, Dusko Pavlovic

Keywords Paper

Jinhua Zhu, Lijun Wu, Yingce Xia and
Shufang Xie, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Daniel Zügner, Tobias Kirschstein, Michele Catasta and
Jure Leskovec, Stephan Günnemann

Keywords Paper

Keywords Paper

Chenfeng Miao, Liang Shuang, Zhengchen Liu and
Chen Minchuan, Jun Ma, Shaojun Wang, Jing Xiao

Keywords Paper

Aditya Siddhant, Ankur Bapna, Yuan Cao and
Orhan Firat, Mia Chen, Sneha Kudugunta, Naveen Arivazhagan, Yonghui Wu

Keywords Paper

Keywords Paper

Xiao Li, Chenghua Lin, Ruizhe Li and
Chaozheng Wang, Frank Guerin

Keywords Paper

Gustavo Hernandez Abrego, Bowen Liang, Wei Wang and
Zarana Parekh, Yinfei Yang, Yunhsuan Sung

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jungo Kasai, Nikolaos Pappas, Hao Peng and
James Cross, Noah Smith

Keywords Paper

Keywords Paper

Hao Peng, Nikolaos Pappas, Dani Yogatama and
Roy Schwartz, Noah Smith, Lingpeng Kong

Keywords Paper

Keywords Paper

Keywords Paper

Matthieu Sozeau, Simon Boulier, Yannick Forster and
Nicolas Tabareau, Théo Winterhalter

Keywords Paper