Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation

04/07/2020

Does Multi-Encoder Help? A Case Study on Context-Aware Neural Machine Translation

Bei Li, Hui Liu, Ziyang Wang, Yufan Jiang, Tong Xiao, Jingbo Zhu, Tongran Liu, Changliang Li

Keywords: Context-Aware Translation, document-level translation, document-level NMT, document-level

Abstract Paper Similar Papers

Abstract: In encoder-decoder neural models, multiple encoders are in general used to represent the contextual information in addition to the individual sentence. In this paper, we investigate multi-encoder approaches in document-level neural machine translation (NMT). Surprisingly, we find that the context encoder does not only encode the surrounding sentences but also behaves as a noise generator. This makes us rethink the real benefits of multi-encoder in context-aware translation - some of the improvements come from robust training. We compare several methods that introduce noise and/or well-tuned dropout setup into the training of these encoders. Experimental results show that noisy training plays an important role in multi-encoder-based NMT, especially when the training data is small. Also, we establish a new state-of-the-art on IWSLT Fr-En task by careful use of noise generation and dropout methods.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

26/04/2020

Revisiting Self-Training for Neural Sequence Generation

Junxian He, Jiatao Gu, Jiajun Shen, Marc'Aurelio Ranzato

Keywords Paper

self-training, semi-supervised learning, neural sequence generatioin

0

0

0

0

5:07

19/04/2021

Quantifying appropriateness of summarization data for curriculum learning

Ryuji Kano, Takumi Takahashi, Toru Nishino and
Motoki Taniguchi, Tomoki Taniguchi, Tomoko Ohkuma

Keywords Paper

0

0

0

0

5:13

04/07/2020

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight

Hengyi Cai, Hongshen Chen, Yonghao Song and
Cheng Zhang, Xiaofang Zhao, Dawei Yin

Keywords Paper

Data Manipulation, Neural Generation, learning, dialogue generation

0

0

0

1

9:39

05/01/2021

Noise as a Resource for Learning in Knowledge Distillation

Elahe Arani, Fahad Sarfraz, Bahram Zonooz

Keywords Paper

0

0

0

0

4:45

26/04/2020

Robust training with ensemble consensus

Jisoo Lee, Sae-Young Chung

Keywords Paper

Annotation noise, Noisy label, Robustness, Ensemble, Perturbation

0

0

0

0

5:04

06/12/2020

Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang, Chao Zhang, Hongyang Zhang

Keywords Paper

Deep Learning -> Generative Models, Algorithms -> Semi-Supervised Learning

0

0

0

0

3:23

02/02/2021

Denoising Distantly Supervised Named Entity Recognition via a Hypergeometric Probabilistic Model

Wenkai Zhang, Hongyu Lin, Xianpei Han and
Le Sun, Huidan Liu, Zhicheng Wei, Nicholas Yuan

Keywords Paper

0

0

0

0

19:22

02/02/2021

Tackling Instance-Dependent Label Noise via a Universal Probabilistic Model

Qizhou Wang, Bo Han, Tongliang Liu and
Gang Niu, Jian Yang, Chen Gong

Keywords Paper

0

0

0

0

14:56

18/07/2021

Learning to Generate Noise for Multi-Attack Robustness

Divyam Madaan, Jinwoo Shin, Sung Ju Hwang

Keywords Paper

Applications, Privacy, Anonymity, and Security, Probabilistic Methods, MCMC, Algorithms, Adversarial Examples

0

0

0

0

5:12

18/07/2021

Clusterability as an Alternative to Anchor Points When Learning with Noisy Labels

Zhaowei Zhu, Yiwen Song, Yang Liu

Keywords Paper

Deep Learning

0

0

0

0

5:24

02/02/2021

Meta Label Correction for Noisy Label Learning

Guoqing Zheng, Ahmed Hassan Awadallah, Susan Dumais

Keywords Paper

0

0

0

0

20:16

22/11/2021

Alleviating Noisy-label Effects in Image Classification via Probability Transition Matrix

Ziqi Zhang, Yuexiang Li, Hongxin Wei and
Kai Ma, Tao Xu, Yefeng Zheng

Keywords Paper

noisy labels, image classification, instance selection, robust learning, inter-class correlation, soft label, medical image

0

0

0

0

2:52

02/02/2021

Improving Model Robustness by Adaptively Correcting Perturbation Levels with Active Queries

Kun-Peng Ning, Lue Tao, Songcan Chen, Sheng-Jun Huang

Keywords Paper

0

1

0

0

16:10

16/11/2020

Feature Adaptation of Pre-Trained Language Models across Languages and Domains with Robust Self-Training

Hai Ye, Qingyu Tan, Ruidan He and
Juntao Li, Hwee Tou Ng, Lidong Bing

Keywords Paper

unsupervised adaptation, self-training, pre-trained models, bert

0

0

0

0

10:33

03/05/2021

Robust Curriculum Learning: from clean label detection to noisy label self-correction

Tianyi Zhou, Shengjie Wang, Jeff Bilmes

Keywords Paper

neural networks, curriculum learning, training dynamics, robust learning, noisy label

0

0

0

0

5:02

14/06/2020

Transfer Learning From Synthetic to Real-Noise Denoising With Adaptive Instance Normalization

Yoonsik Kim, Jae Woong Soh, Gu Yong Park, Nam Ik Cho

Keywords Paper

real-noise, real-noise denoiser, real-noise denoising, transfer learning, adaptive denoiser, adaptive instance normalization, simulate noise denoiser, awgn denoiser

0

0

0

0

1:01

02/02/2021

Decision-Guided Weighted Automata Extraction from Recurrent Neural Networks

Xiyue Zhang, Xiaoning Du, Xiaofei Xie and
Lei Ma, Yang Liu, Meng Sun

Keywords Paper

0

0

0

0

16:44

16/11/2020

Improving Grammatical Error Correction Models with Purpose-Built Adversarial Examples

Lihao Wang, Xiaoqing Zheng

Keywords Paper

grammatical correction, sequence-to-sequence learning, neural networks, gec

0

0

0

0

11:40

26/04/2020

Meta Dropout: Learning to Perturb Latent Features for Generalization

Hae Beom Lee, Taewook Nam, Eunho Yang, Sung Ju Hwang

Keywords Paper

0

1

0

0

4:46

06/12/2020

Unsupervised Data Augmentation for Consistency Training

Qizhe Xie, Zihang Dai, Eduard Hovy and
Thang Luong, Quoc V Le

Keywords Paper

0

0

0

0

3:29

03/05/2021

Learning with Instance-Dependent Label Noise: A Sample Sieve Approach

Hao Cheng, Zhaowei Zhu, Xingyu Li and
Yifei Gong, Xing Sun, Yang Liu

Keywords Paper

deep neural networks., instance-based label noise, Learning with noisy labels

0

0

0

0

5:18

18/11/2020

Boosting-based reliable model reuse

Yao-Xiang Ding, Zhi-Hua Zhou

Keywords Paper

1

1

0

0

11:59

06/12/2021

Discrete-Valued Neural Communication

Dianbo Liu, Alex Lamb, Kenji Kawaguchi and
Anirudh Goyal ALIAS PARTH GOYAL, Chen Sun, Michael Mozer, Yoshua Bengio

Keywords Paper

deep learning, robustness, transformers, generative model, graph learning

0

0

0

0

11:09

06/12/2021

Learning with Noisy Correspondence for Cross-modal Matching

Zhenyu Huang, Guocheng Niu, Xiao Liu and
Wenbiao Ding, Xinyan Xiao, Hua Wu, Xi Peng

Keywords Paper

deep learning, language

0

0

0

0

15:11

12/07/2020

Training Binary Neural Networks through Learning with Noisy Supervision

Kai Han, Yunhe Wang, Yixing Xu and
Chunjing Xu, Enhua Wu, Chang Xu

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

12:34

06/12/2021

Neural Analysis and Synthesis: Reconstructing Speech from Self-Supervised Representations

Hyeong-Seok Choi, Juheon Lee, Wansoo Kim and
Jie Lee, Hoon Heo, Kyogu Lee

Keywords Paper

0

0

0

0

11:14

02/02/2021

Noise Estimation Using Density Estimation for Self-Supervised Multimodal Learning

Elad Amrani, Rami Ben-Ari, Daniel Rotman, Alex Bronstein

Keywords Paper

0

0

0

0

14:04

13/04/2021

Robust imitation learning from noisy demonstrations

Voot Tangkaratt, Nontawat Charoenphakdee, Masashi Sugiyama

Keywords Paper

0

0

0

0

2:31

06/12/2020

Listening to Sounds of Silence for Speech Denoising

Henry Xu, Rundi Wu, Yuko Ishiwaka and
Carl Vondrick, Changxi Zheng

Keywords Paper

0

0

0

0

3:22

19/08/2021

Multi-Scale Selective Feedback Network with Dual Loss for Real Image Denoising

Xiaowan Hu, Yuanhao Cai, Zhihong Liu and
Haoqian Wang, Yulun Zhang

Keywords Paper

Computer Vision, Computational Photography, Photometry, Shape from X, Deep Learning

0

0

0

0

9:52

19/04/2021

Joint energy-based model training for better calibrated natural language understanding models

Tianxing He, Bryan McCann, Caiming Xiong, Ehsan Hosseini-Asl

Keywords Paper

0

0

0

0

5:58

14/09/2020

Incremental training of a recurrent neural network exploiting a multi-scale dynamic memory

Antonio Carta, Alessandro Sperduti, Davide Bacciu

Keywords Paper

recurrent neural networks, linear dynamical systems, incremental learning

0

0

0

0

15:12

06/12/2020

Early-Learning Regularization Prevents Memorization of Noisy Labels

Sheng Liu, Jonathan Niles-Weed, Narges Razavian, Carlos Fernandez-Granda

Keywords Paper

0

0

0

0

3:06

02/02/2021

Multi-SpectroGAN: High-Diversity and High-Fidelity Spectrogram Generation with Adversarial Style Combination for Speech Synthesis

Sang-Hoon Lee, Hyun-Wook Yoon, Hyeong-Rae Noh and
Ji-Hoon Kim, Seong-Whan Lee

Keywords Paper

0

0

0

0

14:19

18/07/2021

Unsupervised Representation Learning via Neural Activation Coding

Yookoon Park, Sangho Lee, Gunhee Kim, David Blei

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

13:50

16/11/2020

Data Weighted Training Strategies for Grammatical Error Correction

Jared Lichtarge, Chris Alberti, Shankar Kumar

Keywords Paper

neural nmt, neural, example scoring, gec

0

0

0

0

10:22

12/07/2020

Peer Loss Functions: Learning from Noisy Labels without Knowing Noise Rates

Yang Liu, Hongyi Guo

Keywords Paper

Supervised Learning

0

0

0

0

15:57

04/07/2020

Parallel Corpus Filtering via Pre-trained Language Models

Boliang Zhang, Ajay Nagesh, Kevin Knight

Keywords Paper

machine models, WMT task, Parallel Filtering, Pre-trained Models

0

0

0

0

12:04

12/07/2020

A Sequential Self Teaching Approach for Improving Generalization in Sound Event Recognition

Anurag Kumar, Vamsi Krishna Ithapu

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

14:17

26/04/2020

Compositional languages emerge in a neural iterated learning model

Yi Ren, Shangmin Guo, Matthieu Labeau and
Shay B. Cohen, Simon Kirby

Keywords Paper

Compositionality, Multi-agent, Emergent language, Iterated learning

0

0

0

0

5:07