Improving Text Generation with Student-Forcing Optimal Transport

Abstract: Neural language models are often trained with maximum likelihood estimation (MLE), where the next word is generated conditioned on the ground-truth word tokens. During testing, however, the model is instead conditioned on previously generated tokens, resulting in what is termed exposure bias. To reduce this gap between training and testing, we propose using optimal transport (OT) to match the sequences generated in these two modes. We examine the necessity of adding a Student-Forcing scheme during training through an imitation learning interpretation. An extension is further proposed to improve OT learning for long sequences, based on the structural and contextual information of the text sequences. The effectiveness of the proposed method is validated on machine translation, text summarization, and text generation tasks.

16/11/2020

Machine Learning, Adversarial Machine Learning, Explainable/Interpretable Machine Learning, Sentiment Analysis and Text Mining

14:52

03/05/2021

Jianqiao Li, Chunyuan Li, Guoyin Wang, Hao Fu, Yuhchen Lin, Liqun Chen, Yizhe Zhang, Chenyang Tao, Ruiyi Zhang, Wenlin Wang, Dinghan Shen, Qian Yang, Lawrence Carin

Similar Papers

Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation

Minki Kang, Moonsu Han, Sung Ju Hwang

self-supervised pre-training, question answering, task, reinforcement learning

Using Context in Neural Machine Translation Training Objectives

Danielle Saunders, Felix Stahlberg, Bill Byrne

Neural training, NMT training, document-level training, NMT objective

Dynamic Data Selection and Weighting for Iterative Back-Translation

Zi-Yi Dou, Antonios Anastasopoulos, Graham Neubig

neural translation, neural nmt, nmt, domain adaptation

Cross-Thought for Sentence Encoder Pre-training

Shuohang Wang, Yuwei Fang, Siqi Sun, Zhe Gan, Yu Cheng, Jingjing Liu, Jing Jiang

pre-training encoder, large-scale tasks, question answering, predicting words

Improving Grammatical Error Correction Models with Purpose-Built Adversarial Examples

Lihao Wang, Xiaoqing Zheng

grammatical correction, sequence-to-sequence learning, neural networks, gec

Improving Transformation Invariance in Contrastive Representation Learning

Adam Foster, Rattana Pukdee, Tom Rainforth

transformation invariance, contrastive learning, representation learning

Improving Generalization in Meta-learning via Task Augmentation

Huaxiu Yao, Long-Kai Huang, Linjun Zhang, Ying WEI, Li Tian, James Zou, Junzhou Huang, Zhenhui (Jessie) Li

Algorithms, Multitask, Transfer, and Meta Learning

Show Me How To Revise: Improving Lexically Constrained Sentence Generation with XLNet

Xingwei He, Victor O.K. Li

Meta Fine-Tuning Neural Language Models for Multi-Domain Text Mining

Chengyu Wang, Minghui Qiu, Jun Huang, Xiaofeng He

nlp tasks, fine-tuning, learning process, multi-domain tasks

Dynamic Context Selection for Document-level Neural Machine Translation via Reinforcement Learning

Xiaomian Kang, Yang Zhao, Jiajun Zhang, Chengqing Zong

document-level translation, translations, document-level model, selection module

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Generating Dialogue Responses from a Semantic Latent Space

Wei-Jen Ko, Avik Ray, Yilin Shen, Hongxia Jin

generation responses, regression task, open-domain models, end-to-end classification

CSP: Code-Switching Pre-training for Neural Machine Translation

Zhen Yang, Bojie Hu, Ambyera Han, Shen Huang, Qi Ju

neural nmt, lexicon induction, unsupervised nmt, pre-training method

Coach: A Coarse-to-Fine Approach for Cross-domain Slot Filling

Zihan Liu, Genta Indra Winata, Peng Xu, Pascale Fung

Cross-domain Filling, task-oriented systems, slot filling, data problem

On Guaranteed Optimal Robust Explanations for NLP Models

Emanuele La Malfa, Rhiannon Michelmore, Agnieszka M. Zbrzezny, Nicola Paoletti, Marta Kwiatkowska

Machine Learning, Adversarial Machine Learning, Explainable/Interpretable Machine Learning, Sentiment Analysis and Text Mining

Predicting Inductive Biases of Pre-Trained Models

Charles Lovering, Rohan Jha, Tal Linzen, Ellie Pavlick

probing, information-theoretical probing, natural language processing, challenge sets

Attentive Pooling with Learnable Norms for Text Representation

Chuhan Wu, Fangzhao Wu, Tao Qi, Xiaohui Cui, Yongfeng Huang

Text Representation, text representations, model training, Pooling

Prioritized Level Replay

Minqi Jiang, Edward Grefenstette, Tim Rocktäschel

Reinforcement Learning and Planning, Deep RL

Domain General Face Forgery Detection by Learning to Weight

Ke Sun, Hong Liu, Qixiang Ye, Yue Gao, Jianzhuang Liu, Ling Shao, Rongrong Ji

Residual Energy-Based Models for Text Generation

Yuntian Deng, Anton Bakhtin, Myle Ott, Arthur Szlam, Marc'Aurelio Ranzato
