TextGAIL: Generative Adversarial Imitation Learning for Text Generation

02/02/2021

Qingyang Wu, Lei Li, Zhou Yu

Abstract: Generative Adversarial Networks (GANs) for text generation have recently received much criticism, as they perform worse than their MLE counterparts. We suspect that the inferior performance of previous text GANs is due to the lack of a reliable guiding signal in their discriminators. To address this problem, we propose a generative adversarial imitation learning framework for text generation that uses large pre-trained language models to provide more reliable reward guidance. Because previous text GANs suffer from high gradient variance, we apply a contrastive discriminator and proximal policy optimization (PPO) to stabilize and improve text generation performance. For evaluation, we conduct experiments on a diverse set of unconditional and conditional text generation tasks. Experimental results show that TextGAIL achieves better performance than the MLE baseline in terms of both quality and diversity. We also validate, on an additional task, our intuition that TextGAIL's discriminator is capable of providing reasonable rewards.
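The abstract names proximal policy optimization (PPO) as one of the stabilizing components. As a rough illustration only, the standard PPO clipped surrogate objective can be sketched as below. This is the generic textbook form, not the authors' implementation: the function name, the per-sequence log-probability inputs, and the default clipping value of 0.2 are all assumptions made for this example, and the advantages would in practice be derived from the discriminator's rewards.

```python
import math


def ppo_clipped_objective(new_logprobs, old_logprobs, advantages, clip_eps=0.2):
    """Mean PPO clipped surrogate objective over a batch of samples.

    All names and the default clip_eps are illustrative assumptions;
    they are not taken from the TextGAIL paper.
    """
    total = 0.0
    for new_lp, old_lp, adv in zip(new_logprobs, old_logprobs, advantages):
        # Probability ratio between the updated policy and the sampling policy.
        ratio = math.exp(new_lp - old_lp)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(ratio, 1.0 + clip_eps))
        # Take the pessimistic (smaller) of the unclipped and clipped terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)


if __name__ == "__main__":
    # A sample whose probability grew a lot under the new policy: the gain is capped.
    print(ppo_clipped_objective([0.0], [-1.0], [1.0]))
```

Clipping bounds how much a single update can gain from raising the probability of a well-rewarded sample, which is one common way to limit the variance of policy-gradient updates.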

The video of this talk is available at:

https://slideslive.com/38948462

The talk and the corresponding paper were published at the AAAI 2021 virtual conference.

Similar Papers

06/12/2020

ColdGANs: Taming Language GANs with Cautious Sampling Strategies

Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier, Benjamin Piwowarski, Jacopo Staiano

06/12/2020

Improving GAN Training with Probability Ratio Clipping and Sample Reweighting

Yue Wu, Pan Zhou, Andrew Wilson, Eric Xing, Zhiting Hu

03/05/2021

Empirical Analysis of Unlabeled Entity Problem in Named Entity Recognition

Yangming Li, Lemao Liu, Shuming Shi

Keywords: Negative Sampling, Unlabeled Entity Problem, Named Entity Recognition

03/05/2021

Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System

Jianhong Wang, Yuan Zhang, Tae-Kyun Kim, Yunjie Gu

Keywords: Task-oriented Dialogue System, Hierarchical Reinforcement Learning, Policy Optimization, Natural Language Processing

06/12/2020

Generalized Hindsight for Reinforcement Learning

Alex Li, Lerrel Pinto, Pieter Abbeel

04/07/2020

Max-Margin Incremental CCG Parsing

Miloš Stanojević, Mark Steedman

Keywords: Incremental parsing, human processing, ASR, MT

26/04/2020

The Curious Case of Neural Text Degeneration

Ari Holtzman, Jan Buys, Li Du, Maxwell Forbes, Yejin Choi

Keywords: generation, text, NLG, NLP, natural language, natural language generation, language model, neural, neural language model

26/08/2020

Nested-Wasserstein Self-Imitation Learning for Sequence Generation

Ruiyi Zhang, Changyou Chen, Zhe Gan, Zheng Wen, Wenlin Wang, Lawrence Carin

06/12/2021

Curriculum Disentangled Recommendation with Noisy Multi-feedback

Hong Chen, Yudong Chen, Xin Wang, Ruobing Xie, Rui Wang, Feng Xia, Wenwu Zhu

Keywords: representation learning, interpretability

03/05/2021

What Makes Instance Discrimination Good for Transfer Learning?

Nanxuan Zhao, Zhirong Wu, Rynson W Lau, Stephen Lin

Keywords: Unsupervised Learning, Transfer Learning, Self-supervised Learning

26/04/2020

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords: Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

01/07/2020

Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference

Cemil Cengiz, Deniz Yuret

26/04/2020

Self-Adversarial Learning with Comparative Discrimination for Text Generation

Wangchunshu Zhou, Tao Ge, Ke Xu, Furu Wei, Ming Zhou

Keywords: adversarial learning, text generation

26/04/2020

On the Weaknesses of Reinforcement Learning for Neural Machine Translation

Leshem Choshen, Lior Fox, Zohar Aizenbud, Omri Abend

Keywords: Reinforcement learning, MRT, minimum risk training, reinforce, machine translation, peakiness, generation

16/11/2020

An Information Bottleneck Approach for Controlling Conciseness in Rationale Extraction

Bhargavi Paranjape, Mandar Joshi, John Thickstun, Hannaneh Hajishirzi, Luke Zettlemoyer

Keywords: language understanding, semi-supervised setting, complex models, explainer

16/11/2020

On the Sentence Embeddings from Pre-trained Language Models

Bohan Li, Hao Zhou, Junxian He, Mingxuan Wang, Yiming Yang, Lei Li

Keywords: natural processing, semantic task, semantic tasks, pre-trained representations

26/04/2020

Understanding the Limitations of Conditional Generative Models

Ethan Fetaya, Joern-Henrik Jacobsen, Will Grathwohl, Richard Zemel

Keywords: Conditional Generative Models, Generative Classifiers, Robustness, Adversarial Examples

16/11/2020

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning

Yuning Mao, Yanru Qu, Yiqing Xie, Xiang Ren, Jiawei Han

Keywords: single-document summarization, single-document sds, multi-document summarization, multi-document mds

06/12/2020

Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration

Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

03/05/2021

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve, Kevin Duarte, Yogesh S Rawat, Mubarak Shah

Keywords: Deep Learning, Calibration, Uncertainty, Pseudo-Labeling, Semi-Supervised Learning

04/07/2020

Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training

Margaret Li, Stephen Roller, Ilia Kulikov, Sean Welleck, Y-Lan Boureau, Kyunghyun Cho, Jason Weston

Keywords: dialogue tasks, Unlikelihood Training, Generative models, maximum training

06/12/2020

Robust Deep Reinforcement Learning against Adversarial Perturbations on State Observations

Huan Zhang, Hongge Chen, Chaowei Xiao, Bo Li, Mingyan Liu, Duane Boning, Cho-Jui Hsieh

02/02/2021

A Case Study of the Shortcut Effects in Visual Commonsense Reasoning

Keren Ye, Adriana Kovashka

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords: data process, re collection, sota models, tacred

06/12/2021

Understanding the Limits of Unsupervised Domain Adaptation via Data Poisoning

Akshay Mehra, Bhavya Kailkhura, Pin-Yu Chen, Jihun Hamm

Keywords: robustness, domain adaptation

16/11/2020

BERT-ATTACK: Adversarial Attack Against BERT Using BERT

Linyang Li, Ruotian Ma, Qipeng Guo, Xiangyang Xue, Xipeng Qiu

Keywords: adversarial attacks, downstream tasks, calculation, gradient-based methods

04/07/2020

Text Classification with Negative Supervision

Sora Ohashi, Junya Takayama, Tomoyuki Kajiwara, Chenhui Chu, Yuki Arase

Keywords: Text Classification, text representation, text tasks, single- classifications

02/02/2021

Learning from Noisy Labels with Complementary Loss Functions

Deng-Bao Wang, Yong Wen, Lujia Pan, Min-Ling Zhang

04/07/2020

On the Limitations of Cross-lingual Encoders as Exposed by Reference-Free Machine Translation Evaluation

Wei Zhao, Goran Glavaš, Maxime Peyrard, Yang Gao, Robert West, Steffen Eger

Keywords: Evaluation encoders, zero-shot transfer, supervised tasks, web-scale systems

14/06/2020

Towards Causal VQA: Revealing and Reducing Spurious Correlations by Invariant and Covariant Semantic Editing

Vedika Agarwal, Rakshith Shetty, Mario Fritz

Keywords: robustness, vqa, causality, gan, dataset, evaluation, automated semantic scene editing, data augmentation, invariance, covariance

06/12/2021

True Few-Shot Learning with Language Models

Ethan Perez, Douwe Kiela, Kyunghyun Cho

Keywords: language, few shot learning

08/12/2020

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen, Goran Glavaš

12/07/2020

Mix-n-Match: Ensemble and Compositional Methods for Uncertainty Calibration in Deep Learning

Jize Zhang, Bhavya Kailkhura, T. Yong-Jin Han

Keywords: Accountability, Transparency and Interpretability

02/02/2021

Robustness to Spurious Correlations in Text Classification via Automatically Generated Counterfactuals

Zhao Wang, Aron Culotta

16/11/2020

If beam search is the answer, what was the question?

Clara Meister, Ryan Cotterell, Tim Vieira

Keywords: language tasks, beam search, decoding, maximum decoding

04/07/2020

FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization

Esin Durmus, He He, Mona Diab

Keywords: Faithfulness Assessment, Abstractive Summarization, evaluating summary, reading comprehension

26/04/2020

Neural Text Generation With Unlikelihood Training

Sean Welleck, Ilia Kulikov, Stephen Roller, Emily Dinan, Kyunghyun Cho, Jason Weston

Keywords: language modeling, machine learning

06/12/2020

ContraGAN: Contrastive Learning for Conditional Image Generation

Minguk Kang, Jaesik Park

Keywords: Neuroscience and Cognitive Science -> Brain Mapping, Neuroscience and Cognitive Science -> Visual Perception

25/07/2020

DVGAN: A minimax game for search result diversification combining explicit and implicit features

Jiongnan Liu, Zhicheng Dou, Xiaojie Wang, Shuqi Lu, Ji-Rong Wen

Keywords: generative adversarial network, search result diversification

26/04/2020

Learning The Difference That Makes A Difference With Counterfactually-Augmented Data

Divyansh Kaushik, Eduard Hovy, Zachary Lipton

Keywords: humans in the loop, annotation artifacts, text classification, sentiment analysis, natural language inference