TaylorGAN: Neighbor-Augmented Policy Update Towards Sample-Efficient Natural Language Generation

06/12/2020

TaylorGAN: Neighbor-Augmented Policy Update Towards Sample-Efficient Natural Language Generation

Chun-Hsing Lin, Siang-Ruei Wu, Hung-yi Lee, Yun-Nung (Vivian) Chen

Keywords:

Abstract Paper Similar Papers

Abstract: Score function-based natural language generation (NLG) approaches such as REINFORCE, in general, suffer from low sample efficiency and training instability problems. This is mainly due to the non-differentiable nature of the discrete space sampling and thus these methods have to treat the discriminator as a black box and ignore the gradient information. To improve the sample efficiency and reduce the variance of REINFORCE, we propose a novel approach, TaylorGAN, which augments the gradient estimation by off-policy update and the first-order Taylor expansion. This approach enables us to train NLG models from scratch with smaller batch size --- without maximum likelihood pre-training, and outperforms existing GAN-based methods on multiple metrics of quality and diversity.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

ColdGANs: Taming Language GANs with Cautious Sampling Strategies

Thomas Scialom, Paul-Alexis Dray, Sylvain Lamprier and
Benjamin Piwowarski, Jacopo Staiano

Keywords Paper

0

0

0

0

3:19

04/07/2020

Mind the Trade-off: Debiasing NLU Models without Degrading the In-distribution Performance

Prasetya Ajie Utama, Nafise Sadat Moosavi, Iryna Gurevych

Keywords Paper

Debiasing Models, natural tasks, NLU tasks, debiasing methods

0

0

0

1

11:09

04/07/2020

Reducing Gender Bias in Neural Machine Translation as a Domain Adaptation Problem

Danielle Saunders, Bill Byrne

Keywords Paper

Reducing Bias, Neural Translation, Domain Problem, NLP tasks

0

0

0

0

11:50

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords Paper

Natural Inference, data augmentation, Robustifying Models, deep models

0

0

0

0

11:34

14/09/2020

Learning Implicit Generative Models By Teaching Density Estimators

Kun Xu, Chao Du, Chongxuan Li and
Jun Zhu, Bo Zhang

Keywords Paper

deep generative models, generative adversarial nets, mode collapse problem

0

0

0

0

9:30

26/04/2020

Neural Text Generation With Unlikelihood Training

Sean Welleck, Ilia Kulikov, Stephen Roller and
Emily Dinan, Kyunghyun Cho, Jason Weston

Keywords Paper

language modeling, machine learning

0

0

0

0

4:20

17/08/2020

Learning temporal coherence via self-supervision for GAN-based video generation

Mengyu Chu, You Xie, Jonas Mayer and
Laura Leal-Taixé, Nils Thuerey

Keywords Paper

self-supervision, temporal cycle-consistency, video super-resolution, generative adversarial network, unpaired video translation

0

0

0

0

16:59

26/04/2020

Is a Good Representation Sufficient for Sample Efficient Reinforcement Learning?

Simon S. Du, Sham M. Kakade, Ruosong Wang, Lin F. Yang

Keywords Paper

reinforcement learning, function approximation, lower bound, representation

0

0

0

0

4:55

06/12/2021

Bridging Offline Reinforcement Learning and Imitation Learning: A Tale of Pessimism

Paria Rashidinejad, Banghua Zhu, Cong Ma and
Jiantao Jiao, Stuart Russell

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

12:21

08/12/2020

Emergent Communication Pretraining for Few-Shot Machine Translation

Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen

Keywords Paper

0

0

0

0

14:42

02/02/2021

DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation

Haoyue Bai, Rui Sun, Lanqing Hong and
Fengwei Zhou, Nanyang Ye, Han-Jia Ye, S.-H. Gary Chan, Zhenguo Li

Keywords Paper

0

0

0

0

15:59

04/07/2020

End-to-End Bias Mitigation by Modelling Biases in Corpora

Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

Keywords Paper

End-to-End Mitigation, real-world scenarios, training, large-scale benchmarks

0

0

0

0

10:57

26/04/2020

Language GANs Falling Short

Massimo Caccia, Lucas Caccia, William Fedus and
Hugo Larochelle, Joelle Pineau, Laurent Charlin

Keywords Paper

NLP, GAN, MLE, adversarial, text generation, temperature

0

0

0

0

4:29

16/11/2020

From Zero to Hero: On the Limitations of Zero-Shot Language Transfer with Multilingual Transformers

Anne Lauscher, Vinit Ravishankar, Ivan Vulić, Goran Glavaš

Keywords Paper

zero-shot transfer, downstream transfer, resource-lean scenarios, pos tagging

0

0

0

0

11:45

06/12/2021

Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection

Jingjing Li, Wei Ji, Qi Bi and
Cheng Yan, Miao Zhang, Yongri Piao, Huchuan Lu, Li cheng

Keywords Paper

vision

0

0

0

0

9:03

19/08/2021

BESA: BERT-based Simulated Annealing for Adversarial Text Attacks

Xinghao Yang, Weifeng Liu, Dacheng Tao, Wei Liu

Keywords Paper

Machine Learning, Adversarial Machine Learning, Natural Language Processing

0

0

0

0

14:01

26/04/2020

Identifying through Flows for Recovering Latent Representations

Shen Li, Bryan Hooi, Gim Hee Lee

Keywords Paper

Representation learning, identifiable generative models, nonlinear-ICA

0

0

0

0

5:11

06/12/2021

True Few-Shot Learning with Language Models

Ethan Perez, Douwe Kiela, Kyunghyun Cho

Keywords Paper

language, few shot learning

0

0

0

0

15:04

13/04/2021

Automatic differentiation variational inference with mixtures

Warren Morningstar, Sharad Vikram, Cusuh Ham and
Andrew Gallagher, Joshua Dillon

Keywords Paper

0

0

0

0

3:05

06/12/2021

Data-Efficient Instance Generation from Instance Discrimination

Ceyuan Yang, Yujun Shen, Yinghao Xu, Bolei Zhou

Keywords Paper

machine learning, generative model

0

0

0

0

6:53

05/01/2021

Multimodal Prototypical Networks for Few-Shot Learning

Frederik Pahde, Mihai Puscas, Tassilo Klein, Moin Nabi

Keywords Paper

0

0

0

0

4:56

02/02/2021

Longitudinal Deep Kernel Gaussian Process Regression

Junjie Liang, Yanting Wu, Dongkuan Xu, Vasant G Honavar

Keywords Paper

0

0

0

0

16:27

06/12/2020

HAWQ-V2: Hessian Aware trace-Weighted Quantization of Neural Networks

Zhen Dong, Zhewei Yao, Daiyaan Arfeen and
Amir Gholami, Michael Mahoney, Kurt Keutzer

Keywords Paper

1

0

0

0

3:21

06/12/2020

Generalized Hindsight for Reinforcement Learning

Alex Li, Lerrel Pinto, Pieter Abbeel

Keywords Paper

0

0

0

0

3:20

19/08/2021

Learning Deeper Non-Monotonic Networks by Softly Transferring Solution Space

Zheng-Fan Wu, Hui Xue, Weimin Bai

Keywords Paper

Machine Learning, Kernel Methods, Deep Learning, Classification

0

0

0

0

12:50

16/11/2020

Do sequence-to-sequence VAEs learn global features of sentences?

Tom Bosc, Pascal Vincent

Keywords Paper

generation, memorization, autoregressive models, variational autoencoder

0

0

0

0

12:00

26/04/2020

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning

Kimin Lee, Kibok Lee, Jinwoo Shin, Honglak Lee

Keywords Paper

Deep reinforcement learning, Generalization in visual domains

0

0

0

0

5:03

30/11/2020

dpVAEs: Fixing Sample Generation for Regularized VAEs

Riddhish Bhalodia, Iain Lee, Shireen Elhabian

Keywords Paper

0

0

0

0

7:54

03/05/2021

Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online

Yangchen Pan, Kirby Banman, Martha White

Keywords Paper

natural sparsity, Reinforcement learning, fuzzy tiling activation function, sparse representation

0

0

0

1

6:22

06/12/2021

Consistency Regularization for Variational Auto-Encoders

Samarth Sinha, Adji Bousso Dieng

Keywords Paper

deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

0

0

0

0

10:52

06/12/2020

Sampling-Decomposable Generative Adversarial Recommender

Binbin Jin, Defu Lian, Zheng Liu and
Qi Liu, Jianhui Ma, Xing Xie, Enhong Chen

Keywords Paper

0

0

0

0

3:17

06/12/2020

Improving GAN Training with Probability Ratio Clipping and Sample Reweighting

Yue Wu, Pan Zhou, Andrew Wilson and
Eric Xing, Zhiting Hu

Keywords Paper

0

0

0

0

3:22

16/11/2020

An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels

Ilias Chalkidis, Manos Fergadiotis, Sotiris Kotitsas and
Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos

Keywords Paper

flat classification, hierarchical approaches, zero-shot learning, few learning

0

0

0

0

12:21

03/05/2021

You Only Need Adversarial Supervision for Semantic Image Synthesis

Edgar Schoenfeld, Vadim Sushko, Dan Zhang and
Juergen Gall, Bernt Schiele, Anna Khoreva

Keywords Paper

GANs, Semantic Image Synthesis, Image Generation, Deep Learning

0

0

0

0

5:11

16/11/2020

If beam search is the answer, what was the question?

Clara Meister, Ryan Cotterell, Tim Vieira

Keywords Paper

language tasks, beam search, decoding, maximum decoding

0

0

0

0

12:18

06/12/2021

Curriculum Offline Imitating Learning

Minghuan Liu, Hanye Zhao, Zhengyu Yang and
Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

Keywords Paper

reinforcement learning and planning

0

0

0

0

14:28

02/02/2021

LREN: Low-Rank Embedded Network for Sample-Free Hyperspectral Anomaly Detection

Kai Jiang, Weiying Xie, Jie Lei and
Tao Jiang, Yunsong Li

Keywords Paper

0

0

0

0

12:56

14/06/2020

Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation

Runfa Chen, Wenbing Huang, Binghui Huang and
Fuchun Sun, Bin Fang

Keywords Paper

nice-gan, reusing discriminators for encoding, unsupervised image-to-image translation, decoupled training, multi-scale discriminators, adversarial loss, no independent component for encoding, shared layers, residual attention, cyclegan

0

0

0

0

1:01

19/10/2020

Robust normalized squares maximization for unsupervised domain adaptation

Wenju Zhang, Xiang Zhang, Qing Liao and
Wenjing Yang, Long Lan, Zhigang Luo

Keywords Paper

transfer learning, image classification, domain adaptation

0

0

0

0

6:23

06/12/2020

Information Theoretic Counterfactual Learning from Missing-Not-At-Random Feedback

Zifeng Wang, Xi Chen, Rui Wen and
Shao-Lun Huang, Ercan E Kuruoglu, Yefeng Zheng

Keywords Paper

0

0

0

0

3:23