Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

16/11/2020


Rongsheng Zhang, Yinhe Zheng, Jianzhi Shao, Xiaoxi Mao, Yadong Xi, Minlie Huang

Keywords: collecting data, automatic evaluation, open-domain systems, neural models


Abstract: Recent advances in open-domain dialogue systems rely on the success of neural models that are trained on large-scale data. However, collecting large-scale dialogue data is usually time-consuming and labor-intensive. To address this data dilemma, we propose a novel data augmentation method for training open-domain dialogue models by utilizing unpaired data. Specifically, a data-level distillation process is first proposed to construct augmented dialogues in which both the post and the response are retrieved from the unpaired data. A ranking module is employed to filter out low-quality dialogues. Further, a model-level distillation process is employed to distill a teacher model trained on high-quality paired data to the augmented dialogue pairs, thereby preventing dialogue models from being affected by the noise in the augmented data. Automatic and manual evaluations indicate that our method can produce high-quality dialogue pairs with diverse contents, and that the proposed data-level and model-level dialogue distillation can improve the performance of competitive baselines.
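
The two stages described in the abstract can be pictured with a small sketch. This is only an illustration under stated assumptions, not the authors' implementation: the Jaccard retriever, the `score_fn` ranker interface, the threshold, and the mixing weight `alpha` in the loss are all hypothetical stand-ins chosen to keep the example self-contained.

```python
import math
from itertools import product

def jaccard(a, b):
    """Token-level Jaccard similarity (a toy stand-in for a real retriever)."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    union = sa | sb
    return len(sa & sb) / len(union) if union else 0.0

def retrieve(query, pool, k):
    """Return up to k pool sentences most similar to the query (score > 0)."""
    scored = sorted(((jaccard(query, s), s) for s in pool if s != query),
                    reverse=True)
    return [s for score, s in scored[:k] if score > 0]

def augment(anchor_pairs, unpaired, score_fn, k=2, threshold=0.5):
    """Data-level step (assumed form): for each anchor (post, response) pair,
    retrieve similar posts and responses from the unpaired sentences, combine
    them into candidate pairs, and keep those the ranker scores >= threshold."""
    kept = []
    for post, response in anchor_pairs:
        posts = retrieve(post, unpaired, k)
        responses = retrieve(response, unpaired, k)
        for p, r in product(posts, responses):
            if p != r and score_fn(p, r) >= threshold:
                kept.append((p, r))
    return kept

def distillation_loss(student_probs, teacher_probs, gold_index, alpha=0.5):
    """Model-level step (assumed form): mix the negative log-likelihood of the
    gold token with KL(teacher || student) over one token distribution."""
    nll = -math.log(student_probs[gold_index])
    kl = sum(t * math.log(t / s)
             for t, s in zip(teacher_probs, student_probs) if t > 0)
    return alpha * nll + (1 - alpha) * kl

# Toy usage: one paired anchor dialogue and a few unpaired sentences.
anchors = [("do you like cats", "yes i love cats")]
unpaired = ["do you like dogs", "yes i love dogs", "the weather is nice today"]
pairs = augment(anchors, unpaired, score_fn=jaccard, k=2, threshold=0.1)
```

In practice the ranker would be a trained matching model rather than word overlap, and the loss would be averaged over all tokens of a response; the sketch only shows where filtering and teacher supervision enter the pipeline.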


The talk and the corresponding paper were published at the EMNLP 2020 virtual conference.


Similar Papers

04/07/2020

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight

Hengyi Cai, Hongshen Chen, Yonghao Song, Cheng Zhang, Xiaofang Zhao, Dawei Yin

Keywords: Data Manipulation, Neural Generation, learning, dialogue generation

06/12/2020

Zero-Resource Knowledge-Grounded Dialogue Generation

Linxiao Li, Can Xu, Wei Wu, Yufan Zhao, Xueliang Zhao, Chongyang Tao

06/12/2021

MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps

Awais Muhammad, Fengwei Zhou, Chuanlong Xie, Jiawei Li, Sung-Ho Bae, Zhenguo Li

Keywords: deep learning, optimization, robustness, adversarial robustness and security

19/08/2021

A Structure Self-Aware Model for Discourse Parsing on Multi-Party Dialogues

Ante Wang, Linfeng Song, Hui Jiang, Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords: Natural Language Processing, Dialogue, Discourse, Tagging, Chunking, and Parsing

02/02/2021

Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues

Ruijian Xu, Chongyang Tao, Daxin Jiang, Xueliang Zhao, Dongyan Zhao, Rui Yan

16/11/2020

Improving Grammatical Error Correction Models with Purpose-Built Adversarial Examples

Lihao Wang, Xiaoqing Zheng

Keywords: grammatical correction, sequence-to-sequence learning, neural networks, GEC

19/08/2021

Object Detection in Densely Packed Scenes via Semi-Supervised Learning with Dual Consistency

Chao Ye, Huaidong Zhang, Xuemiao Xu, Weiwei Cai, Jing Qin, Kup-Sze Choi

Keywords: Computer Vision, Recognition, Deep Learning, Semi-Supervised Learning

06/12/2020

ExpandNets: Linear Over-parameterization to Train Compact Convolutional Networks

Shuxuan Guo, Jose M. Alvarez, Mathieu Salzmann


06/12/2021

Stylized Dialogue Generation with Multi-Pass Dual Learning

Jinpeng Li, Yingce Xia, Rui Yan, Hongda Sun, Dongyan Zhao, Tie-Yan Liu

19/04/2021

The Gutenberg dialogue dataset

Richard Csaky, Gábor Recski


03/05/2021

MixKD: Towards Efficient Distillation of Large-scale Language Models

Kevin Liang, Weituo Hao, Dinghan Shen, Yufan Zhou, Weizhu Chen, Changyou Chen, Lawrence Carin

Keywords: Representation Learning, Natural Language Processing

08/12/2020

Query Distillation: BERT-based Distillation for Ensemble Ranking

Wangshu Zhang, Junhong Liu, Zujie Wen, Yafang Wang, Gerard de Melo

12/07/2020

Self-PU: Self Boosted and Calibrated Positive-Unlabeled Training

Xuxi Chen, Wuyang Chen, Tianlong Chen, Ye Yuan, Chen Gong, Kewei Chen, Zhangyang Wang

Keywords: Supervised Learning

02/02/2021

Automatic Curriculum Learning With Over-repetition Penalty for Dialogue Policy Learning

Yangyang Zhao, Zhenyu Wang, Zhenhua Huang


25/07/2020

Improving contextual language models for response retrieval in multi-turn conversation

Junyu Lu, Xiancong Ren, Yazhou Ren, Ao Liu, Zenglin Xu

Keywords: pre-trained language model, augmentation, response retrieval

02/02/2021

Robust Knowledge Transfer via Hybrid Forward on the Teacher-Student Model

Liangchen Song, Jialian Wu, Ming Yang, Qian Zhang, Yuan Li, Junsong Yuan

03/05/2021

Learning A Minimax Optimizer: A Pilot Study

Jiayi Shen, Xiaohan Chen, Howard Heaton, Tianlong Chen, Jialin Liu, Wotao Yin, Zhangyang Wang

Keywords: Minimax Optimization, Learning to Optimize

04/07/2020

Multi-Domain Dialogue Acts and Response Co-Generation

Kai Wang, Junfeng Tian, Rui Wang, Xiaojun Quan, Jianxing Yu

Keywords: Generating responses, task-oriented systems, response generation, automatic evaluations

05/01/2021

Distillation Multiple Choice Learning for Multimodal Action Recognition

Nuno Cruz Garcia, Sarah Adel Bargal, Vitaly Ablavsky, Pietro Morerio, Vittorio Murino, Stan Sclaroff

02/02/2021

DeepCollaboration: Collaborative Generative and Discriminative Models for Class Incremental Learning

Bo Cui, Guyue Hu, Shan Yu


02/02/2021

Multi-View Feature Representation for Dialogue Generation with Bidirectional Distillation

Shaoxiong Feng, Xuancheng Ren, Kan Li, Xu Sun


13/04/2021

Learning to defend by learning to attack

Haoming Jiang, Zhehui Chen, Yuyang Shi, Bo Dai, Tuo Zhao

06/12/2020

Self-paced Contrastive Learning with Hybrid Memory for Domain Adaptive Object Re-ID

Yixiao Ge, Feng Zhu, Dapeng Chen, Rui Zhao, Hongsheng Li

23/08/2020

Multimodal learning with incomplete modalities by knowledge distillation

Qi Wang, Liang Zhan, Paul Thompson, Jiayu Zhou

Keywords: knowledge distillation, multimodal learning, incomplete modalities

19/08/2021

Novelty Detection via Contrastive Learning with Negative Data Augmentation

Chengwei Chen, Yuan Xie, Shaohui Lin, Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Keywords: Computer Vision, 2D and 3D Computer Vision, Anomaly/Outlier Detection

14/06/2020

Neural Networks Are More Productive Teachers Than Human Raters: Active Mixup for Data-Efficient Knowledge Distillation From a Blackbox Model

Dongdong Wang, Yandong Li, Liqiang Wang, Boqing Gong

Keywords: blackbox knowledge distillation, data-efficient learning, active learning, mixup

03/05/2021

On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning

Ren Wang, Kaidi Xu, Sijia Liu, Pin-Yu Chen, Lily Weng, Chuang Gan, Meng Wang

08/12/2020

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

Chen Xu, Bojie Hu, Yufan Jiang, Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

18/07/2021

Self-Damaging Contrastive Learning

Ziyu Jiang, Tianlong Chen, Bobak Mortazavi, Zhangyang Wang

Keywords: Algorithms, Unsupervised Learning

16/11/2020

Amalgamating Knowledge from Two Teachers for Task-oriented Dialogue System with Adversarial Training

Wanwei He, Min Yang, Rui Yan, Chengming Li, Ying Shen, Ruifeng Xu

Keywords: task completion, generating responses, task-oriented dialogue, task-oriented systems

04/07/2020

Improving Non-autoregressive Neural Machine Translation with Monolingual Data

Jiawei Zhou, Phillip Keung

Keywords: Non-autoregressive Translation, WMT14 tasks, monolingual augmentation, knowledge distillation

05/01/2021

Effectiveness of Arbitrary Transfer Sets for Data-Free Knowledge Distillation

Gaurav Kumar Nayak, Konda Reddy Mopuri, Anirban Chakraborty


18/07/2021

Interactive Learning from Activity Description

Khanh Nguyen, Dipendra Misra, Robert Schapire, Miro Dudik, Patrick Shafto

Keywords: Reinforcement Learning and Planning

03/05/2021

Federated Learning Based on Dynamic Regularization

Durmus Alp Emre Acar, Yue Zhao, Ramon Matas, Matthew Mattina, Paul Whatmough, Venkatesh Saligrama

Keywords: Distributed Optimization, Deep Neural Networks, Federated Learning

06/12/2021

No Fear of Heterogeneity: Classifier Calibration for Federated Learning with Non-IID Data

Mi Luo, Fei Chen, Dapeng Hu, Yifan Zhang, Jian Liang, Jiashi Feng

Keywords: optimization, machine learning, federated learning

12/07/2020

Training Binary Neural Networks through Learning with Noisy Supervision

Kai Han, Yunhe Wang, Yixing Xu, Chunjing Xu, Enhua Wu, Chang Xu

Keywords: Unsupervised and Semi-Supervised Learning

04/07/2020

Diversifying Dialogue Generation with Non-Conversational Text

Hui Su, Xiaoyu Shen, Sanqiang Zhao, Zhou Xiao, Pengwei Hu, Randy Zhong, Cheng Niu, Jie Zhou

Keywords: Diversifying Generation, low-diversity problem, open-domain generation, dialogue generation

22/11/2021

DomainMix: Learning Generalizable Person Re-Identification Without Human Annotations

Wenhao Wang, Shengcai Liao, Fang Zhao, Cuicui Kang, Ling Shao

Keywords: DomainMix, Person Re-Identification, Domain Generalization

04/07/2020

CDL: Curriculum Dual Learning for Emotion-Controllable Response Generation

Lei Shen, Yang Feng

Keywords: Emotion-Controllable Generation, training process, response tasks, CDL

12/07/2020

Fair Generative Modeling via Weak Supervision

Kristy Choi, Aditya Grover, Trisha Singh, Rui Shu, Stefano Ermon

Keywords: Deep Learning - Generative Models and Autoencoders