More Diverse Dialogue Datasets via Diversity-Informed Data Collection

04/07/2020

Katherine Stasaski, Grace Hui Yang, Marti A. Hearst

Keywords: Automated dialogue, diversity problem, Diversity-Informed Collection, emotion classification

Abstract: Automated generation of conversational dialogue using modern neural architectures has made notable advances. However, these models often produce uninteresting, predictable responses; this is known as the diversity problem. We introduce a new strategy to address this problem, called Diversity-Informed Data Collection. Unlike prior approaches, which modify model architectures to solve the problem, this method uses dynamically computed corpus-level statistics to determine which conversational participants to collect data from. Diversity-Informed Data Collection produces significantly more diverse data than baseline data collection methods and yields better results on two downstream tasks: emotion classification and dialogue generation. This method is generalizable and can be used with other corpus-level metrics.
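
To make the idea of selecting participants from corpus-level statistics concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the authors' implementation: the abstract does not name the metric or the selection rule, so the distinct-2 ratio (unique bigrams over total bigrams), the leave-one-out scoring, and the names `distinct_2` and `rank_participants` are all hypothetical choices made for this example.

```python
from collections import Counter
from typing import Dict, List, Tuple


def bigrams(utterance: str) -> List[Tuple[str, str]]:
    """Split an utterance on whitespace and return its adjacent word pairs."""
    tokens = utterance.lower().split()
    return list(zip(tokens, tokens[1:]))


def distinct_2(utterances: List[str]) -> float:
    """Assumed corpus-level metric: unique bigrams divided by total bigrams."""
    counts = Counter(bg for u in utterances for bg in bigrams(u))
    total = sum(counts.values())
    return len(counts) / total if total else 0.0


def rank_participants(corpus: Dict[str, List[str]]) -> List[str]:
    """Rank participants by how much the corpus metric drops without them.

    A large positive contribution means the participant's utterances make
    the corpus more diverse, so (under this sketch's assumption) they would
    be preferred when collecting further data.
    """
    everything = [u for utts in corpus.values() for u in utts]
    full_score = distinct_2(everything)
    contribution = {}
    for pid in corpus:
        rest = [u for other, utts in corpus.items() if other != pid for u in utts]
        contribution[pid] = full_score - distinct_2(rest)
    return sorted(corpus, key=lambda pid: contribution[pid], reverse=True)


# Toy corpus with made-up utterances, keyed by hypothetical participant IDs.
corpus = {
    "p1": ["i am fine", "i am fine"],
    "p2": ["the weather turned stormy today", "my cat chased a red balloon"],
    "p3": ["i am good", "i am fine thanks"],
}
ranking = rank_participants(corpus)
print(ranking)
```

Because the scores are recomputed from the current corpus each time, the ranking can change as new utterances arrive, which is one way to read "dynamically computed" in the abstract. Any other corpus-level metric could be substituted for `distinct_2` without changing the rest of the sketch.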

The talk and the respective paper were published at the ACL 2020 virtual conference.

Similar Papers

04/07/2020

Multi-Domain Dialogue Acts and Response Co-Generation

Kai Wang, Junfeng Tian, Rui Wang, Xiaojun Quan, Jianxing Yu

Keywords: Generating responses, task-oriented systems, response generation, automatic evaluations

10:01

04/07/2020

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight

Hengyi Cai, Hongshen Chen, Yonghao Song, Cheng Zhang, Xiaofang Zhao, Dawei Yin

Keywords: Data Manipulation, Neural Generation, learning, dialogue generation

9:39

08/12/2020

Meet Changes with Constancy: Learning Invariance in Multi-Source Translation

Jianfeng Liu, Ling Luo, Xiang Ao, Yan Song, Haoran Xu, Jian Ye

13:35

04/07/2020

Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation

Haoyu Song, Yan Wang, Wei-Nan Zhang, Xiaojiang Liu, Ting Liu

Keywords: Persona Generation, persona-based task, personality-inconsistent problem, generating responses

9:19

04/07/2020

Diversifying Dialogue Generation with Non-Conversational Text

Hui Su, Xiaoyu Shen, Sanqiang Zhao, Zhou Xiao, Pengwei Hu, Randy Zhong, Cheng Niu, Jie Zhou

Keywords: Diversifying Generation, low-diversity problem, open-domain generation, dialogue generation

10:53

02/02/2021

SARG: A Novel Semi Autoregressive Generator for Multi-turn Incomplete Utterance Restoration

Mengzuo Huang, Feng Li, Wuhe Zou, Weidong Zhang

14:50

19/04/2021

Enriching non-autoregressive transformer with syntactic and semantic structures for neural machine translation

Ye Liu, Yao Wan, Jianguo Zhang, Wenting Zhao, Philip Yu

10:18

03/05/2021

Deep Encoder, Shallow Decoder: Reevaluating Non-autoregressive Machine Translation

Jungo Kasai, Nikolaos Pappas, Hao Peng, James Cross, Noah Smith

Keywords: Machine Translation, Sequence Modeling, Natural Language Processing

5:04

04/07/2020

Neural Data-to-Text Generation via Jointly Learning the Segmentation and Correspondence

Xiaoyu Shen, Ernie Chang, Hui Su, Cheng Niu, Dietrich Klakow

Keywords: Neural Generation, Segmentation, data-to-text tasks, neural model

9:09

04/07/2020

You Impress Me: Dialogue Generation via Mutual Persona Perception

Qian Liu, Yihong Chen, Bei Chen, Jian-Guang Lou, Zixuan Chen, Bin Zhou, Dongmei Zhang

Keywords: Dialogue Generation, mimicking responses, cognitive science, understanding

10:13

01/07/2020

How Self-Attention Improves Rare Class Performance in a Question-Answering Dialogue Agent

Adam Stiff, Qi Song, Eric Fosler-Lussier

7:55

08/12/2020

Evaluating Cross-Lingual Transfer Learning Approaches in Multilingual Conversational Agent Models

Lizhen Tan, Olga Golovneva

9:23

03/05/2021

NAS-Bench-ASR: Reproducible Neural Architecture Search for Speech Recognition

Abhinav Mehrotra, Alberto Gil Couto Pimentel Ramos, Sourav Bhattacharya, Łukasz Dudziak, Ravichander Vipperla, Thomas C Chau, Mohamed Abdelfattah, Samin Ishtiaq, Nic Lane

Keywords: Benchmark, NAS, ASR

4:50

16/11/2020

Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

Rongsheng Zhang, Yinhe Zheng, Jianzhi Shao, Xiaoxi Mao, Yadong Xi, Minlie Huang

Keywords: collecting data, automatic evaluation, open-domain systems, neural models

11:01

01/07/2020

It’s About Time: Turn-Entry Timing For Situated Human-Robot Dialogue

Felix Gervits, Ravenna Thielstrom, Antonio Roque, Matthias Scheutz

12:00

16/11/2020

Learning a Simple and Effective Model for Multi-turn Response Generation with Auxiliary Tasks

Yufan Zhao, Can Xu, Wei Wu

Keywords: multi-turn generation, response generation, word recovery, utterance recovery

10:08

19/08/2021

A Structure Self-Aware Model for Discourse Parsing on Multi-Party Dialogues

Ante Wang, Linfeng Song, Hui Jiang, Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords: Natural Language Processing, Dialogue, Discourse, Tagging, Chunking, and Parsing

8:33

19/10/2020

Learning to generate reformulation actions for scalable conversational query understanding

Zihan Xu, Jiangang Zhu, Ling Geng, Yang Yang, Bojia Lin, Daxin Jiang

Keywords: contextual query reformulation, question answering, conversational query understanding

6:58

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords: downstream tasks, machine learning, explainability techniques, diverse techniques

11:24

06/12/2020

Bayesian Attention Modules

Xinjie Fan, Shujian Zhang, Bo Chen, Mingyuan Zhou

3:32

16/11/2020

Repulsive Attention: Rethinking Multi-head Attention as Bayesian Inference

Bang An, Jie Lyu, Zhenyi Wang, Chunyuan Li, Changwei Hu, Fei Tan, Ruiyi Zhang, Yifan Hu, Changyou Chen

Keywords: natural applications, attention collapse, neural mechanism, bayesian perspective

9:29

04/07/2020

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos, Ryan Cotterell, Naoaki Okazaki

Keywords: Measuring Difficulty, generation, asymmetric difficulty, machine difficulty

6:52

19/04/2021

Recipes for building an open-domain chatbot

Stephen Roller, Emily Dinan, Naman Goyal, Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Eric Michael Smith, Y-Lan Boureau, Jason Weston

11:33

16/11/2020

Transformer Based Multi-Source Domain Adaptation

Dustin Wright, Isabelle Augenstein

Keywords: unsupervised adaptation, cnns, rnns, domain classifiers

11:30

25/07/2020

Improving contextual language models for response retrieval in multi-turn conversation

Junyu Lu, Xiancong Ren, Yazhou Ren, Ao Liu, Zenglin Xu

Keywords: pre-trained language model, augmentation, response retrieval

8:08

01/07/2020

DLGNet: A Transformer-based Model for Dialogue Response Generation

Olabiyi Oluwatobi, Erik Mueller

15:52

19/04/2021

Zero-shot neural passage retrieval via domain-targeted synthetic question generation

Ji Ma, Ivan Korotkov, Yinfei Yang, Keith Hall, Ryan McDonald

12:47

06/12/2021

Structured Reordering for Modeling Latent Alignments in Sequence Transduction

Bailin Wang, Mirella Lapata, Ivan Titov

Keywords: language

15:00

03/05/2021

Share or Not? Learning to Schedule Language-Specific Capacity for Multilingual Translation

Biao Zhang, Ankur Bapna, Rico Sennrich, Orhan Firat

Keywords: multilingual transformer, multilingual translation, language-specific modeling, conditional computation

15:04

16/11/2020

Long-Short Term Masking Transformer: A Simple but Effective Baseline for Document-level Neural Machine Translation

Pei Zhang, Boxing Chen, Niyu Ge, Kai Fan

Keywords: document-level translation, document-level systems, context-aware architecture, transformer

6:36

04/07/2020

Harvesting and Refining Question-Answer Pairs for Unsupervised QA

Zhongli Li, Wenhui Wang, Li Dong, Furu Wei, Ke Xu

Keywords: Unsupervised QA, Question Answering, Question QA, QA

10:28

04/07/2020

Language-aware Interlingua for Multilingual Neural Machine Translation

Changfeng Zhu, Heng Yu, Shanbo Cheng, Weihua Luo

Keywords: Multilingual Translation, low-resource scenarios, Language-aware Interlingua, NMT

6:09

12/07/2020

Emergence of Separable Manifolds in Deep Language Representations

Jonathan Mamou, Hang Le, Miguel del Rio Fernandez, Cory Stephenson, Hanlin Tang, Yoon Kim, SueYeon Chung

Keywords: Applications - Neuroscience, Cognitive Science, Biology and Health

14:24

19/08/2021

ALaSca: an Automated approach for Large-Scale Lexical Substitution

Caterina Lacerra, Tommaso Pasini, Rocco Tripodi, Roberto Navigli

Keywords: Natural Language Processing, Natural Language Semantics, Resources and Evaluation

14:27

06/12/2021

Learning and Generalization in RNNs

Abhishek Panigrahi, Navin Goyal

Keywords: deep learning, optimization

15:51

02/02/2021

Keyword-Guided Neural Conversational Model

Peixiang Zhong, Yong Liu, Hao Wang, Chunyan Miao

15:55

26/08/2020

Improving Maximum Likelihood Training for Text Generation with Density Ratio Estimation

Yuxuan Song, Ning Miao, Hao Zhou, Lantao Yu, Mingxuan Wang, Lei Li

12:32

04/07/2020

Automatic Detection of Generated Text is Easiest when Humans are Fooled

Daphne Ippolito, Daniel Duckworth, Chris Callison-Burch, Douglas Eck

Keywords: Automatic Text, detection, humanness systems, neural modelling

11:01

04/07/2020

Variational Neural Machine Translation with Normalizing Flows

Hendra Setiawan, Matthias Sperber, Udhyakumar Nallasamy, Matthias Paulik

Keywords: Variational Translation, Variational VNMT, Variational, generation translations

7:09

04/07/2020

Posterior Control of Blackbox Generation

Xiang Lisa Li, Alexander Rush

Keywords: Posterior Generation, Text generation, deep models, neural models

11:47