Dialogue Response Ranking Training with Large-Scale Human Feedback Data

16/11/2020

Dialogue Response Ranking Training with Large-Scale Human Feedback Data

Xiang Gao, Yizhe Zhang, Michel Galley, Chris Brockett, Bill Dolan

Keywords: feedback prediction, ranking problem, predicting feedback, open-domain models

Abstract Paper Similar Papers

Abstract: Existing open-domain dialog models are generally trained to minimize the perplexity of target human responses. However, some human replies are more engaging than others, spawning more followup interactions. Current conversational models are increasingly capable of producing turns that are context-relevant, but in order to produce compelling agents, these models need to be able to predict and optimize for turns that are genuinely engaging. We leverage social media feedback data (number of replies and upvotes) to build a large-scale training dataset for feedback prediction. To alleviate possible distortion between the feedback and engagingness, we convert the ranking problem to a comparison of response pairs which involve few confounding factors. We trained DialogRPT, a set of GPT-2 based models on 133M pairs of human feedback data and the resulting ranker outperformed several baselines. Particularly, our ranker outperforms the conventional dialog perplexity baseline with a large margin on predicting Reddit feedback. We finally combine the feedback prediction models and a human-like scoring model to rank the machine-generated dialog responses. Crowd-sourced human evaluation shows that our ranking method correlates better with real human preferences than baseline models.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Learning to summarize with human feedback

Nisan Stiennon, Long Ouyang, Jeffrey Wu and
Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Keywords Paper

0

0

0

0

3:17

04/07/2020

uBLEU: Uncertainty-Aware Automatic Evaluation Method for Open-Domain Dialogue Systems

Tsuta Yuma, Naoki Yoshinaga, Masashi Toyoda

Keywords Paper

Open-Domain Systems, uBLEU, Uncertainty-Aware Method, ΔBLEU

0

0

0

0

11:07

04/07/2020

Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation

Weixin Liang, James Zou, Zhou Yu

Keywords Paper

Automatic Evaluation, Open evaluation, dialog research, dialog evaluation

0

0

0

0

11:24

02/02/2021

Reinforced Imitative Graph Representation Learning for Mobile User Profiling: An Adversarial Training Perspective

Dongjie Wang, Pengyang Wang, Kunpeng Liu and
Yuanchun Zhou, Charles E Hughes, Yanjie Fu

Keywords Paper

0

0

0

0

18:46

19/04/2021

Recipes for building an open-domain chatbot

Stephen Roller, Emily Dinan, Naman Goyal and
Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Eric Michael Smith, Y-Lan Boureau, Jason Weston

Keywords Paper

0

0

0

1

11:33

19/08/2021

User Retention: A Causal Approach with Triple Task Modeling

Yang Zhang, Dong Wang, Qiang Li and
Yue Shen, Ziqi Liu, Xiaodong Zeng, Zhiqiang Zhang, Jinjie Gu, Derek F. Wong

Keywords Paper

Machine Learning, Deep Learning, Applications of Supervised Learning, Recommender Systems

0

0

0

0

13:53

04/07/2020

Dynamic Online Conversation Recommendation

Xingshan Zeng, Jing Li, Lu Wang and
Zhiming Mao, Kam-Fai Wong

Keywords Paper

Dynamic Recommendation, neural architecture, Trending topics, social users

0

0

0

0

11:36

16/11/2020

Human-centric dialog training via offline reinforcement learning

Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun and
Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard

Keywords Paper

dialog model, rl policy, rl, language models

0

0

0

0

12:10

05/12/2020

DAPPER: Learning domain-adapted persona representation using pretrained BERT and external memory

Prashanth Vijayaraghavan, Eric Chu, Deb Roy

Keywords Paper

0

0

0

0

14:48

04/07/2020

Large Scale Multi-Actor Generative Dialog Modeling

Alex Boyd, Raul Puri, Mohammad Shoeybi and
Mostofa Patwary, Bryan Catanzaro

Keywords Paper

Large Modeling, generation, style matching, automatic evaluations

0

0

0

0

11:49

04/07/2020

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight

Hengyi Cai, Hongshen Chen, Yonghao Song and
Cheng Zhang, Xiaofang Zhao, Dawei Yin

Keywords Paper

Data Manipulation, Neural Generation, learning, dialogue generation

0

0

0

1

9:39

19/08/2021

A Structure Self-Aware Model for Discourse Parsing on Multi-Party Dialogues

Ante Wang, Linfeng Song, Hui Jiang and
Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords Paper

Natural Language Processing, Dialogue, Discourse, Tagging, Chunking, and Parsing

0

0

0

0

8:33

18/07/2021

Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach

Qitian Wu, Hengrui Zhang, Xiaofeng Gao and
Junchi Yan, Hongyuan Zha

Keywords Paper

Applications, Recommender Systems

0

0

0

0

5:08

02/02/2021

Constructing a Fair Classifier with Generated Fair Data

Taeuk Jang, Feng Zheng, Xiaoqian Wang

Keywords Paper

0

0

0

0

15:58

19/04/2021

The Gutenberg dialogue dataset

Richard Csaky, Gábor Recski

Keywords Paper

0

0

0

0

10:14

04/07/2020

From Zero to Hero: Human-In-The-Loop Entity Linking in Low Resource Domains

Jan-Christoph Klie, Richard Eckart de Castilho, Iryna Gurevych

Keywords Paper

Human-In-The-Loop Linking, Entity linking, disambiguating mentions, annotation process

0

0

0

0

12:26

04/07/2020

You Impress Me: Dialogue Generation via Mutual Persona Perception

Qian Liu, Yihong Chen, Bei Chen and
Jian-Guang Lou, Zixuan Chen, Bin Zhou, Dongmei Zhang

Keywords Paper

Dialogue Generation, mimicking responses, cognitive science, understanding

0

0

0

0

10:13

26/04/2020

Learning from Rules Generalizing Labeled Exemplars

Abhijeet Awasthi, Sabyasachi Ghosh, Rasna Goyal, Sunita Sarawagi

Keywords Paper

Learning from Rules, Learning from limited labeled data, Weakly Supervised Learning

0

0

0

0

5:18

16/11/2020

Will I Sound Like Me? Improving Persona Consistency in Dialogues through Pragmatic Self-Consciousness

Hyunwoo Kim, Byeongchang Kim, Gunhee Kim

Keywords Paper

training, dialogue agents, generative agent, persona-based agents

0

0

0

0

11:24

04/07/2020

Unsupervised Opinion Summarization as Copycat-Review Generation

Arthur Bražinskas, Mirella Lapata, Ivan Titov

Keywords Paper

Unsupervised Summarization, Copycat-Review Generation, Opinion summarization, automatically summaries

0

0

0

0

10:55

16/11/2020

F1 is Not Enough! Models and Evaluation Towards User-Centered Explainable Question Answering

Hendrik Schuff, Heike Adel, Ngoc Thang Vu

Keywords Paper

reasoning process, user study, model selection, explainable systems

0

0

0

0

12:03

23/08/2020

Towards building an intelligent chatbot for customer service: Learning to respond at the appropriate time

Che Liu, Junfeng Jiang, Chao Xiong and
Yi Yang, Jieping Ye

Keywords Paper

customer service, triggering model, chatbot, self-supervised learning

0

0

0

0

10:34

14/06/2020

Towards Fairness in Visual Recognition: Effective Strategies for Bias Mitigation

Zeyu Wang, Klint Qinami, Ioannis Christos Karakozis and
Kyle Genova, Prem Nair, Kenji Hata, Olga Russakovsky

Keywords Paper

fairness, bias mitigation, recognition, domain shift, adversarial training

0

0

0

0

1:01

16/11/2020

Interactive Text Ranking with Bayesian Optimisation: A Case Study on Community QA and Summarisation

Edwin Simpson, Yang Gao, Iryna Gurevych

Keywords Paper

nlp applications, question answering, question summarisation, community answering

0

0

0

0

11:09

22/09/2020

Contextual meta-bandit for recommender systems selection

Marlesson R. O. Santana, Luckeciano C. Melo, Fernando H. F. Camargo and
Bruno Brandão, Anderson Soares, Renan M. Oliveira, Sandor Caetano

Keywords Paper

contextual bandits, hierarchical recommender systems, options framework, reinforcement learning

0

0

0

0

1:48

19/10/2020

Leveraging historical interaction data for improving conversational recommender system

Kun Zhou, Wayne Xin Zhao, Hui Wang and
Sirui Wang, Fuzheng Zhang, Zhongyuan Wang, Ji-Rong Wen

Keywords Paper

pre-training approach, conversational recommender system

0

0

0

0

6:44

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

04/07/2020

CDL: Curriculum Dual Learning for Emotion-Controllable Response Generation

Lei Shen, Yang Feng

Keywords Paper

Emotion-Controllable Generation, training process, response tasks, CDL

0

0

0

0

11:05

12/07/2020

DeBayes: a Bayesian method for debiasing network embeddings

Maarten Buyl, Tijl De Bie

Keywords Paper

Fairness, Equity, Justice, and Safety

0

0

0

0

14:27

22/09/2020

TAFA: Two-headed attention fused autoencoder for context-aware recommendations

Jin Peng Zhou, Zhaoyue Cheng, Felipe Perez, Maksims Volkovs

Keywords Paper

Deep Learning, Context-Aware Recommender Systems, Neural Attention Networks

0

0

0

0

2:06

07/06/2020

Top Comment or Flop Comment? Predicting and Explaining User Engagement in Online News Discussions

Julian Risch, Ralf Krestel

Keywords Paper

articles, classifiers, discussions, engagement, influences, networks, news, news articles, predictions, texts, traditional, words

0

0

0

0

10:25

12/07/2020

Optimization and Analysis of the pAp@k Metric for Recommender Systems

Gaurush Hiranandani, Warut Vijitbenjaronk, Sanmi Koyejo, Prateek Jain

Keywords Paper

Learning Theory

0

0

0

0

16:11

16/11/2020

The World is Not Binary: Learning to Rank with Grayscale Data for Dialogue Response Selection

Zibo Lin, Deng Cai, Yan Wang and
Xiaojiang Liu, Haitao Zheng, Shuming Shi

Keywords Paper

response selection, retrieval-based systems, learning-to-rank problem, learning-to-rank

0

0

0

0

12:03

04/07/2020

Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills

Eric Michael Smith, Mary Williamson, Kurt Shuster and
Jason Weston, Y-Lan Boureau

Keywords Paper

conversational agent, open-domain agent, model schemes, multi-task training

0

0

0

1

11:39

06/12/2021

Dynamic population-based meta-learning for multi-agent communication with natural language

Abhinav Gupta, Marc Lanctot, Angeliki Lazaridou

Keywords Paper

reinforcement learning and planning, robustness, meta learning

0

0

0

0

14:43

06/12/2021

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

Lin Guan, Mudit Verma, Suna (Sihang) Guo and
Ruohan Zhang, Subbarao Kambhampati

Keywords Paper

reinforcement learning and planning, machine learning

0

0

0

0

13:41

04/07/2020

Unsupervised Opinion Summarization with Noising and Denoising

Reinald Kim Amplayo, Mirella Lapata

Keywords Paper

Unsupervised Summarization, supervised models, abstractive summarization, Noising

0

0

0

0

12:16

22/09/2020

Improving one-class recommendation with multi-tasking on various preference intensities

Chu-Jen Shao, Hao-Ming Fu, Pu-Jen Cheng

Keywords Paper

implicit feedback, graph convolutional network, one-class recommendation, collaborative filtering

0

0

0

0

2:38

19/08/2021

Accounting for Confirmation Bias in Crowdsourced Label Aggregation

Meric Altug Gemalmaz, Ming Yin

Keywords Paper

Humans and AI, Human Computation and Crowdsourcing, Human-AI Collaboration, Human-Computer Interaction

0

0

0

0

14:19

07/09/2020

BCaR: Beginner Classifier as Regularization Towards Generalizable Re-ID

Masato Tamura, Tomoaki Yoshinaga

Keywords Paper

person re-identification, generalizable, soft label, knowledge distillation, Re-ID, domain generalization

0

0

0

0

6:53