Learning an Unreferenced Metric for Online Dialogue Evaluation

04/07/2020

Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang, Ryan Lowe, William L. Hamilton, Joelle Pineau

Keywords: Online Evaluation, inference, online setting, Unreferenced Metric

Abstract: Evaluating the quality of a dialogue interaction between two agents is a difficult task, especially in open-domain chit-chat style dialogue. There have been recent efforts to develop automatic dialogue evaluation metrics, but most of them do not generalize to unseen datasets and/or need a human-generated reference response during inference, which makes them infeasible for online evaluation. Here, we propose an unreferenced automated evaluation metric that uses large pre-trained language models to extract latent representations of utterances, and leverages the temporal transitions that exist between them. We show that our model achieves higher correlation with human annotations in an online setting, while not requiring true responses for comparison during inference.
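
As a rough illustration of the idea in the abstract, the following is a minimal sketch in Python, not the authors' implementation. It assumes a toy hashed bag-of-words `encode` function in place of a large pre-trained language model, a simple untrained recurrent update to model the transitions between context utterances, and a hypothetical class name `ToyUnreferencedScorer`; the feature combination fed to the final scorer is likewise an illustrative choice. What it shows is the interface: a score is computed from the dialogue context and a candidate response only, with no human-written reference response.

```python
import math
import random
import zlib

DIM = 16  # toy embedding size (assumption, chosen only for illustration)


def encode(utterance):
    """Toy stand-in for a pre-trained language model: hashed bag of words."""
    vec = [0.0] * DIM
    for token in utterance.lower().split():
        # crc32 is stable across runs, unlike Python's built-in hash() for str
        vec[zlib.crc32(token.encode("utf-8")) % DIM] += 1.0
    norm = math.sqrt(sum(v * v for v in vec)) or 1.0
    return [v / norm for v in vec]


def matvec(matrix, vector):
    """Multiply a list-of-rows matrix by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


class ToyUnreferencedScorer:
    """Hypothetical scorer with random (untrained) weights."""

    def __init__(self, seed=0):
        rng = random.Random(seed)

        def mat(rows, cols):
            return [[rng.uniform(-0.5, 0.5) for _ in range(cols)]
                    for _ in range(rows)]

        self.w_in = mat(DIM, DIM)    # input projection
        self.w_rec = mat(DIM, DIM)   # recurrent (utterance-to-utterance) weights
        self.w_out = [rng.uniform(-0.5, 0.5) for _ in range(4 * DIM)]

    def summarize(self, context):
        """Fold the context utterances, in order, into one state vector."""
        h = [0.0] * DIM
        for utterance in context:
            x = matvec(self.w_in, encode(utterance))
            r = matvec(self.w_rec, h)
            h = [math.tanh(a + b) for a, b in zip(x, r)]
        return h

    def score(self, context, response):
        """Return a value in (0, 1); no reference response is needed."""
        c = self.summarize(context)
        r = encode(response)
        # Illustrative feature combination: both vectors plus their
        # element-wise product and difference.
        feats = (c + r
                 + [a * b for a, b in zip(c, r)]
                 + [a - b for a, b in zip(c, r)])
        logit = sum(w * f for w, f in zip(self.w_out, feats))
        return 1.0 / (1.0 + math.exp(-logit))


scorer = ToyUnreferencedScorer()
context = ["hi there", "hello, how are you?"]
print(scorer.score(context, "i am fine, thanks"))
```

With random, untrained weights the printed score carries no meaning; a usable metric would need these weights to be learned from dialogue data.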

The talk and the respective paper were published at the ACL 2020 virtual conference.

Similar Papers

16/11/2020

Multi-View Sequence-to-Sequence Models with Conversational Structure for Abstractive Dialogue Summarization

Jiaao Chen, Diyi Yang

Keywords: text summarization, nlp, summarizing text, human-humanmachine interaction

12:02

02/02/2021

Keyword-Guided Neural Conversational Model

Peixiang Zhong, Yong Liu, Hao Wang, Chunyan Miao

15:55

25/07/2020

An analysis of mixed initiative and collaboration in information-seeking dialogues

Svitlana Vakulenko, Evangelos Kanoulas, Maarten de Rijke

Keywords: mixed initiative, conversational search, dialogue

8:43

02/02/2021

Unsupervised Abstractive Dialogue Summarization for Tete-a-Tetes

Xinyuan Zhang, Ruiyi Zhang, Manzil Zaheer, Amr Ahmed

16:17

25/07/2020

Improving contextual language models for response retrieval in multi-turn conversation

Junyu Lu, Xiancong Ren, Yazhou Ren, Ao Liu, Zenglin Xu

Keywords: pre-trained language model, augmentation, response retrieval

8:08

16/11/2020

Structured Attention for Unsupervised Dialogue Structure Induction

Liang Qiu, Yizhou Zhao, Weiyan Shi, Yuan Liang, Feng Shi, Tao Yuan, Zhou Yu, Song-Chun Zhu

Keywords: inducing representation, computational linguistics, dialogue design, discourse analysis

9:58

04/07/2020

Diversifying Dialogue Generation with Non-Conversational Text

Hui Su, Xiaoyu Shen, Sanqiang Zhao, Zhou Xiao, Pengwei Hu, Randy Zhong, Cheng Niu, Jie Zhou

Keywords: Diversifying Generation, low-diversity problem, open-domain generation, dialogue generation

10:53

16/11/2020

Continuity of Topic, Interaction, and Query: Learning to Quote in Online Conversations

Lingzhi Wang, Jing Li, Xingshan Zeng, Haisong Zhang, Kam-Fai Wong

Keywords: persuasions, automatic generation, language generation, encoder-decoder framework

11:43

02/02/2021

A Controllable Model of Grounded Response Generation

Zeqiu Wu, Michel Galley, Chris Brockett, Yizhe Zhang, Xiang Gao, Chris Quirk, Rik Koncel-Kedziorski, Jianfeng Gao, Hannaneh Hajishirzi, Mari Ostendorf, Bill Dolan

16:46

04/07/2020

You Impress Me: Dialogue Generation via Mutual Persona Perception

Qian Liu, Yihong Chen, Bei Chen, Jian-Guang Lou, Zixuan Chen, Bin Zhou, Dongmei Zhang

Keywords: Dialogue Generation, mimicking responses, cognitive science, understanding

10:13

12/07/2020

Deep Graph Random Process for Relational-Thinking-Based Speech Recognition

Huang Hengguan, Fuzhao Xue, Hao Wang, Ye Wang

Keywords: Applications - Language, Speech and Dialog

15:07

06/12/2021

Local Explanation of Dialogue Response Generation

Yi-Lin Tuan, Connor Pryor, Wenhu Chen, Lise Getoor, William Yang Wang

Keywords: machine learning

13:14

04/07/2020

Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation

Haoyu Song, Yan Wang, Wei-Nan Zhang, Xiaojiang Liu, Ting Liu

Keywords: Persona Generation, persona-based task, personality-inconsistent problem, generating responses

9:19

16/11/2020

Online Conversation Disentanglement with Pointer Networks

Tao Yu, Shafiq Joty

Keywords: conversation disentanglement, generalization, time-consuming engineering, disentanglement

11:26

16/11/2020

BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues

Hung Le, Doyen Sahoo, Nancy Chen, Steven C.H. Hoi

Keywords: video-grounded dialogues, high-resolution queries, video setting, bi-directional learning

11:05

02/02/2021

Topic-Aware Multi-turn Dialogue Modeling

Yi Xu, Hai Zhao, Zhuosheng Zhang

15:28

06/12/2021

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye, Xisen Jin, Xiang Ren

Keywords: machine learning, fairness, language

13:17

02/02/2021

Filling the Gap of Utterance-aware and Speaker-aware Representation for Multi-turn Dialogue

Longxiang Liu, Zhuosheng Zhang, Hai Zhao, Xi Zhou, Xiang Zhou

18:11

08/12/2020

MEISD: A Multimodal Multi-Label Emotion, Intensity and Sentiment Dialogue Dataset for Emotion Recognition and Sentiment Analysis in Conversations

Mauajama Firdaus, Hardik Chauhan, Asif Ekbal, Pushpak Bhattacharyya

11:06

04/07/2020

Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training

Margaret Li, Stephen Roller, Ilia Kulikov, Sean Welleck, Y-Lan Boureau, Kyunghyun Cho, Jason Weston

Keywords: dialogue tasks, Unlikelihood Training, Generative models, maximum training

11:26

05/12/2020

FERNet: Fine-grained extraction and reasoning network for emotion recognition in dialogues

Yingmei Guo, Zhiyong Wu, Mingxing Xu

10:16

25/07/2020

GoChat: Goal-oriented chatbots with hierarchical reinforcement learning

Jianfeng Liu, Feiyang Pan, Ling Luo

Keywords: dialogue system, reinforcement learning, goal-oriented chatbot

9:15

08/12/2020

EmpDG: Multi-resolution Interactive Empathetic Dialogue Generation

Qintong Li, Hongshen Chen, Zhaochun Ren, Pengjie Ren, Zhaopeng Tu, Zhumin Chen

14:43

26/04/2020

Non-Autoregressive Dialog State Tracking

Hung Le, Richard Socher, Steven C.H. Hoi

Keywords: task-oriented, dialogues, dialogue state tracking, non-autoregressive

5:23

04/07/2020

FEQA: A Question Answering Evaluation Framework for Faithfulness Assessment in Abstractive Summarization

Esin Durmus, He He, Mona Diab

Keywords: Faithfulness Assessment, Abstractive Summarization, evaluating summary, reading comprehension

12:13

02/02/2021

Do Response Selection Models Really Know What’s Next? Utterance Manipulation Strategies for Multi-turn Response Selection

Taesun Whang, Dongyub Lee, Dongsuk Oh, Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

17:37

01/07/2020

How Self-Attention Improves Rare Class Performance in a Question-Answering Dialogue Agent

Adam Stiff, Qi Song, Eric Fosler-Lussier

7:55

01/07/2020

Neural Multi-task Text Normalization and Sanitization with Pointer-Generator

Hoang Nguyen, Sandro Cavallari

9:16

04/07/2020

TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition

Ruifang He, Jian Wang, Fengyu Guo, Yugui Han

Keywords: Implicit Recognition, discourse understanding, TransS-Driven Architecture, multi-level encoder

11:42

02/02/2021

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Alexander Podolskiy, Dmitry Lipin, Andrey Bout, Ekaterina Artemova, Irina Piontkovskaya

16:08

02/02/2021

Unsupervised Summarization for Chat Logs with Topic-Oriented Ranking and Context-Aware Auto-Encoders

Yicheng Zou, Jun Lin, Lujun Zhao, Yangyang Kang, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuanjing Huang, Xiaozhong Liu

14:21

01/07/2020

Syntactic Parsing in Humans and Machines

Paola Merlo

44:12

26/04/2020

Neural Module Networks for Reasoning over Text

Nitish Gupta, Kevin Lin, Dan Roth, Sameer Singh, Matt Gardner

Keywords: question answering, compositionality, neural module networks, multi-step reasoning, reading comprehension

4:36

19/08/2021

A Streaming End-to-End Framework For Spoken Language Understanding

Nihal Potdar, Anderson Raymundo Avila, Chao Xing, Dong Wang, Yiran Cao, Xiao Chen

Keywords: Natural Language Processing, Dialogue, Speech

14:09

04/07/2020

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2

Donghoon Ham, Jeong-Gwan Lee, Youngsoo Jang, Kee-Eung Kim

Keywords: tracking flow, dialogue systems, human evaluation, End-to-End Pipeline

11:37

06/12/2020

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data

Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra

3:29

25/04/2020

An Honest Conversation: Transparently Combining Machine and Human Speech Assistance in Public Spaces

Thomas Reitmaier, Simon Robinson, Jennifer Pearson, Dani Kalarikalayil Raju, Matt Jones

Keywords: conversational agents, speech appliances, public space interaction, emergent users

15:04

16/11/2020

Consistent Transcription and Translation of Speech

Matthias Sperber, Hendra Setiawan, Christian Gollan, Udhay Nallasamy, Matthias Paulik

Keywords: speech translation, jointly speech, joint task, speech step

11:52

16/11/2020

Human-centric dialog training via offline reinforcement learning

Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun, Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard

Keywords: dialog model, rl policy, rl, language models

12:10

04/07/2020

More Diverse Dialogue Datasets via Diversity-Informed Data Collection

Katherine Stasaski, Grace Hui Yang, Marti A. Hearst

Keywords: Automated dialogue, diversity problem, Diversity-Informed Collection, emotion classification

12:06