Similarity Scoring for Dialogue Behaviour Comparison

Abstract: The differences in decision making between behavioural models of voice interfaces are hard to capture using existing measures for the absolute performance of such models. For instance, two models may have a similar task success rate, but very different ways of getting there. In this paper, we propose a general methodology to compute the similarity of two dialogue behaviour models and investigate different ways of computing scores on both the semantic and the textual level. Complementing absolute measures of performance, we test our scores on three different tasks and show the practical usability of the measures.

19/04/2021

Yali Du, Xue Yan, Xu Chen and
Jun Wang, Haifeng Zhang

audiovisual, audio-visual, source separation, singing, speech, graph, acappella

2:51

01/07/2020

Similarity Scoring for Dialogue Behaviour Comparison

Stefan Ultes, Wolfgang Maier

Comments

Similar Papers

WER-BERT: Automatic WER estimation with BERT in a balanced ordinal classification paradigm

Akshay Krishna Sheshadri, Anvesh Rao Vijjini, Sukhdeep Kharbanda

Keywords Abstract Paper

Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

Jaehyeon Kim, Jungil Kong, Juhee Son

Keywords Abstract Paper

Applications, Audio and Speech Processing

Unsupervised Abstractive Dialogue Summarization for Tete-a-Tetes

Xinyuan Zhang, Ruiyi Zhang, Manzil Zaheer, Amr Ahmed

Keywords Abstract Paper

Can You Put it All Together: Evaluating Conversational Agents' Ability to Blend Skills

Eric Michael Smith, Mary Williamson, Kurt Shuster and Jason Weston, Y-Lan Boureau

Keywords Abstract Paper

conversational agent, open-domain agent, model schemes, multi-task training

An analysis of mixed initiative and collaboration in information-seeking dialogues

Svitlana Vakulenko, Evangelos Kanoulas, Maarten Rijke

Keywords Abstract Paper

mixed initiative, conversational search, dialogue

Don’t Say That! Making Inconsistent Dialogue Unlikely with Unlikelihood Training

Margaret Li, Stephen Roller, Ilia Kulikov and Sean Welleck, Y-Lan Boureau, Kyunghyun Cho, Jason Weston

Keywords Abstract Paper

dialogue tasks, Unlikelihood Training, Generative models, maximum training

Robust Neural Machine Translation with ASR Errors

Haiyang Xue, Yang Feng, Shuhao Gu, Wei Chen

Keywords Abstract Paper

Learning from revisions: Quality assessment of claims in argumentation at scale

Gabriella Skitalinskaya, Jonas Klaff, Henning Wachsmuth

Keywords Abstract Paper

Estimating $\alpha$-Rank from A Few Entries with Low Rank Matrix Completion

Yali Du, Xue Yan, Xu Chen and Jun Wang, Haifeng Zhang

Keywords Abstract Paper

Optimization, Probabilistic Methods, Distributed Inference, Algorithms, Algorithms Evaluation

Incorporating Pragmatic Reasoning Communication into Emergent Language

Yipeng Kang, Tonghan Wang, Gerard de Melo

Keywords Abstract Paper

Interactive Speech and Noise Modeling for Speech Enhancement

Chengyu Zheng, Xiulian Peng, Yuan Zhang and Sriram Srinivasan, Yan Lu

Keywords Abstract Paper

Consistent Transcription and Translation of Speech

Matthias Sperber, Hendra Setiawan, Christian Gollan and Udhay Nallasamy, Matthias Paulik

Keywords Abstract Paper

speech translation, jointly speech, joint task, speech step

Multi-Domain Dialogue Acts and Response Co-Generation

Kai Wang, Junfeng Tian, Rui Wang and Xiaojun Quan, Jianxing Yu

Keywords Abstract Paper

Generating responses, task-oriented systems, response generation, automatic evaluations

Evaluating the Factual Consistency of Abstractive Text Summarization

Wojciech Kryscinski, Bryan McCann, Caiming Xiong, Richard Socher

Keywords Abstract Paper

assessing algorithms, natural inference, fact checking, auxiliary tasks

On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law

Damien Teney, Ehsan Abbasnejad, Kushal Kafle and Robik Shrestha, Christopher Kanan, Anton van den Hengel

Keywords Abstract Paper

Is Automated Topic Model Evaluation Broken? The Incoherence of Coherence

Alexander Hoyle, Pranav Goel, Andrew Hian-Cheong and Denis Peskov, Jordan Boyd-Graber, Philip Resnik

Keywords Abstract Paper

WiC-TSV: An evaluation benchmark for target sense verification of words in context

Anna Breit, Artem Revenko, Kiamehr Rezaee and Mohammad Taher Pilehvar, Jose Camacho-Collados

Keywords Abstract Paper

Recurrent Neural Network Language Models Always Learn English-Like Relative Clause Attachment

Forrest Davis, Marten van Schijndel

Keywords Abstract Paper

production, Recurrent Always, language models, RNN LMs

Local Explanation of Dialogue Response Generation

Yi-Lin Tuan, Connor Pryor, Wenhu Chen and Lise Getoor, William Yang Wang

Keywords Abstract Paper

machine learning

A unifying framework for modeling acoustic/prosodic entrainment: definition and evaluation on two large corpora

Ramiro H. Gálvez, Lara Gauder, Jordi Luque, Agustín Gravano

Keywords Abstract Paper

Multi-task Learning of Spoken Language Understanding by Integrating N-Best Hypotheses with Hierarchical Attention

Mingda Li, Xinyue Liu, Weitong Ruan and Luca Soldaini, Wael Hamza, Chengwei Su

Keywords Abstract Paper

Beyond accuracy: quantifying trial-by-trial behaviour of CNNs and humans by measuring error consistency

Robert Geirhos, Kristof Meding, Felix A. Wichmann

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Keywords Paper

Eric Michael Smith, Mary Williamson, Kurt Shuster and
Jason Weston, Y-Lan Boureau

Keywords Paper

Keywords Paper

Margaret Li, Stephen Roller, Ilia Kulikov and
Sean Welleck, Y-Lan Boureau, Kyunghyun Cho, Jason Weston

Keywords Paper

Keywords Paper

Keywords Paper

Yali Du, Xue Yan, Xu Chen and
Jun Wang, Haifeng Zhang

Keywords Paper

Keywords Paper

Chengyu Zheng, Xiulian Peng, Yuan Zhang and
Sriram Srinivasan, Yan Lu

Keywords Paper

Matthias Sperber, Hendra Setiawan, Christian Gollan and
Udhay Nallasamy, Matthias Paulik

Keywords Paper

Kai Wang, Junfeng Tian, Rui Wang and
Xiaojun Quan, Jianxing Yu

Keywords Paper

Keywords Paper

Damien Teney, Ehsan Abbasnejad, Kushal Kafle and
Robik Shrestha, Christopher Kanan, Anton van den Hengel

Keywords Paper

Alexander Hoyle, Pranav Goel, Andrew Hian-Cheong and
Denis Peskov, Jordan Boyd-Graber, Philip Resnik

Keywords Paper

Anna Breit, Artem Revenko, Kiamehr Rezaee and
Mohammad Taher Pilehvar, Jose Camacho-Collados

Keywords Paper

Keywords Paper

Yi-Lin Tuan, Connor Pryor, Wenhu Chen and
Lise Getoor, William Yang Wang

Keywords Paper

Keywords Paper

Mingda Li, Xinyue Liu, Weitong Ruan and
Luca Soldaini, Wael Hamza, Chengwei Su

Keywords Paper

Keywords Paper

Keywords Paper

Emily Dinan, Angela Fan, Adina Williams and
Jack Urbanek, Douwe Kiela, Jason Weston

Keywords Paper

Ryuichi Takanobu, Qi Zhu, Jinchao Li and
Baolin Peng, Jianfeng Gao, Minlie Huang

Keywords Paper

Keywords Paper

Keywords Paper

Marius Mosbach, Stefania Degaetano-Ortlieb, Marie-Pauline Krielke and
Badr M. Abdullah, Dietrich Klakow

Keywords Paper

Keywords Paper

Keywords Paper

Enrico Palumbo, Andrea Mezzalira, Cristina Marco and
Alessandro Manzotti, Daniele Amberti

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Mayee Chen, Karan Goel, Nimit Sohoni and
Fait Poms, Kayvon Fatahalian, Christopher Re

Keywords Paper