GoChat: Goal-oriented chatbots with hierarchical reinforcement learning

25/07/2020

GoChat: Goal-oriented chatbots with hierarchical reinforcement learning

Jianfeng Liu, Feiyang Pan, Ling Luo

Keywords: dialogue system, reinforcement learning, goal-oriented chatbot

Abstract Paper Similar Papers

Abstract: A chatbot that converses like a human should be goal-oriented (i.e., be purposeful in conversation), which is beyond language generation. However, existing goal-oriented dialogue systems often heavily rely on cumbersome hand-crafted rules or costly labelled datasets, which limits the applicability. In this paper, we propose Goal-oriented Chatbots (GoChat), a framework for end-to-end training the chatbot to maximize the long-term return from offline multi-turn dialogue datasets. Our framework utilizes hierarchical reinforcement learning (HRL), where the high-level policy determines some sub-goals to guide the conversation towards the final goal, and the low-level policy fulfills the sub-goals by generating the corresponding utterance for response. In our experiments conducted on a real-world dialogue dataset for anti-fraud in financial, our approach outperforms previous methods on both the quality of response generation as well as the success rate of accomplishing the goal.

The video of this talk cannot be embedded. You can watch it here:

https://dl.acm.org/doi/10.1145/3397271.3401250#sec-supp

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at SIGIR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

23/08/2020

Towards building an intelligent chatbot for customer service: Learning to respond at the appropriate time

Che Liu, Junfeng Jiang, Chao Xiong and
Yi Yang, Jieping Ye

Keywords Paper

customer service, triggering model, chatbot, self-supervised learning

0

0

0

0

10:34

25/07/2020

Improving contextual language models for response retrieval in multi-turn conversation

Junyu Lu, Xiancong Ren, Yazhou Ren and
Ao Liu, Zenglin Xu

Keywords Paper

pre-trained language model, augmentation, response retrieval

0

0

0

0

8:08

04/07/2020

You Impress Me: Dialogue Generation via Mutual Persona Perception

Qian Liu, Yihong Chen, Bei Chen and
Jian-Guang Lou, Zixuan Chen, Bin Zhou, Dongmei Zhang

Keywords Paper

Dialogue Generation, mimicking responses, cognitive science, understanding

0

0

0

0

10:13

02/02/2021

Keyword-Guided Neural Conversational Model

Peixiang Zhong, Yong Liu, Hao Wang, Chunyan Miao

Keywords Paper

0

0

0

1

15:55

02/02/2021

Topic-Oriented Spoken Dialogue Summarization for Customer Service with Saliency-Aware Topic Modeling

Yicheng Zou, Lujun Zhao, Yangyang Kang and
Jun Lin, Minlong Peng, Zhuoren Jiang, Changlong Sun, Qi Zhang, Xuanjing Huang, Xiaozhong Liu

Keywords Paper

0

0

0

0

14:22

19/04/2021

Recipes for building an open-domain chatbot

Stephen Roller, Emily Dinan, Naman Goyal and
Da Ju, Mary Williamson, Yinhan Liu, Jing Xu, Myle Ott, Eric Michael Smith, Y-Lan Boureau, Jason Weston

Keywords Paper

0

0

0

1

11:33

16/11/2020

Human-centric dialog training via offline reinforcement learning

Natasha Jaques, Judy Hanwen Shen, Asma Ghandeharioun and
Craig Ferguson, Agata Lapedriza, Noah Jones, Shixiang Gu, Rosalind Picard

Keywords Paper

dialog model, rl policy, rl, language models

0

0

0

0

12:10

04/07/2020

Generate, Delete and Rewrite: A Three-Stage Framework for Improving Persona Consistency of Dialogue Generation

Haoyu Song, Yan Wang, Wei-Nan Zhang and
Xiaojiang Liu, Ting Liu

Keywords Paper

Persona Generation, persona-based task, personality-inconsistent problem, generating responses

0

0

0

0

9:19

04/07/2020

TransS-Driven Joint Learning Architecture for Implicit Discourse Relation Recognition

Ruifang He, Jian Wang, Fengyu Guo, Yugui Han

Keywords Paper

Implicit Recognition, discourse understanding, TransS-Driven Architecture, multi-level encoder

0

0

0

0

11:42

02/02/2021

Stylized Dialogue Response Generation Using Stylized Unpaired Texts

Yinhe Zheng, Zikai Chen, Rongsheng Zhang and
Shilei Huang, Xiaoxi Mao, Minlie Huang

Keywords Paper

0

0

0

0

15:13

04/07/2020

End-to-End Neural Pipeline for Goal-Oriented Dialogue Systems using GPT-2

Donghoon Ham, Jeong-Gwan Lee, Youngsoo Jang, Kee-Eung Kim

Keywords Paper

tracking flow, dialogue systems, human evaluation, End-to-End Pipeline

0

0

0

1

11:37

19/10/2020

Learning to profile: User meta-profile network for few-shot learning

Hao Gong, Qifang Zhao, Tianyu Li and
Derek Cho, DuyKhuong Nguyen

Keywords Paper

multi-task learning, multi-modal model, representation learning, meta-learning

0

0

0

1

12:10

06/12/2020

Zero-Resource Knowledge-Grounded Dialogue Generation

Linxiao Li, Can Xu, Wei Wu and
YUFAN ZHAO, Xueliang Zhao, Chongyang Tao

Keywords Paper

0

0

0

1

3:22

08/12/2020

Interactive Question Clarification in Dialogue via Reinforcement Learning

Xiang Hu, Zujie Wen, Yafang Wang and
Xiaolong Li, Gerard de Melo

Keywords Paper

0

0

0

0

14:46

08/12/2020

LAVA: Latent Action Spaces via Variational Auto-encoding for Dialogue Policy Optimization

Nurul Lubis, Christian Geishauser, Michael Heck and
Hsien-chin Lin, Marco Moresi, Carel van Niekerk, Milica Gasic

Keywords Paper

0

0

0

0

15:12

04/07/2020

Learning Efficient Dialogue Policy from Demonstrations through Shaping

Huimin Wang, Baolin Peng, Kam-Fai Wong

Keywords Paper

Demonstrations, learning progress, domain task, human evaluation

0

0

0

0

12:36

04/07/2020

uBLEU: Uncertainty-Aware Automatic Evaluation Method for Open-Domain Dialogue Systems

Tsuta Yuma, Naoki Yoshinaga, Masashi Toyoda

Keywords Paper

Open-Domain Systems, uBLEU, Uncertainty-Aware Method, ΔBLEU

0

0

0

0

11:07

16/11/2020

Dialogue Response Ranking Training with Large-Scale Human Feedback Data

Xiang Gao, Yizhe Zhang, Michel Galley and
Chris Brockett, Bill Dolan

Keywords Paper

feedback prediction, ranking problem, predicting feedback, open-domain models

0

0

0

0

11:57

04/07/2020

Learning an Unreferenced Metric for Online Dialogue Evaluation

Koustuv Sinha, Prasanna Parthasarathi, Jasmine Wang and
Ryan Lowe, William L. Hamilton, Joelle Pineau

Keywords Paper

Online Evaluation, inference, online setting, Unreferenced Metric

0

0

0

0

6:58

19/08/2021

A Structure Self-Aware Model for Discourse Parsing on Multi-Party Dialogues

Ante Wang, Linfeng Song, Hui Jiang and
Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords Paper

Natural Language Processing, Dialogue, Discourse, Tagging, Chunking, and Parsing

0

0

0

0

8:33

16/11/2020

Spot The Bot: A Robust and Efficient Framework for the Evaluation of Conversational Dialogue Systems

Jan Deriu, Don Tuggener, Pius von Däniken and
Jon Ander Campos, Alvaro Rodrigo, Thiziri Belkacem, Aitor Soroa, Eneko Agirre, Mark Cieliebak

Keywords Paper

evalu-ation methods, conversational systems, chat bots, spot bot

0

0

0

0

12:11

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

18/07/2021

Targeted Data Acquisition for Evolving Negotiation Agents

Minae Kwon, Sidd Karamcheti, Mariano-Florentino Cuellar, Dorsa Sadigh

Keywords Paper

Reinforcement Learning and Planning, Multi-Agent RL

0

0

0

0

5:15

16/11/2020

Dialogue Distillation: Open-Domain Dialogue Augmentation Using Unpaired Data

Rongsheng Zhang, Yinhe Zheng, Jianzhi Shao and
Xiaoxi Mao, Yadong Xi, Minlie Huang

Keywords Paper

collecting data, automatic evaluation, open-domain systems, neural models

0

0

0

0

11:01

18/07/2021

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

4:57

02/02/2021

Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues

Ruijian Xu, Chongyang Tao, Daxin Jiang and
Xueliang Zhao, Dongyan Zhao, Rui Yan

Keywords Paper

0

0

0

1

16:40

16/11/2020

Counterfactual Off-Policy Training for Neural Dialogue Generation

Qingfu Zhu, Wei-Nan Zhang, Ting Liu, William Yang Wang

Keywords Paper

open-domain generation, data problem, training, counterfactual reasoning

0

0

0

0

11:37

08/12/2020

EmpDG: Multi-resolution Interactive Empathetic Dialogue Generation

Qintong Li, Hongshen Chen, Zhaochun Ren and
Pengjie Ren, Zhaopeng Tu, Zhumin Chen

Keywords Paper

0

0

0

0

14:43

04/07/2020

Data Manipulation: Towards Effective Instance Learning for Neural Dialogue Generation via Learning to Augment and Reweight

Hengyi Cai, Hongshen Chen, Yonghao Song and
Cheng Zhang, Xiaofang Zhao, Dawei Yin

Keywords Paper

Data Manipulation, Neural Generation, learning, dialogue generation

0

0

0

1

9:39

05/12/2020

DAPPER: Learning domain-adapted persona representation using pretrained BERT and external memory

Prashanth Vijayaraghavan, Eric Chu, Deb Roy

Keywords Paper

0

0

0

0

14:48

02/02/2021

Carbon to Diamond: An Incident Remediation Assistant System From Site Reliability Engineers’ Conversations in Hybrid Cloud Operations

Suranjana Samanta, Ajay Gupta, Prateeti Mohapatra, Amar Prakash Azad

Keywords Paper

0

0

0

0

17:58

29/06/2020

Challenges in chatbot development: A study of stack overflow posts

Ahmad Abdellatif, Diego Costa, Khaled Badran and
Rabe Abdalkareem, Emad Shihab

Keywords Paper

0

0

0

0

14:34

06/12/2020

Online Bayesian Persuasion

Matteo Castiglioni, Andrea Celli, Alberto Marchesi, Nicola Gatti

Keywords Paper

0

0

0

0

3:00

04/07/2020

Semi-Supervised Dialogue Policy Learning via Stochastic Reward Estimation

Xinting Huang, Jianzhong Qi, Yu Sun, Rui Zhang

Keywords Paper

Semi-Supervised Learning, generalization function, Stochastic Estimation, Dialogue optimization

0

0

0

0

11:31

26/04/2020

Learning Nearly Decomposable Value Functions Via Communication Minimization

Tonghan Wang, Jianhao Wang, Chongyi Zheng, Chongjie Zhang

Keywords Paper

Multi-agent reinforcement learning, Nearly decomposable value function, Minimized communication

0

0

0

0

5:00

16/11/2020

BiST: Bi-directional Spatio-Temporal Reasoning for Video-Grounded Dialogues

Hung Le, Doyen Sahoo, Nancy Chen, Steven C.H. Hoi

Keywords Paper

video-grounded dialogues, high-resolution queries, video setting, bi-directional learning

0

0

0

0

11:05

06/12/2020

Learning Multi-Agent Communication through Structured Attentive Reasoning

Murtaza Rangwala, Ryan K Williams

Keywords Paper

0

0

0

1

3:21

04/07/2020

Multi-Agent Task-Oriented Dialog Policy Learning with Role-Aware Reward Decomposition

Ryuichi Takanobu, Runze Liang, Minlie Huang

Keywords Paper

pretraining, Multi-Agent Learning, Role-Aware Decomposition, reinforcement learning

0

0

0

0

13:00

19/04/2021

Dialogue act-based breakdown detection in negotiation dialogues

Atsuki Yamaguchi, Kosui Iwasa, Katsuhide Fujita

Keywords Paper

0

0

0

0

11:19

04/07/2020

Learning to Customize Model Structures for Few-shot Dialogue Generation Tasks

Yiping Song, Zequn Liu, Wei Bi and
Rui Yan, Ming Zhang

Keywords Paper

Few-shot Tasks, open-domain systems, generative models, meta-learning framework

0

0

0

0

11:43