Distilling knowledge for fast retrieval-based chat-bots

25/07/2020

Distilling knowledge for fast retrieval-based chat-bots

Amir Vakili Tahami, Kamyar Ghajar, Azadeh Shakery

Keywords: retrieval-based chat-bot, response ranking, neural information retrieval

Abstract Paper Similar Papers

Abstract: Response retrieval is a subset of neural ranking in which a model selects a suitable response from a set of candidates given a conversation history. Retrieval-based chat-bots are typically employed in information seeking conversational systems such as customer support agents. To make pairwise comparisons between a conversation history and a candidate response, two approaches are common: cross-encoders performing full self-attention over the pair and bi-encoders encoding the pair separately. The former gives better prediction quality but is too slow for practical use. In this paper, we propose a new cross-encoder architecture and transfer knowledge from this model to a bi-encoder model using distillation. This effectively boosts bi-encoder performance at no cost during inference time. We perform a detailed analysis of this approach on three response retrieval datasets.

The video of this talk cannot be embedded. You can watch it here:

https://dl.acm.org/doi/10.1145/3397271.3401296#sec-supp

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at SIGIR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Decoupled and Memory-Reinforced Networks: Towards Effective Feature Learning for One-Step Person Search

Chuchu Han, Zhedong Zheng, Changxin Gao and
Nong Sang, Yi Yang

Keywords Paper

0

0

0

0

10:34

01/07/2020

Multi-Task Supervised Pretraining for Neural Domain Adaptation

Sara Meftah, Nasredine Semmar, Mohamed-Ayoub Tahiri and
Youssef Tamaazousti, Hassane Essafi, Fatiha Sadat

Keywords Paper

0

0

0

0

15:47

18/07/2021

Unsupervised Representation Learning via Neural Activation Coding

Yookoon Park, Sangho Lee, Gunhee Kim, David Blei

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

13:50

02/06/2020

Entity Summarization with User Feedback

Qingxia Liu, Yue Chen, Gong Cheng and
Evgeny Kharlamov, Junyou Li, Yuzhong Qu

Keywords Paper

0

0

0

0

21:30

04/07/2020

Estimating the influence of auxiliary tasks for multi-task learning of sequence tagging tasks

Fynn Schröder, Chris Biemann

Keywords Paper

multi-task tasks, MTL, TL, MTL setups

0

0

0

0

12:02

01/07/2020

Contextualized Emotion Recognition in Conversation as Sequence Tagging

Yan Wang, Jiayu Zhang, Jun Ma and
Shaojun Wang, Jing Xiao

Keywords Paper

0

0

0

1

11:06

14/06/2020

Non-Local Neural Networks With Grouped Bilinear Attentional Transforms

Lu Chi, Zehuan Yuan, Yadong Mu, Changhu Wang

Keywords Paper

attention, non-local, bilinear, image classification, video classification, grouped, data-adaptive

0

0

0

0

1:01

04/07/2020

Effective Estimation of Deep Generative Language Models

Tom Pelsmaeker, Wilker Aziz

Keywords Paper

Estimation Models, parameterisation models, posterior collapse, language modelling

0

0

0

0

12:19

02/02/2021

Harmonized Dense Knowledge Distillation Training for Multi-Exit Architectures

Xinglu Wang, Yingming Li

Keywords Paper

0

0

0

0

15:12

16/11/2020

Neural Conversational QA: Learning to Reason vs Exploiting Patterns

Nikhil Verma, Abhishek Sharma, Dhiraj Madan and
Danish Contractor, Harshit Kumar, Sachindra Joshi

Keywords Paper

neural tasks, sharc task, sharc, heuristic-based program

0

0

0

0

7:04

13/04/2021

Feedback coding for active learning

Gregory Canal, Matthieu Bloch, Christopher Rozell

Keywords Paper

0

0

0

0

2:55

18/07/2021

Clustered Sampling: Low-Variance and Improved Representativity for Clients Selection in Federated Learning

Yann Fraboni, Richard Vidal, Laetitia Kameni, Marco Lorenzi

Keywords Paper

Optimization, Distributed and Parallel Optimization

0

0

0

0

5:01

06/12/2020

Network Diffusions via Neural Mean-Field Dynamics

shushan He, Hongyuan Zha, Xiaojing Ye

Keywords Paper

0

0

0

0

3:21

18/07/2021

Towards Open-World Recommendation: An Inductive Model-based Collaborative Filtering Approach

Qitian Wu, Hengrui Zhang, Xiaofeng Gao and
Junchi Yan, Hongyuan Zha

Keywords Paper

Applications, Recommender Systems

0

0

0

0

5:08

04/07/2020

Multi-Domain Dialogue Acts and Response Co-Generation

Kai Wang, Junfeng Tian, Rui Wang and
Xiaojun Quan, Jianxing Yu

Keywords Paper

Generating responses, task-oriented systems, response generation, automatic evaluations

0

0

0

1

10:01

26/04/2020

Sharing Knowledge in Multi-Task Deep Reinforcement Learning

Carlo D'Eramo, Davide Tateo, Andrea Bonarini and
Marcello Restelli, Jan Peters

Keywords Paper

Deep Reinforcement Learning, Multi-Task

0

0

0

0

4:27

19/08/2021

A Structure Self-Aware Model for Discourse Parsing on Multi-Party Dialogues

Ante Wang, Linfeng Song, Hui Jiang and
Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords Paper

Natural Language Processing, Dialogue, Discourse, Tagging, Chunking, and Parsing

0

0

0

0

8:33

04/07/2020

Guiding Variational Response Generator to Exploit Persona

Bowen Wu, Mengyuan Li, Zongsheng Wang and
Yifu Chen, Derek F. Wong, Qihang Feng, Junhong Huang, Baoxun Wang

Keywords Paper

conversational agents, optimization, persona-aware generation, Variational Generator

0

0

0

0

8:42

26/04/2020

Bayesian Meta Sampling for Fast Uncertainty Adaptation

Zhenyi Wang, Yang Zhao, Ping Yu and
Ruiyi Zhang, Changyou Chen

Keywords Paper

Bayesian Sampling, Uncertainty Adaptation, Meta Learning, Variational Inference

0

0

0

0

4:44

12/07/2020

Convolutional dictionary learning based auto-encoders for natural exponential-family distributions

Bahareh Tolooshams, Andrew Song, Simona Temereanca, Demba Ba

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

14:49

26/04/2020

Continual Learning with Bayesian Neural Networks for Non-Stationary Data

Richard Kurle, Botond Cseke, Alexej Klushyn and
Patrick van der Smagt, Stephan Günnemann

Keywords Paper

Continual Learning, Online Variational Bayes, Non-Stationary Data, Bayesian Neural Networks, Variational Inference, Lifelong Learning, Concept Drift, Episodic Memory

0

0

0

0

5:26

18/07/2021

Group Fisher Pruning for Practical Network Compression

Liyang Liu, Shilong Zhang, Zhanghui Kuang and
Aojun Zhou, Jing-Hao Xue, Xinjiang Wang, Yimin Chen, Wenming Yang, Qingmin Liao, Wayne Zhang

Keywords Paper

Applications, Computer Vision

0

0

0

0

5:05

23/08/2020

Context-to-session matching: Utilizing whole session for response selection in information-seeking dialogue systems

Zhenxin Fu, Shaobo Cui, Mingyue Shang and
Feng Ji, Dongyan Zhao, Haiqing Chen, Rui Yan

Keywords Paper

text matching, graph attention network, response selection

0

0

0

0

13:33

06/12/2021

Estimating the Unique Information of Continuous Variables

Ari Pakman, Amin Nejatbakhsh, Dar Gilboa and
Abdullah Makkeh, Luca Mazzucato, Michael Wibral, Elad Schneidman

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

12:39

22/09/2020

Exploring clustering of bandits for online recommendation system

Liu Yang, Bo Liu, Leyu Lin and
Feng Xia, Kai Chen, Qiang Yang

Keywords Paper

online learning, cluster-of-bandit, recommendation system

0

0

0

0

2:57

02/02/2021

Dynamic Automaton-Guided Reward Shaping for Monte Carlo Tree Search

Alvaro Velasquez, Brett Bissey, Lior Barak and
Andre Beckus, Ismail Alkhouri, Daniel Melcer, George Atia

Keywords Paper

0

0

0

0

18:52

19/10/2020

Learning to generate reformulation actions for scalable conversational query understanding

Zihan Xu, Jiangang Zhu, Ling Geng and
Yang Yang, Bojia Lin, Daxin Jiang

Keywords Paper

contextual query reformulation, question answering, conversational query understanding

0

0

0

0

6:58

19/10/2020

Deep multi-interest network for click-through rate prediction

Zhibo Xiao, Luwei Yang, Wen Jiang and
Yi Wei, Yi Hu, Hao Wang

Keywords Paper

click-through rate prediction, multi-interest, recommender system

0

0

0

0

6:09

12/07/2020

Operation-Aware Soft Channel Pruning using Differentiable Masks

Minsoo Kang, Bohyung Han

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:56

14/09/2020

Mend The Learning Approach, Not the Data: Insights for Ranking E-Commerce Products

Muhammad Umer Anwaar, Dmytro Rybalko, Martin Kleinsteuber

Keywords Paper

information retrieval, ranking and preference learning, learning to rank, e-commerce search, implicit feedback, counterfactual risk minimization, dataset, mining data logs

0

0

0

0

11:21

02/02/2021

Self-Supervised Hypergraph Convolutional Networks for Session-based Recommendation

Xin Xia, Hongzhi Yin, Junliang Yu and
Qinyong Wang, Lizhen Cui, Xiangliang Zhang

Keywords Paper

0

0

0

0

21:04

06/12/2020

A Simple Language Model for Task-Oriented Dialogue

Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu and
Semih Yavuz, Richard Socher

Keywords Paper

0

0

0

0

3:21

06/12/2020

Can Temporal-Diﬀerence and Q-Learning Learn Representation? A Mean-Field Theory

Yufeng Zhang, Qi Cai, Zhuoran Yang and
Yongxin Chen, Zhaoran Wang

Keywords Paper

0

0

0

0

3:02

02/02/2021

Bridging Towers of Multi-task Learning with a Gating Mechanism for Aspect-based Sentiment Analysis and Sequential Metaphor Identification

Rui Mao, Xiao Li

Keywords Paper

0

0

0

0

19:27

02/02/2021

Exploring Auxiliary Reasoning Tasks for Task-oriented Dialog Systems with Meta Cooperative Learning

Bowen Qin, Min Yang, Lidong Bing and
Qingshan Jiang, Chengming Li, Ruifeng Xu

Keywords Paper

0

0

0

0

15:41

16/11/2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference

Jianguo Zhang, Kazuma Hashimoto, Wenhao Liu and
Chien-Sheng Wu, Yao Wan, Philip Yu, Richard Socher, Caiming Xiong

Keywords Paper

intent detection, detecting intents, oos detection, large-scale task

0

0

0

0

11:43

25/07/2020

Knowledge enhanced personalized search

Shuqi Lu, Zhicheng Dou, Chenyan Xiong and
Xiaojie Wang, Ji-Rong Wen

Keywords Paper

entity-oriented search, knowledge base, personalized search

0

0

0

0

15:12

06/12/2021

Learning from Inside: Self-driven Siamese Sampling and Reasoning for Video Question Answering

Weijiang Yu, Haoteng Zheng, Mengfei Li and
Lei Ji, Lijun Wu, Nong Xiao, Nan Duan

Keywords Paper

transformers

0

0

0

0

13:47

19/08/2021

Progressive Open-Domain Response Generation with Multiple Controllable Attributes

Haiqin Yang, Xiaoyuan Yao, Yiqun Duan and
Jianping Shen, Jie Zhong, Kun Zhang

Keywords Paper

Machine Learning, Learning Generative Models, Dialogue

0

0

0

0

14:43

04/07/2020

Learning Source Phrase Representations for Neural Machine Translation

Hongfei Xu, Josef van Genabith, Deyi Xiong and
Qiuhui Liu, Jingyi Zhang

Keywords Paper

Neural Translation, WMT tasks, Learning Representations, Transformer model

0

0

0

0

7:18