FASTMATCH: Accelerating the Inference of BERT-based Text Matching

08/12/2020

Shuai Pang, Jianqiang Ma, Zeyu Yan, Yang Zhang, Jianping Shen

Abstract: Recently, pre-trained language models such as BERT have achieved state-of-the-art accuracy in text matching. When applied to IR (or QA), BERT-based matching models must compute the representations and interactions for all query-candidate pairs online. This high inference cost has prevented the deployment of BERT-based matching models in many practical applications. To address this issue, we propose a novel BERT-based text matching model in which the representations and the interactions are decoupled. The representations of the candidates can then be calculated and stored offline and retrieved directly during the online matching phase. A lightweight attention network is designed to conduct the interactions and generate the final matching scores. Experiments on several large-scale text matching datasets show that the proposed model, called FASTMATCH, achieves up to a 100X speed-up over BERT and RoBERTa in the online matching phase while retaining up to 98.7% of their performance.
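The decoupling described in the abstract can be illustrated with a toy sketch. This is not the authors' model: the hash-based `embed_token` function is a hypothetical stand-in for a BERT encoder, and `interaction_score` is a simple softmax cross-attention chosen only for illustration. All names (`DIM`, `cache`, `rank`, the example candidates) are assumptions. The point is the control flow: candidate representations are computed once offline, and only the query is encoded at matching time.

```python
import hashlib
import math

DIM = 8  # toy embedding width (assumption, not from the paper)


def embed_token(token):
    """Deterministic toy embedding; a stand-in for an expensive encoder."""
    digest = hashlib.sha256(token.encode("utf-8")).digest()
    vec = [(b / 255.0) - 0.5 for b in digest[:DIM]]
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def encode(text):
    """Return one unit-length vector per whitespace-separated token."""
    return [embed_token(tok) for tok in text.lower().split()]


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def softmax(xs):
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def interaction_score(query_vecs, cand_vecs):
    """Lightweight interaction: each query token attends over the candidate
    tokens, and the score is the mean similarity between a query token and
    its attention-weighted summary of the candidate."""
    if not query_vecs or not cand_vecs:
        return 0.0
    per_token = []
    for q in query_vecs:
        weights = softmax([dot(q, c) for c in cand_vecs])
        attended = [
            sum(w * c[i] for w, c in zip(weights, cand_vecs))
            for i in range(DIM)
        ]
        per_token.append(dot(q, attended))
    return sum(per_token) / len(per_token)


# Offline phase: encode every candidate once and store the result.
candidates = [
    "how to reset my password",
    "where can I find my invoice",
    "change the delivery address",
]
cache = {text: encode(text) for text in candidates}


def rank(query):
    """Online phase: encode only the query, then reuse cached candidates."""
    query_vecs = encode(query)
    scored = [
        (interaction_score(query_vecs, cand_vecs), text)
        for text, cand_vecs in cache.items()
    ]
    return sorted(scored, reverse=True)


if __name__ == "__main__":
    for score, text in rank("how do I reset my password"):
        print(f"{score:.3f}  {text}")
```

In this sketch the online cost per candidate is limited to the attention step over cached vectors, which mirrors the motivation given in the abstract. A real system would replace `encode` with a trained encoder and `interaction_score` with a learned network.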

The video of this talk is available at:

https://underline.io/lecture/6178-fastmatch-accelerating-the-inference-of-bert-based-text-matching

The talk and the corresponding paper were published at the COLING 2020 virtual conference.

Similar Papers

19/08/2021

Automatic Mixed-Precision Quantization Search of BERT

Changsheng Zhao, Ting Hua, Yilin Shen, Qian Lou, Hongxia Jin

Keywords: Machine Learning, Deep Learning, NLP Applications and Tools, Text Classification

12:12

19/04/2021

Retrieval, re-ranking and multi-task learning for knowledge-base question answering

Zhiguo Wang, Patrick Ng, Ramesh Nallapati, Bing Xiang

11:12

16/11/2020

KGPT: Knowledge-Grounded Pre-Training for Data-to-Text Generation

Wenhu Chen, Yu Su, Xifeng Yan, William Yang Wang

Keywords: data-to-text generation, data-to-text tasks, fully-supervised setting, pre-training learning

11:10

06/12/2020

Incorporating BERT into Parallel Sequence Decoding with Adapters

Junliang Guo, Zhirui Zhang, Linli Xu, Hao-Ran Wei, Boxing Chen, Enhong Chen

3:17

06/12/2020

SMYRF - Efficient Attention using Asymmetric Clustering

Giannis Daras, Nikita Kitaev, Augustus Odena, Alex Dimakis

3:28

22/11/2021

One-Shot Deep Model for End-to-End Multi-Person Activity Recognition

Shuhei Tarashima

Keywords: Group Activity Recognition, Action Recognition, Multi-Object Tracking, Multi-task Learning

2:50

25/07/2020

DC-BERT: Decoupling question and document for efficient contextual encoding

Ping Nie, Yuyu Zhang, Xiubo Geng, Arun Ramamurthy, Le Song, Daxin Jiang

Keywords: open-domain question answering, document retrieval

7:09

04/07/2020

A Novel Cascade Binary Tagging Framework for Relational Triple Extraction

Zhepei Wei, Jianlin Su, Yue Wang, Yuan Tian, Yi Chang

Keywords: Relational Extraction, large-scale construction, overlapping problem, relational task

11:05

16/11/2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference

Jianguo Zhang, Kazuma Hashimoto, Wenhao Liu, Chien-Sheng Wu, Yao Wan, Philip Yu, Richard Socher, Caiming Xiong

Keywords: intent detection, detecting intents, oos detection, large-scale task

11:43

06/12/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare, Shafiq Joty, Caiming Xiong, Steven Chu Hong Hoi

Keywords: transformers, vision, representation learning

9:40

05/12/2020

Towards non-task-specific distillation of BERT via sentence representation approximation

Bowen Wu, Huan Zhang, MengYuan Li, Zongsheng Wang, Qihang Feng, Junhong Huang, Baoxun Wang

10:51

12/09/2020

WOLED: A tool for Online Learning Weighted Answer Set Rules for Temporal Reasoning Under Uncertainty

Nikos Katzouris, Alexander Artikis

Keywords: KR related tools and systems-General, Case studies for KR systems-General, Applications that combine KR with machine learning-General

15:59

19/04/2021

Non-autoregressive text generation with pre-trained language models

Yixuan Su, Deng Cai, Yan Wang, David Vandyke, Simon Baker, Piji Li, Nigel Collier

11:05

04/11/2020

Serving DNNs like Clockwork: Performance Predictability from the Bottom Up

Arpan Gujarati, Reza Karimi, Safya Alzayat, Wei Hao, Antoine Kaufmann, Ymir Vigfusson, Jonathan Mace

20:17

02/02/2021

Improving the Efficiency and Effectiveness for BERT-based Entity Resolution

Bing Li, Yukai Miao, Yaoshu Wang, Yifang Sun, Wei Wang

14:53

06/12/2020

ConvBERT: Improving BERT with Span-based Dynamic Convolution

Zi-Hang Jiang, Weihao Yu, Daquan Zhou, Yunpeng Chen, Jiashi Feng, Shuicheng Yan

3:20

01/07/2020

Zero-Resource Cross-Domain Named Entity Recognition

Zihan Liu, Genta Indra Winata, Pascale Fung

5:15

06/12/2021

Hyperparameter Tuning is All You Need for LISTA

Xiaohan Chen, Jialin Liu, Zhangyang Wang, Wotao Yin

Keywords: deep learning

15:05

06/12/2020

Language Models are Few-Shot Learners

Tom B Brown, Ben Mann, Nick Ryder, Melanie Subbiah, Jared D Kaplan, Prafulla Dhariwal, Arvind Neelakantan, Pranav Shyam, Girish Sastry, Amanda Askell, Sandhini Agarwal, Ariel Herbert-Voss, Gretchen M Krueger, Tom Henighan, Rewon Child, Aditya Ramesh, Daniel Ziegler, Jeffrey Wu, Clemens Winter, Chris Hesse, Mark Chen, Eric Sigler, Mateusz Litwin, Scott Gray, Benjamin Chess, Jack Clark, Christopher Berner, Sam McCandlish, Alec Radford, Ilya Sutskever, Dario Amodei

3:11

19/08/2021

A Sequence-to-Set Network for Nested Named Entity Recognition

Zeqi Tan, Yongliang Shen, Shuai Zhang, Weiming Lu, Yueting Zhuang

Keywords: Natural Language Processing, Information Extraction, Named Entities

10:38

26/08/2020

Improving Maximum Likelihood Training for Text Generation with Density Ratio Estimation

Yuxuan Song, Ning Miao, Hao Zhou, Lantao Yu, Mingxuan Wang, Lei Li

12:32

08/12/2020

Have Your Text and Use It Too! End-to-End Neural Data-to-Text Generation with Semantic Fidelity

Hamza Harkous, Isabel Groves, Amir Saffari

14:37

03/05/2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

Keywords: adversarial training, QA, NLI, BERT, information theory, adversarial robustness

5:21

14/06/2020

Diverse Image Generation via Self-Conditioned GANs

Steven Liu, Tongzhou Wang, David Bau, Jun-Yan Zhu, Antonio Torralba

Keywords: generative adversarial networks, image synthesis, mode collapse, clustering, unsupervised learning

1:00

01/07/2020

Exploring the Limits of Simple Learners in Knowledge Distillation for Document Classification with DocBERT

Ashutosh Adhikari, Achyudh Ram, Raphael Tang, William L. Hamilton, Jimmy Lin

4:55

16/11/2020

Recall and Learn: Fine-tuning Deep Pretrained Language Models with Less Forgetting

Sanyuan Chen, Yutai Hou, Yiming Cui, Wanxiang Che, Ting Liu, Xiangzhan Yu

Keywords: pretraining, pretraining tasks, learning tasks, fine-tuning bert-large

10:52

04/07/2020

GAN-BERT: Generative Adversarial Learning for Robust Text Classification with a Bunch of Labeled Examples

Danilo Croce, Giuseppe Castellucci, Roberto Basili

Keywords: Robust Classification, Natural tasks, image processing, generative setting

6:48

25/07/2020

SummPip: Unsupervised multi-document summarization with sentence graph compression

Jinming Zhao, Ming Liu, Longxiang Gao, Yuan Jin, Lan Du, He Zhao, He Zhang, Gholamreza Haffari

Keywords: summarization, cluster, sentence graph, text compression

9:47

16/11/2020

Coarse-to-Fine Pre-training for Named Entity Recognition

Xue Mengge, Bowen Yu, Zhenyu Zhang, Tingwen Liu, Yue Zhang, Bin Wang

Keywords: named recognition, bert, entity task, pre-training approaches

9:23

01/07/2020

Simple Compounded-Label Training for Fact Extraction and Verification

Yixin Nie, Lisa Bauer, Mohit Bansal

9:59

03/05/2021

Text Generation by Learning from Demonstrations

Richard Pang, He He

Keywords: learning from demonstrations, nlp, text generation

5:21

16/11/2020

Scalable Zero-shot Entity Linking with Dense Entity Retrieval

Ledell Wu, Fabio Petroni, Martin Josifoski, Sebastian Riedel, Luke Zettlemoyer

Keywords: retrieval, non-zero-shot evaluations, bi-encoder linking, bert-based model

11:37

19/08/2021

Weakly Supervised Dense Video Captioning via Jointly Usage of Knowledge Distillation and Cross-modal Matching

Bofeng Wu, Guocheng Niu, Jun Yu, Xinyan Xiao, Jian Zhang, Hua Wu

Keywords: Computer Vision, Language and Vision, Multi-instance; Multi-label; Multi-view learning

12:03

04/07/2020

A Transformer-based Approach for Source Code Summarization

Wasi Ahmad, Saikat Chakraborty, Baishakhi Ray, Kai-Wei Chang

Keywords: Source Summarization, summarization, ablation studies, Transformer-based Approach

6:14

12/07/2020

Variable Skipping for Autoregressive Range Density Estimation

Eric Liang, Zongheng Yang, Ion Stoica, Pieter Abbeel, Yan Duan, Peter Chen

Keywords: Deep Learning - Generative Models and Autoencoders

13:01

25/07/2020

Evolutionary product description generation: A dynamic fine-tuning approach leveraging user click behavior

Yongzhen Wang, Jian Wang, Heng Huang, Hongsong Li, Xiaozhong Liu

Keywords: product description generation, neural network, sequence-to-sequence, click-through rate, reinforcement learning

14:34

18/07/2021

Of Moments and Matching: A Game-Theoretic Framework for Closing the Imitation Gap

Gokul Swamy, Sanjiban Choudhury, J. Bagnell, Steven Wu

Keywords: Theory, RL, Decisions and Control Theory

5:12

06/12/2020

DynaBERT: Dynamic BERT with Adaptive Width and Depth

Lu Hou, Zhiqi Huang, Lifeng Shang, Xin Jiang, Xiao Chen, Qun Liu

2:59

30/11/2020

Gaussian Vector: An Efficient Solution for Facial Landmark Detection

Yilin Xiong, Zijian Zhou, Yuhao Dou, Zhizhong Su

8:53

19/08/2021

UNBERT: User-News Matching BERT for News Recommendation

Qi Zhang, Jingjie Li, Qinglin Jia, Chuyuan Wang, Jieming Zhu, Zhaowei Wang, Xiuqiang He

Keywords: Machine Learning, Recommender Systems

12:18