Multilingual Universal Sentence Encoder for Semantic Retrieval

04/07/2020

Yinfei Yang, Daniel Cer, Amin Ahmad, Mandy Guo, Jax Law, Noah Constant, Gustavo Hernandez Abrego, Steve Yuan, Chris Tar, Yun-hsuan Sung, Brian Strope, Ray Kurzweil

Keywords: Semantic Retrieval, translation tasks, monolingual retrieval, translation retrieval

Abstract: We present easy-to-use, retrieval-focused multilingual sentence embedding models, made available on TensorFlow Hub. The models embed text from 16 languages into a shared semantic space using a multi-task trained dual-encoder that learns tied cross-lingual representations via translation bridge tasks (Chidambaram et al., 2018). The models achieve a new state of the art on monolingual and cross-lingual semantic retrieval (SR). Competitive performance is obtained on the related tasks of translation pair bitext retrieval (BR) and retrieval question answering (ReQA). On transfer learning tasks, our multilingual embeddings approach, and in some cases exceed, the performance of English-only sentence embeddings.
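The retrieval setup described in the abstract, where text from different languages is embedded into one shared space and candidates are matched by similarity, can be illustrated with a minimal sketch. The vectors below are made-up stand-ins rather than outputs of the released models, and cosine similarity is an assumed scoring function (the abstract does not state which one is used). The helper names `l2_normalize` and `rank_candidates` are hypothetical.

```python
import math

def l2_normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]

def rank_candidates(query_vec, candidate_vecs):
    """Return candidate indices sorted by descending cosine similarity to the query."""
    q = l2_normalize(query_vec)
    scores = []
    for idx, cand in enumerate(candidate_vecs):
        c = l2_normalize(cand)
        scores.append((sum(a * b for a, b in zip(q, c)), idx))
    # Highest score first; ties are broken by the lower index.
    scores.sort(key=lambda pair: (-pair[0], pair[1]))
    return [idx for _, idx in scores]

# Hypothetical 3-dimensional embeddings, invented for illustration only.
# In practice these would come from a sentence encoder.
query = [0.9, 0.1, 0.0]       # stands in for a query sentence
candidates = [
    [0.0, 1.0, 0.0],          # stands in for an unrelated sentence
    [0.8, 0.2, 0.1],          # stands in for a paraphrase or translation
    [0.1, 0.0, 0.9],          # stands in for another unrelated sentence
]
ranking = rank_candidates(query, candidates)
print(ranking[0])  # index of the best-matching candidate
```

Because every sentence lives in the same vector space, the same ranking routine would serve monolingual and cross-lingual retrieval alike; only the source of the embeddings changes.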

The talk and the respective paper are published at the ACL 2020 virtual conference.

Similar Papers

04/07/2020

A Systematic Study of Inner-Attention-Based Sentence Representations in Multilingual Neural Machine Translation

Raúl Vázquez, Alessandro Raganato, Mathias Creutz, Jörg Tiedemann

Keywords: Multilingual Translation, Neural translation, transfer learning, translation

14:05

03/05/2021

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu, Luxi Xing, Heng Yu, Weihua Luo

Keywords: hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

3:51

26/04/2020

Neural Machine Translation with Universal Visual Representation

Zhuosheng Zhang, Kehai Chen, Rui Wang, Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Keywords: Neural Machine Translation, Visual Representation, Multimodal Machine Translation, Language Representation

4:50

16/11/2020

XGLUE: A New Benchmark Dataset for Cross-lingual Pre-training, Understanding and Generation

Yaobo Liang, Nan Duan, Yeyun Gong, Ning Wu, Fenfei Guo, Weizhen Qi, Ming Gong, Linjun Shou, Daxin Jiang, Guihong Cao, Xiaodong Fan, Ruofei Zhang, Rahul Agrawal, Edward Cui, Sining Wei, Taroon Bharti, Ying Qiao, Jiun-Hung Chen, Winnie Wu, Shuguang Liu, Fan Yang, Daniel Campos, Rangan Majumder, Ming Zhou

Keywords: large-scale models, cross-lingual tasks, natural tasks, cross-lingual pre-training

10:06

03/05/2021

Structured Prediction as Translation between Augmented Natural Languages

Giovanni Paolini, Ben Athiwaratkun, Jason Krone, Jie Ma, Alessandro Achille, Rishita Anubhai, Cicero Nogueira dos Santos, Bing Xiang, Stefano Soatto

Keywords: sequence to sequence, structured prediction, language models, transfer learning, few-shot learning, multi-task learning, generative modeling

12:16

02/02/2021

Joint Semantic Analysis with Document-Level Cross-Task Coherence Rewards

Rahul Aralikatte, Mostafa Abdou, Heather C Lent, Daniel Hershcovich, Anders Søgaard

14:41

03/05/2021

Filtered Inner Product Projection for Crosslingual Embedding Alignment

Vin Sachidananda, Ziyi Yang, Chenguang Zhu

Keywords: multilingual representations, natural language processing, word embeddings

5:22

06/12/2021

Pay Better Attention to Attention: Head Selection in Multilingual and Multi-Domain Sequence Modeling

Hongyu Gong, Yun Tang, Juan Pino, Xian Li

10:04

04/07/2020

Unsupervised Multilingual Sentence Embeddings for Parallel Corpus Mining

Ivana Kvapilíková, Mikel Artetxe, Gorka Labaka, Eneko Agirre, Ondřej Bojar

Keywords: Unsupervised Embeddings, Parallel Mining, multilingual embeddings, parallel tasks

11:30

26/04/2020

Cross-lingual Alignment vs Joint Training: A Comparative Study and A Simple Unified Framework

Zirui Wang, Jiateng Xie, Ruochen Xu, Yiming Yang, Graham Neubig, Jaime G. Carbonell

Keywords: Cross-lingual Representation

4:53

08/12/2020

Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case

Adam Dahlgren Lindström, Johanna Björklund, Suna Bensch, Frank Drewes

14:20

19/04/2021

Coordinate constructions in English enhanced Universal Dependencies: Analysis and computational modeling

Stefan Grünewald, Prisca Piccirilli, Annemarie Friedrich

12:44

16/11/2020

Exploring Semantic Capacity of Terms

Jie Huang, Zilong Wang, Kevin Chang, Wen-mei Hwu, JinJun Xiong

Keywords: natural processing, artificial intelligence, linear regression, semantic capacity

9:49

04/07/2020

Enhancing Word Embeddings with Knowledge Extracted from Lexical Resources

Magdalena Biesialska, Bardia Rafieian, Marta R. Costa-jussà

Keywords: semantic representations, dialog tracking, word embeddings, specialization methods

8:23

02/02/2021

Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax

Hao Fei, Fei Li, Bobo Li, Donghong Ji

16:10

02/02/2021

Object Relation Attention for Image Paragraph Captioning

Li-Chuan Yang, Chih-Yuan Yang, Jane Yung-jen Hsu

15:03

16/11/2020

A Bilingual Generative Transformer for Semantic Sentence Embedding

John Wieting, Graham Neubig, Taylor Berg-Kirkpatrick

Keywords: source separation, semantic encoding, data distributions, unsupervised evaluations

14:32

08/12/2020

BME-TUW at SR’20: Lexical grammar induction for surface realization

Gábor Recski, Ádám Kovács, Kinga Gémes, Judit Ács, Andras Kornai

15:32

05/12/2020

Fairseq S2T: Fast speech-to-text modeling with fairseq

Changhan Wang, Yun Tang, Xutai Ma, Anne Wu, Dmytro Okhonko, Juan Pino

8:51

14/06/2020

Image Search With Text Feedback by Visiolinguistic Attention Learning

Yanbei Chen, Shaogang Gong, Loris Bazzani

Keywords: vision and language, image search, text feedback, attention mechanism, transformer, multimodal learning, representation learning, composition, image retrieval, interactive image search

1:00

04/07/2020

CLIReval: Evaluating Machine Translation as a Cross-Lingual Information Retrieval Task

Shuo Sun, Suzanna Sia, Kevin Duh

Keywords: Machine Translation, Cross-Lingual Task, machine MT, MT

9:37

16/11/2020

Character-level Representations Improve DRS-based Semantic Parsing Even in the Age of BERT

Rik van Noord, Antonio Toral, Johan Bos

Keywords: discourse parsing, analysis, character-level representations, character representations

11:26

04/07/2020

Multi-Granularity Interaction Network for Extractive and Abstractive Multi-Document Summarization

Hanqi Jin, Tianming Wang, Xiaojun Wan

Keywords: Extractive Summarization, Extractive, abstractive summarization, Multi-Granularity Network

10:38

16/11/2020

Zero-Shot Crosslingual Sentence Simplification

Jonathan Mallinson, Rico Sennrich, Mirella Lapata

Keywords: sentence simplification, translation, simplification, encoder-decoder models

10:34

04/07/2020

Parallel Sentence Mining by Constrained Decoding

Pinzhen Chen, Nikolay Bogoychev, Kenneth Heafield, Faheem Kirefu

Keywords: Parallel Mining, decoding, Constrained Decoding, neural translation

6:22

02/02/2021

Consecutive Decoding for Speech-to-text Translation

Qianqian Dong, Mingxuan Wang, Hao Zhou, Shuang Xu, Bo Xu, Lei Li

14:20

16/11/2020

Multi-Stage Pre-training for Low-Resource Domain Adaptation

Rong Zhang, Revanth Gangi Reddy, Md Arafat Sultan, Vittorio Castelli, Anthony Ferritto, Radu Florian, Efsun Sarioglu Kayi, Salim Roukos, Avi Sil, Todd Ward

Keywords: nlp tasks, fine-tuning, auxiliary tasks, lm transfer

6:56

04/07/2020

Should All Cross-Lingual Embeddings Speak English?

Antonios Anastasopoulos, Graham Neubig

Keywords: cross-lingual embeddings, lexicon tagging, lexicon dictionaries, cross-lingual baselines

9:25

12/07/2020

Recurrent Hierarchical Topic-Guided RNN for Language Generation

Dandan Guo, Bo Chen, Ruiying Lu, Mingyuan Zhou

Keywords: Probabilistic Inference - Models and Probabilistic Programming

16:05

16/11/2020

CCAligned: A Massive Collection of Cross-Lingual Web-Document Pairs

Ahmed El-Kishky, Vishrav Chaudhary, Francisco Guzmán, Philipp Koehn

Keywords: cross-lingual alignment, mining sentences, cross-lingual nlp, cross-lingual representations

11:47

16/11/2020

With More Contexts Comes Better Performance: Contextualized Sense Embeddings for All-Round Word Sense Disambiguation

Bianca Scarlini, Tommaso Pasini, Roberto Navigli

Keywords: natural processing, english task, word-in-context task, contextualized embeddings

12:11

08/12/2020

Span-based Joint Entity and Relation Extraction with Attention-based Span-specific and Contextual Semantic Representations

Bin Ji, Jie Yu, Shasha Li, Jun Ma, Qingbo Wu, Yusong Tan, Huijun Liu

10:13

02/02/2021

Multi-modal Graph Fusion for Named Entity Recognition with Targeted Visual Guidance

Dong Zhang, Suzhong Wei, Shoushan Li, Hanqian Wu, Qiaoming Zhu, Guodong Zhou

16:28

04/07/2020

Emerging Cross-lingual Structure in Pretrained Language Models

Alexis Conneau, Shijie Wu, Haoran Li, Luke Zettlemoyer, Veselin Stoyanov

Keywords: multilingual modeling, cross-lingual transfer, transfer, Cross-lingual Models

11:49

19/08/2021

Keep the Structure: A Latent Shift-Reduce Parser for Semantic Parsing

Yuntao Li, Bei Chen, Qian Liu, Yan Gao, Jian-Guang Lou, Yan Zhang, Dongmei Zhang

Keywords: Natural Language Processing, Natural Language Semantics

12:37

19/08/2021

Step-Wise Hierarchical Alignment Network for Image-Text Matching

Zhong Ji, Kexin Chen, Haoran Wang

Keywords: Computer Vision, Language and Vision

6:07

02/02/2021

Bridging Towers of Multi-task Learning with a Gating Mechanism for Aspect-based Sentiment Analysis and Sequential Metaphor Identification

Rui Mao, Xiao Li

19:27

04/07/2020

Self-Attention with Cross-Lingual Position Representation

Liang Ding, Longyue Wang, Dacheng Tao

Keywords: natural tasks, WMT'17 tasks, Cross-Lingual Representation, Position encoding

7:46

06/12/2020

Unsupervised Text Generation by Learning from Search

Jingjing Li, Zichao Li, Lili Mou, Xin Jiang, Michael Lyu, Irwin King

3:24

08/12/2020

SpanAlign: Sentence Alignment Method based on Cross-Language Span Prediction and ILP

Katsuki Chousa, Masaaki Nagata, Masaaki Nishino

14:39