LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

16/11/2020

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto

Keywords: natural tasks, pretraining task, transformer, entity-related tasks

Abstract Paper Similar Papers

Abstract: Entity representations are useful in natural language tasks involving entities. In this paper, we propose new pretrained contextualized representations of words and entities based on the bidirectional transformer. The proposed model treats words and entities in a given text as independent tokens, and outputs contextualized representations of them. Our model is trained using a new pretraining task based on the masked language model of BERT. The task involves predicting randomly masked words and entities in a large entity-annotated corpus retrieved from Wikipedia. We also propose an entity-aware self-attention mechanism that is an extension of the self-attention mechanism of the transformer, and considers the types of tokens (words or entities) when computing attention scores. The proposed model achieves impressive empirical performance on a wide range of entity-related tasks. In particular, it obtains state-of-the-art results on five well-known datasets: Open Entity (entity typing), TACRED (relation classification), CoNLL-2003 (named entity recognition), ReCoRD (cloze-style question answering), and SQuAD 1.1 (extractive question answering). Our source code and pretrained representations are available at https://github.com/studio-ousia/luke.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at EMNLP 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

Pseudo-Masked Language Models for Unified Language Model Pre-Training

Hangbo Bao, Li Dong, Furu Wei and
Wenhui Wang, Nan Yang, Xiaodong Liu, Yu Wang, Jianfeng Gao, Songhao Piao, Ming Zhou, Hsiao-Wuen Hon

Keywords Paper

Applications - Language, Speech and Dialog

0

0

0

0

13:55

19/04/2021

On robustness of neural semantic parsers

Shuo Huang, Zhuang Li, Lizhen Qu, Lei Pan

Keywords Paper

0

0

0

0

11:11

03/05/2021

Autoregressive Entity Retrieval

Nicola De Cao, Gautier Izacard, Sebastian Riedel, Fabio Petroni

Keywords Paper

constrained beam search, entity disambiguation, end-to-end entity linking, entity linking, autoregressive language model, document retrieval, entity retrieval

0

0

0

0

10:14

19/08/2021

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Yuxuan Han, Jiaolong Yang, Ying Fu

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Explainable/Interpretable Machine Learning

0

0

0

0

12:51

16/11/2020

Affective Event Classification with Discourse-enhanced Self-training

Yuan Zhuang, Tianyu Jiang, Ellen Riloff

Keywords Paper

affective classification, classification models, bert-based model, classifier

0

0

0

0

11:41

08/12/2020

A Mixture-of-Experts Model for Learning Multi-Facet Entity Embeddings

Rana Alshaikh, Zied Bouraoui, Shelan Jeawak, Steven Schockaert

Keywords Paper

0

0

0

0

14:13

16/11/2020

Local Additivity Based Data Augmentation for Semi-supervised NER

Jiaao Chen, Zhenghui Wang, Ran Tian and
Zichao Yang, Diyi Yang

Keywords Paper

named recognition, deep understanding, semi-supervised ner, entity learning

0

0

0

0

11:18

16/11/2020

Asking without Telling: Exploring Latent Ontologies in Contextual Representations

Julian Michael, Jan A. Botha, Ian Tenney

Keywords Paper

pretrained encoders, elmo, bert, latent learning

0

0

0

0

12:45

08/12/2020

MZET: Memory Augmented Zero-Shot Fine-grained Named Entity Typing

Tao Zhang, Congying Xia, Chun-Ta Lu, Philip Yu

Keywords Paper

0

0

0

0

20:19

19/10/2020

Evaluating the impact of knowledge graph context on entity disambiguation models

Isaiah Onando Mulang’, Kuldeep Singh, Chaitali Prabhu and
Abhishek Nadgeri, Johannes Hoffart, Jens Lehmann

Keywords Paper

roberta, knowledge graph, context, pretrained transformers, named entity disambiguation, xlnet, wikidata

0

0

0

0

7:03

04/07/2020

Empower Entity Set Expansion via Language Model Probing

Yunyi Zhang, Jiaming Shen, Jingbo Shang, Jiawei Han

Keywords Paper

Empower Expansion, Entity expansion, NLP applications, question answering

0

0

0

0

11:16

02/02/2021

Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax

Hao Fei, Fei Li, Bobo Li, Donghong Ji

Keywords Paper

0

0

0

0

16:10

05/12/2020

Exploiting WordNet synset and hypernym representations for answer selection

Weikang Li, Yunfang Wu

Keywords Paper

0

0

0

0

7:04

04/07/2020

A Methodology for Creating Question Answering Corpora Using Inverse Data Annotation

Jan Deriu, Katsiaryna Mlynchyk, Philippe Schläpfer and
Alvaro Rodrigo, Dirk von Grünigen, Nicolas Kaiser, Kurt Stockinger, Eneko Agirre, Mark Cieliebak

Keywords Paper

question answering, annotation, Inverse Annotation, intermediate representation

0

0

0

0

12:51

02/02/2021

KEML: A Knowledge-Enriched Meta-Learning Framework for Lexical Relation Classification

Chengyu Wang, Minghui Qiu, Jun Huang, Xiaofeng He

Keywords Paper

0

0

0

0

15:47

04/07/2020

Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus

Hao Fei, Meishan Zhang, Donghong Ji

Keywords Paper

Cross-Lingual Labeling, semantic labeling, natural understanding, model transferring

0

0

0

0

10:32

16/11/2020

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models

Zhengbao Jiang, Antonios Anastasopoulos, Jun Araki and
Haibo Ding, Graham Neubig

Keywords Paper

factual retrieval, language models, lms, probing methods

0

0

0

0

9:45

19/08/2021

Exemplification Modeling: Can You Give Me an Example, Please?

Edoardo Barba, Luigi Procopio, Caterina Lacerra and
Tommaso Pasini, Roberto Navigli

Keywords Paper

Natural Language Processing, Natural Language Semantics, Resources and Evaluation

0

0

0

0

14:47

22/06/2020

Exploiting Semantic Relations for Fine-grained Entity Typing

Hongliang Dai, Yangqiu Song, Xin Li

Keywords Paper

Fine-grained Entity Typing, Hypernym Extraction, Semantic Role Labeling

0

0

0

0

4:45

03/05/2021

Distilling Knowledge from Reader to Retriever for Question Answering

Gautier Izacard, Edouard Grave

Keywords Paper

question answering, information retrieval

0

0

0

0

5:14

19/04/2021

BERTese: Learning to speak to BERT

Adi Haviv, Jonathan Berant, Amir Globerson

Keywords Paper

0

0

0

0

6:54

05/12/2020

Named entity recognition in multi-level contexts

Yubo Chen, Chuhan Wu, Tao Qi and
Zhigang Yuan, Yongfeng Huang

Keywords Paper

0

0

0

0

14:10

19/10/2020

Schema-agnostic entity matching using pre-trained language models

Kai-Sheng Teong, Lay-Ki Soon, Tin Tin Su

Keywords Paper

language models, schema agnostic, entity matching

0

0

0

0

6:33

16/11/2020

ToTTo: A Controlled Table-To-Text Generation Dataset

Ankur Parikh, Xuezhi Wang, Sebastian Gehrmann and
Manaal Faruqui, Bhuwan Dhingra, Diyi Yang, Dipanjan Das

Keywords Paper

controlled task, high-precision generation, totto, dataset process

0

0

0

0

11:53

03/05/2021

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu and
Luxi Xing, Heng Yu, Weihua Luo

Keywords Paper

hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

0

0

0

0

3:51

16/11/2020

SelfORE: Self-supervised Relational Feature Learning for Open Relation Extraction

Xuming Hu, Lijie Wen, Yusong Xu and
Chenwei Zhang, Philip Yu

Keywords Paper

open extraction, extracting facts, adaptive clustering, relation classification

0

0

0

0

11:25

06/12/2021

Grammar-Based Grounded Lexicon Learning

Jiayuan Mao, Freda Shi, Jiajun Wu and
Roger Levy, Josh Tenenbaum

Keywords Paper

deep learning

0

0

0

0

13:41

03/05/2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Boxin Wang, Shuohang Wang, Yu Cheng and
Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

Keywords Paper

adversarial training, QA, NLI, BERT, information theory, adversarial robustness

0

0

0

0

5:21

08/12/2020

Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case

Adam Dahlgren Lindström, Johanna Björklund, Suna Bensch, Frank Drewes

Keywords Paper

0

0

0

0

14:20

02/02/2021

Exploring Explainable Selection to Control Abstractive Summarization

Haonan Wang, Yang Gao, Yu Bai and
Mirella Lapata, Heyan Huang

Keywords Paper

0

0

0

0

18:50

16/11/2020

Cross-Thought for Sentence Encoder Pre-training

Shuohang Wang, Yuwei Fang, Siqi Sun and
Zhe Gan, Yu Cheng, Jingjing Liu, Jing Jiang

Keywords Paper

pre-training encoder, large-scale tasks, question answering, predicting words

0

0

0

0

12:06

16/11/2020

PALM: Pre-training an Autoencoding&Autoregressive Language Model for Context-conditioned Generation

Bin Bi, Chenliang Li, Chen Wu and
Ming Yan, Wei Wang, Songfang Huang, Fei Huang, Luo Si

Keywords Paper

natural generation, language tasks, generative answering, conversational generation

0

0

0

0

11:02

08/12/2020

Automatic Word Association Norms (AWAN)

Jorge Reyes-Magaña, Gerardo Sierra Martínez, Gemma Bel-Enguix, Helena Gomez-Adorno

Keywords Paper

0

0

0

0

14:34

06/12/2021

Integrating Tree Path in Transformer for Code Representation

Han Peng, Ge Li, Wenhan Wang and
YunFei Zhao, Zhi Jin

Keywords Paper

machine learning, transformers

0

0

0

0

4:42

16/11/2020

Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation

Minki Kang, Moonsu Han, Sung Ju Hwang

Keywords Paper

self-supervised pre-training, question answering, task, reinforcement learning

0

0

0

0

12:00

06/12/2021

Improving Visual Quality of Image Synthesis by A Token-based Generator with Transformers

Yanhong Zeng, Huan Yang, Hongyang Chao and
Jianbo Wang, Jianlong Fu

Keywords Paper

transformers, generative model

0

0

0

0

9:28

04/07/2020

Analysing Lexical Semantic Change with Contextualised Word Representations

Mario Giulianelli, Marco Del Tredici, Raquel Fernández

Keywords Paper

Contextualised Representations, unsupervised approach, BERT model, model representations

0

0

0

0

11:56

19/04/2021

Interpretability for morphological inflection: From character-level predictions to subword-level rules

Tatyana Ruzsics, Olga Sozinova, Ximena Gutierrez-Vasques, Tanja Samardzic

Keywords Paper

0

0

0

0

10:53

04/07/2020

A Novel Cascade Binary Tagging Framework for Relational Triple Extraction

Zhepei Wei, Jianlin Su, Yue Wang and
Yuan Tian, Yi Chang

Keywords Paper

Relational Extraction, large-scale construction, overlapping problem, relational task

0

0

0

0

11:05

02/06/2020

SchemaTree: Maximum-Likelihood Property Recommendation for Wikidata

Lars C. Gleim, Rafael Schimassek, Dominik Hüser and
Maximilian Peters, Christoph Krämer, Michael Cochez et al.

Keywords Paper

0

0

0

0

29:33