04/07/2020

Probing for Referential Information in Language Models

Ionut-Teodor Sorodoc, Kristina Gulordava, Gemma Boleda

Keywords: Probing, probe tasks, Language Models, LSTM architectures

Abstract: Language models keep track of complex information about the preceding context -- including, e.g., syntactic relations in a sentence. We investigate whether they also capture information beneficial for resolving pronominal anaphora in English. We analyze two state-of-the-art models with LSTM and Transformer architectures via probe tasks and an analysis on a coreference-annotated corpus. The Transformer outperforms the LSTM in all analyses. Our results suggest that language models are more successful at learning grammatical constraints than at learning truly referential information, in the sense of capturing the fact that we use language to refer to entities in the world. However, we also find traces of the latter aspect.
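
The abstract's probe tasks are diagnostic classifiers trained on the language model's frozen representations. Below is a minimal sketch of that general idea, assuming a simple linear probe over hidden states; the feature extraction, labels, and classifier choice here are illustrative assumptions, not the paper's exact setup.

```python
# Minimal sketch of a probing classifier (illustrative assumption, not the
# paper's exact method): a linear probe is trained on frozen LM hidden states
# to predict a referential property, e.g. whether a pronoun and a candidate
# noun corefer.

import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score

rng = np.random.default_rng(0)

# Stand-ins for hidden states extracted from a frozen language model
# (LSTM or Transformer). In practice these would be the model's
# representations at the pronoun position; here they are random vectors.
hidden_dim = 650
n_train, n_test = 2000, 500
X_train = rng.normal(size=(n_train, hidden_dim))
y_train = rng.integers(0, 2, size=n_train)   # 1 = corefers, 0 = does not
X_test = rng.normal(size=(n_test, hidden_dim))
y_test = rng.integers(0, 2, size=n_test)

# The probe itself: a simple linear classifier, so any signal it finds
# must already be (approximately) linearly encoded in the representations.
probe = LogisticRegression(max_iter=1000)
probe.fit(X_train, y_train)

print("probe accuracy:", accuracy_score(y_test, probe.predict(X_test)))
```

Keeping the probe linear and the language model frozen is the standard design choice in this line of work: a more expressive probe could learn the task on its own, which would make it harder to attribute the result to information stored in the representations.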

