The Sensitivity of Language Models and Humans to Winograd Schema Perturbations

04/07/2020

The Sensitivity of Language Models and Humans to Winograd Schema Perturbations

Mostafa Abdou, Vinit Ravishankar, Maria Barrett, Yonatan Belinkov, Desmond Elliott, Anders Søgaard

Keywords: human understanding, Language Models, Winograd Perturbations, Large-scale models

Abstract Paper Similar Papers

Abstract: Large-scale pretrained language models are the major driving force behind recent improvements in perfromance on the Winograd Schema Challenge, a widely employed test of commonsense reasoning ability. We show, however, with a new diagnostic dataset, that these models are sensitive to linguistic perturbations of the Winograd examples that minimally affect human understanding. Our results highlight interesting differences between humans and language models: language models are more sensitive to number or gender alternations and synonym replacements than humans, and humans are more stable and consistent in their predictions, maintain a much higher absolute performance, and perform better on non-associative instances than associative ones.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Probing for Referential Information in Language Models

Ionut-Teodor Sorodoc, Kristina Gulordava, Gemma Boleda

Keywords Paper

Probing, probe tasks, Language Models, LSTM architectures

0

0

0

0

11:31

08/12/2020

An analysis of language models for metaphor recognition

Arthur Neidlein, Philip Wiesenbach, Katja Markert

Keywords Paper

0

0

0

0

13:52

19/04/2021

Disambiguatory signals are stronger in word-initial positions

Tiago Pimentel, Ryan Cotterell, Brian Roark

Keywords Paper

0

0

0

0

11:35

16/11/2020

Precise Task Formalization Matters in Winograd Schema Evaluations

Haokun Liu, William Huang, Dhara Mungra, Samuel R. Bowman

Keywords Paper

task formalization, input specification, ablation, formalization decisions

0

0

0

0

4:43

04/07/2020

How does BERT's attention change when you fine-tune? An analysis methodology and a case study in negation scope

Yiyun Zhao, Steven Bethard

Keywords Paper

downstream task, NLP problems, knowledge-related tasks, downstream tasks

0

0

0

0

11:43

04/07/2020

What BERT Is Not: Lessons from a New Suite of Psycholinguistic Diagnostics for Language Models

Allyson Ettinger

Keywords Paper

Pre-training, NLP tasks, inference, role-based prediction

0

0

0

0

12:39

04/07/2020

Are we Estimating or Guesstimating Translation Quality?

Shuo Sun, Francisco Guzmán, Lucia Specia

Keywords Paper

Estimating Quality, quality estimation, machine translation, QE task

0

0

0

0

5:56

03/05/2021

FairFil: Contrastive Neural Debiasing Method for Pretrained Text Encoders

Pengyu Cheng, Weituo Hao, Siyang Yuan and
Shijing Si, Lawrence Carin

Keywords Paper

Mutual Information, Pretrained Text Encoders, Contrastive Learning, Fairness

0

0

0

0

4:43

08/12/2020

Evaluating Cross-Lingual Transfer Learning Approaches in Multilingual Conversational Agent Models

Lizhen Tan, Olga Golovneva

Keywords Paper

0

0

0

0

9:23

19/04/2021

Continuous learning in neural machine translation using bilingual dictionaries

Jan Niehues

Keywords Paper

0

0

0

0

11:48

19/04/2021

Does she wink or does she nod? A challenging benchmark for evaluating word understanding of language models

Lutfi Kerem Senel, Hinrich Schütze

Keywords Paper

0

0

0

0

7:43

02/02/2021

Improving Commonsense Causal Reasoning by Adversarial Training and Data Augmentation

Ieva Staliūnaitė, Philip John Gorinski, Ignacio Iacobacci

Keywords Paper

0

0

0

0

16:40

16/11/2020

Improving Multilingual Models with Language-Clustered Vocabularies

Hyung Won Chung, Dan Garrette, Kiat Chuan Tan, Jason Riesa

Keywords Paper

massively applications, multilingual generation, cross-lingual sharing, multilingual models

0

0

0

0

6:59

04/07/2020

Unsupervised Cross-lingual Representation Learning at Scale

Alexis Conneau, Kartikay Khandelwal, Naman Goyal and
Vishrav Chaudhary, Guillaume Wenzek, Francisco Guzmán, Edouard Grave, Myle Ott, Luke Zettlemoyer, Veselin Stoyanov

Keywords Paper

cross-lingual tasks, XNLI, MLQA, NER

0

0

0

0

12:15

16/11/2020

X-SRL: A Parallel Cross-Lingual Semantic Role Labeling Dataset

Angel Daza, Anette Frank

Keywords Paper

generalization learning, multilingual learning, high-quality translation, srl

0

0

0

0

9:24

19/08/2021

Exemplification Modeling: Can You Give Me an Example, Please?

Edoardo Barba, Luigi Procopio, Caterina Lacerra and
Tommaso Pasini, Roberto Navigli

Keywords Paper

Natural Language Processing, Natural Language Semantics, Resources and Evaluation

0

0

0

0

14:47

04/07/2020

Automatic Detection of Generated Text is Easiest when Humans are Fooled

Daphne Ippolito, Daniel Duckworth, Chris Callison-Burch, Douglas Eck

Keywords Paper

Automatic Text, detection, humanness systems, neural modelling

0

0

0

0

11:01

05/12/2020

An exploratory study on multilingual quality estimation

Shuo Sun, Marina Fomicheva, Frédéric Blain and
Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala, Francisco Guzmán, Lucia Specia

Keywords Paper

0

0

0

0

14:31

26/04/2020

Pretrained Encyclopedia: Weakly Supervised Knowledge-Pretrained Language Model

Wenhan Xiong, Jingfei Du, William Yang Wang, Veselin Stoyanov

Keywords Paper

0

0

0

0

5:00

03/05/2021

Variational Information Bottleneck for Effective Low-Resource Fine-Tuning

Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

Keywords Paper

variational information bottleneck, biases, robust, over-fitting, large-scale pre-trained language models, NLP, Transfer learning

0

0

0

0

5:07

06/12/2021

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

machine learning, fairness, language

0

0

0

0

13:17

19/04/2021

Semantic parsing of disfluent speech

Priyanka Sen, Isabel Groves

Keywords Paper

0

0

0

0

7:14

16/11/2020

An Empirical Study on Robustness to Spurious Correlations using Pre-trained Language Models

Lifu Tu, Garima Lalwani, Spandana Gella, He He

Keywords Paper

generalization, natural inference, paraphrase identification, pre-trained models

0

0

0

0

11:55

04/07/2020

On Faithfulness and Factuality in Abstractive Summarization

Joshua Maynez, Shashi Narayan, Bernd Bohnet, Ryan McDonald

Keywords Paper

Abstractive Summarization, likelihood objectives, open-ended tasks, language modeling

0

0

0

1

12:41

06/12/2021

Multilingual Pre-training with Universal Dependency Learning

Kailai Sun, Zuchao Li, Hai Zhao

Keywords Paper

language, interpretability

0

0

0

0

10:21

04/07/2020

Don’t Stop Pretraining: Adapt Language Models to Domains and Tasks

Suchin Gururangan, Ana Marasović, Swabha Swayamdipta and
Kyle Lo, Iz Beltagy, Doug Downey, Noah A. Smith

Keywords Paper

NLP, classification tasks, pretraining, domain-adaptive pretraining

0

0

0

0

11:10

16/11/2020

What Have We Achieved on Text Summarization?

Dandan Huang, Leyang Cui, Sen Yang and
Guangsheng Bao, Kun Wang, Jun Xie, Yue Zhang

Keywords Paper

text summarization, deep learning, automatic summarizers, summarization systems

0

0

0

0

11:20

16/11/2020

Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA

Ieva Staliūnaitė, Ignacio Iacobacci

Keywords Paper

nlp tasks, conversational task, semantic labeling, contextualized embeddings

0

0

0

0

11:23

04/07/2020

A Systematic Study of Inner-Attention-Based Sentence Representations in Multilingual Neural Machine Translation

Raúl Vázquez, Alessandro Raganato, Mathias Creutz, Jörg Tiedemann

Keywords Paper

Multilingual Translation, Neural translation, transfer learning, translation

0

0

0

0

14:05

06/12/2021

Mind the Gap: Assessing Temporal Generalization in Neural Language Models

Angeliki Lazaridou, Adhi Kuncoro, Elena Gribovskaya and
Devang Agrawal, Adam Liska, Tayfun Terzi, Mai Gimenez, Cyprien de Masson d'Autume, Tomas Kocisky, Sebastian Ruder, Dani Yogatama, Kris Cao, Susannah Young, Phil Blunsom

Keywords Paper

transformers

0

0

0

0

14:59

08/12/2020

ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation

Dario Stojanovski, Benno Krojer, Denis Peskov, Alexander Fraser

Keywords Paper

0

0

0

0

14:09

16/11/2020

DagoBERT: Generating Derivational Morphology with a Pretrained Language Model

Valentin Hofmann, Janet Pierrehumbert, Hinrich Schütze

Keywords Paper

full finetuning, derivation generation, pretrained models, plms

0

0

0

0

10:15

04/07/2020

Revisiting Higher-Order Dependency Parsers

Erick Fonseca, André F. T. Martins

Keywords Paper

Higher-Order Parsers, Neural encoders, dependency parsers, higher-order models

0

0

0

0

6:46

04/07/2020

Considering Likelihood in NLP Classiﬁcation Explanations with Occlusion and Language Modeling

David Harbecke, Christoph Alt

Keywords Paper

NLP, NLP Explanations, Language Modeling, NLP models

0

0

0

0

12:01

16/11/2020

Towards Better Context-aware Lexical Semantics:Adjusting Contextualized Representations through Static Anchors

Qianchu Liu, Diana McCarthy, Anna Korhonen

Keywords Paper

transformation, contextualized models, dynamic embeddings, post-processing technique

0

0

0

0

6:53

01/07/2020

How Self-Attention Improves Rare Class Performance in a Question-Answering Dialogue Agent

Adam Stiff, Qi Song, Eric Fosler-Lussier

Keywords Paper

0

0

0

0

7:55

16/11/2020

Small but Mighty: New Benchmarks for Split and Rephrase

Li Zhang, Huaiyu Zhu, Siddhartha Brahma, Yunyao Li

Keywords Paper

text task, fine-grained evaluation, automatic process, rule-based model

0

0

0

0

6:58

04/07/2020

Revisiting the Context Window for Cross-lingual Word Embeddings

Ryokan Ri, Yoshimasa Tsuruoka

Keywords Paper

Cross-lingual Embeddings, mapping-based embeddings, bilingual induction, mapping-based embeddings

0

0

0

0

9:05

16/11/2020

Syntactic Structure Distillation Pretraining for Bidirectional Encoders

Adhiguna Kuncoro, Lingpeng Kong, Daniel Fried and
Dani Yogatama, Laura Rimell, Chris Dyer, Phil Blunsom

Keywords Paper

bert pretraining, structured tasks, natural understanding, textual learners

0

0

0

0

12:23

26/04/2020

ReClor: A Reading Comprehension Dataset Requiring Logical Reasoning

Weihao Yu, Zihang Jiang, Yanfei Dong, Jiashi Feng

Keywords Paper

reading comprehension, logical reasoning, natural language processing

0

0

0

0

4:11