Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

02/02/2021

Semantics Altering Modifications for Evaluating Comprehension in Machine Reading

Viktor Schlegel, Goran Nenadic, Riza Batista-Navarro

Keywords:

Abstract Paper Similar Papers

Abstract: Advances in NLP have yielded impressive results for the task of machine reading comprehension (MRC), with approaches having been reported to achieve performance comparable to that of humans. In this paper, we investigate whether state-of-the-art MRC models are able to correctly process Semantics Altering Modifications (SAM): linguistically-motivated phenomena that alter the semantics of a sentence while preserving most of its lexical surface form. We present a method to automatically generate and align challenge sets featuring original and altered examples. We further propose a novel evaluation methodology to correctly assess the capability of MRC systems to process these examples independent of the data they were optimised on, by discounting for effects introduced by domain shift. In a large-scale empirical study, we apply the methodology in order to evaluate extractive MRC models with regard to their capability to correctly process SAM-enriched data. We comprehensively cover 12 different state-of-the-art neural architecture configurations and four training datasets and find that -- despite their well-known remarkable performance -- optimised models consistently struggle to correctly process semantically altered data.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38947840

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/08/2021

Generating Senses and RoLes: An End-to-End Model for Dependency- and Span-based Semantic Role Labeling

Rexhina Blloshmi, Simone Conia, Rocco Tripodi, Roberto Navigli

Keywords Paper

Natural Language Processing, Natural Language Semantics, Natural Language Generation, Natural Language Processing

0

0

0

0

15:18

12/07/2020

Educating Text Autoencoders: Latent Representation Guidance via Denoising

Tianxiao Shen, Jonas Mueller, Regina Barzilay, Tommi Jaakkola

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

17:06

04/07/2020

Cross-Lingual Semantic Role Labeling with High-Quality Translated Training Corpus

Hao Fei, Meishan Zhang, Donghong Ji

Keywords Paper

Cross-Lingual Labeling, semantic labeling, natural understanding, model transferring

0

0

0

0

10:32

16/11/2020

Calibrated Language Model Fine-Tuning for In- and Out-of-Distribution Data

Lingkai Kong, Haoming Jiang, Yuchen Zhuang and
Jie Lyu, Tuo Zhao, Chao Zhang

Keywords Paper

augmented training, in-distribution calibration, text classification, expectation error

0

0

0

0

11:47

04/07/2020

Logical Natural Language Generation from Open-Domain Tables

Wenhu Chen, Jianshu Chen, Yu Su and
Zhiyu Chen, William Yang Wang

Keywords Paper

Logical Generation, neural NLG, surface-level realizations, logical inference

0

0

0

0

11:48

14/06/2020

Attention-Guided Hierarchical Structure Aggregation for Image Matting

Yu Qiao, Yuhao Liu, Xin Yang and
Dongsheng Zhou, Mingliang Xu, Qiang Zhang, Xiaopeng Wei

Keywords Paper

image matting, attention, hierarchical, aggregation, appearance cues

0

0

0

0

0:59

16/11/2020

Asking without Telling: Exploring Latent Ontologies in Contextual Representations

Julian Michael, Jan A. Botha, Ian Tenney

Keywords Paper

pretrained encoders, elmo, bert, latent learning

0

0

0

0

12:45

19/04/2021

Cognition-aware cognate detection

Diptesh Kanojia, Prashant Sharma, Sayali Ghodekar and
Pushpak Bhattacharyya, Gholamreza Haffari, Malhar Kulkarni

Keywords Paper

0

0

0

0

8:53

19/04/2021

‘just because you are right, doesn’t mean I am wrong’: Overcoming a bottleneck in development and evaluation of open-ended VQA tasks

Man Luo, Shailaja Keyur Sampat, Riley Tallman and
Yankai Zeng, Manuha Vancha, Akarshan Sajja, Chitta Baral

Keywords Paper

0

0

0

0

7:10

16/11/2020

Improving AMR Parsing with Sequence-to-Sequence Pre-training

Dongqin Xu, Junhui Li, Muhua Zhu and
Min Zhang, Guodong Zhou

Keywords Paper

abstract parsing, amr parsing, sequence-to-sequence parsing, machine translation

0

0

0

0

11:42

08/12/2020

Incremental Neural Lexical Coherence Modeling

Sungho Jeon, Michael Strube

Keywords Paper

0

0

0

0

9:08

19/08/2021

Exemplification Modeling: Can You Give Me an Example, Please?

Edoardo Barba, Luigi Procopio, Caterina Lacerra and
Tommaso Pasini, Roberto Navigli

Keywords Paper

Natural Language Processing, Natural Language Semantics, Resources and Evaluation

0

0

0

0

14:47

03/05/2021

Semantic Re-tuning with Contrastive Tension

Fredrik Carlsson, Amaru C Gyllensten, Evangelia Gogoulou and
Erik Y Hellqvist, Magnus Sahlgren

Keywords Paper

Fine-tuning, Pre-training, Sentence Representations, Sentence Embeddings, Language Modelling, Semantic Textual Similarity, Transformers

0

0

0

0

5:07

04/07/2020

Reverse Engineering Configurations of Neural Text Generation Models

Yi Tay, Dara Bahri, Che Zheng and
Clifford Brunk, Donald Metzler, Andrew Tomkins

Keywords Paper

Reverse Models, neural modeling, Neural Models, generative models

0

0

0

0

6:16

04/07/2020

SMART: Robust and Efficient Fine-Tuning for Pre-trained Natural Language Models through Principled Regularized Optimization

Haoming Jiang, Pengcheng He, Weizhu Chen and
Xiaodong Liu, Jianfeng Gao, Tuo Zhao

Keywords Paper

NLP, generalization, NLP tasks, SMART

0

0

0

0

11:43

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

0

0

0

0

12:23

08/12/2020

Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case

Adam Dahlgren Lindström, Johanna Björklund, Suna Bensch, Frank Drewes

Keywords Paper

0

0

0

0

14:20

04/07/2020

A Frame-based Sentence Representation for Machine Reading Comprehension

Shaoru Guo, Ru Li, Hongye Tan and
Xiaoli Li, Yong Guan, Hongyan Zhao, Yueping Zhang

Keywords Paper

Machine Comprehension, Sentence representation, SR, Machine MRC

0

0

0

0

6:33

16/11/2020

Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

Xiangpeng Wei, Heng Yu, Yue Hu and
Rongxiang Weng, Luxi Xing, Weihua Luo

Keywords Paper

sequence-to-sequence task, nmt, inference, translation tasks

0

0

0

0

11:11

02/02/2021

What's the Best Place for an AI Conference, Vancouver or _______: Why Completing Comparative Questions is Difficult

‪Avishai Zagoury‬, Einat Minkov, Idan Szpektor, William W. Cohen

Keywords Paper

0

0

0

0

15:15

04/07/2020

Words Aren't Enough, Their Order Matters: On the Robustness of Grounding Visual Referring Expressions

Arjun Akula, Spandana Gella, Yaser Al-Onaizan and
Song-Chun Zhu, Siva Reddy

Keywords Paper

Robustness Expressions, Grounding Expressions, Visual recognition, natural understanding

0

0

0

0

6:53

02/02/2021

IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization

Wenxuan Zhou, Bill Yuchen Lin, Xiang Ren

Keywords Paper

0

0

0

0

16:25

19/08/2021

Recent Trends in Word Sense Disambiguation: A Survey

Michele Bevilacqua, Tommaso Pasini, Alessandro Raganato, Roberto Navigli

Keywords Paper

Natural language processing, General

0

0

0

0

14:20

16/11/2020

XL-WiC: A Multilingual Benchmark for Evaluating Semantic Contextualization

Alessandro Raganato, Tommaso Pasini, Jose Camacho-Collados, Mohammad Taher Pilehvar

Keywords Paper

disambiguation task, binary problem, evaluation scenarios, zero-shot transfer

0

0

0

0

10:08

03/05/2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Boxin Wang, Shuohang Wang, Yu Cheng and
Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

Keywords Paper

adversarial training, QA, NLI, BERT, information theory, adversarial robustness

0

0

0

0

5:21

05/01/2021

Faces a la Carte: Text-to-Face Generation via Attribute Disentanglement

Tianren Wang, Teng Zhang, Brian Lovell

Keywords Paper

0

0

0

0

4:33

26/04/2020

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords Paper

Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

0

0

0

0

5:16

06/12/2021

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

machine learning, fairness, language

0

0

0

0

13:17

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

16/11/2020

DAGA: Data Augmentation with a Generation Approach forLow-resource Tagging Tasks

Bosheng Ding, Linlin Liu, Lidong Bing and
Canasai Kruengkrai, Thien Hai Nguyen, Shafiq Joty, Luo Si, Chunyan Miao

Keywords Paper

machine learning, generalization, low-resource tasks, named recognition

0

0

0

0

11:09

06/12/2020

Shared Space Transfer Learning for analyzing multi-site fMRI data

Tony Yousefnezhad, Alessandro Selvitella, Daoqiang Zhang and
Andrew Greenshaw, Russell Greiner

Keywords Paper

0

0

0

0

3:06

26/04/2020

Neural Machine Translation with Universal Visual Representation

Zhuosheng Zhang, Kehai Chen, Rui Wang and
Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Keywords Paper

Neural Machine Translation, Visual Representation, Multimodal Machine Translation, Language Representation

0

0

0

0

4:50

19/10/2020

ReQue: A configurable workflow and dataset collection for query refinement

Mahtab Tamannaee, Hossein Fani, Fattane Zarrinkalam and
Jamil Samouh, Samad Paydar, Ebrahim Bagheri

Keywords Paper

gold standard dataset, query refinement, reproducibility

0

0

0

0

15:21

04/07/2020

On the Linguistic Representational Power of Neural Machine Translation Models

Yonatan Belinkov, Nadir Durrani, Fahim Dalvi and
Hassan Sajjad, James Glass

Keywords Paper

Linguistic Models, natural processing, artificial intelligence, translating languages

0

0

0

0

19:17

16/11/2020

Partially-Aligned Data-to-Text Generation with Distant Supervision

Zihao Fu, Bei Shi, Wai Lam and
Lidong Bing, Zhiyuan Liu

Keywords Paper

data-to-text task, generation task, dataset problem, over-generation problem

0

0

0

0

11:58

04/07/2020

Pre-training Is (Almost) All You Need: An Application to Commonsense Reasoning

Alexandre Tamborrino, Nicola Pellicanò, Baptiste Pannier and
Pascal Voitot, Louise Naudin

Keywords Paper

Commonsense Reasoning, common tasks, plausibility task, pre-training phase

0

0

0

0

11:39

03/05/2021

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu and
Luxi Xing, Heng Yu, Weihua Luo

Keywords Paper

hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

0

0

0

0

3:51

06/12/2021

Is Automated Topic Model Evaluation Broken? The Incoherence of Coherence

Alexander Hoyle, Pranav Goel, Andrew Hian-Cheong and
Denis Peskov, Jordan Boyd-Graber, Philip Resnik

Keywords Paper

0

0

0

0

15:00

16/11/2020

X-FACTR: Multilingual Factual Knowledge Retrieval from Pretrained Language Models

Zhengbao Jiang, Antonios Anastasopoulos, Jun Araki and
Haibo Ding, Graham Neubig

Keywords Paper

factual retrieval, language models, lms, probing methods

0

0

0

0

9:45

03/05/2021

Iterated learning for emergent systematicity in VQA

Ankit Vani, Max Schwarzer, Yuchen Lu and
Eeshan Dhekane, Aaron Courville

Keywords Paper

clevr, vqa, shapes, neural module network, cultural transmission, iterated learning, visual question answering, systematic generalization, compositionality

0

0

0

0

15:10