Contextualized Word Embeddings Encode Aspects of Human-Like Word Sense Knowledge

08/12/2020

Sathvik Nair, Mahesh Srinivasan, Stephan Meylan

Abstract: Understanding context-dependent variation in word meanings is a key aspect of human language comprehension supported by the lexicon. Lexicographic resources (e.g., WordNet) capture only some of this context-dependent variation; for example, they often do not encode how closely senses, or discretized word meanings, are related to one another. Our work investigates whether recent advances in NLP, specifically contextualized word embeddings, capture human-like distinctions between English word senses, such as polysemy and homonymy. We collect data from a behavioral, web-based experiment, in which participants provide judgments of the relatedness of multiple WordNet senses of a word in a two-dimensional spatial arrangement task. We find that participants’ judgments of the relatedness between senses are correlated with distances between senses in the BERT embedding space. Specifically, homonymous senses (e.g., bat as mammal vs. bat as sports equipment) are reliably more distant from one another in the embedding space than polysemous ones (e.g., chicken as animal vs. chicken as meat). Our findings point towards the potential utility of continuous-space representations of sense meanings.
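
As a rough illustration of the kind of comparison the abstract describes, the sketch below averages token vectors into one vector per sense, measures cosine distance between senses, and correlates those distances with relatedness judgments using Spearman rank correlation. All vectors, sense labels, and human distances are made-up toy values, and the helper names are hypothetical; the authors' actual pipeline, distance metric, and statistics may differ.

```python
import math
from itertools import combinations


def centroid(vectors):
    """Average a list of equal-length token vectors into one sense vector."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def cosine_distance(u, v):
    """Return 1 - cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)


def ranks(values):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mean_x, mean_y = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on ranks."""
    return pearson(ranks(x), ranks(y))


# Toy 3-dimensional "token embeddings" for three senses of "bat".
# Real contextualized embeddings would come from a model such as BERT.
token_vectors = {
    "bat/mammal": [[1.0, 0.1, 0.0], [0.9, 0.2, 0.1]],
    "bat/equipment": [[0.1, 1.0, 0.2], [0.0, 0.9, 0.3]],
    "bat/turn_at_bat": [[0.1, 0.8, 0.6], [0.2, 0.7, 0.7]],
}
sense_vectors = {sense: centroid(vecs) for sense, vecs in token_vectors.items()}

# Every unordered pair of senses, in a fixed order.
pairs = list(combinations(sense_vectors, 2))
model_distances = [
    cosine_distance(sense_vectors[a], sense_vectors[b]) for a, b in pairs
]

# Hypothetical human distances for the same pairs (larger = less related).
human_distances = [0.9, 0.8, 0.2]

rho = spearman(model_distances, human_distances)
for (a, b), dist in zip(pairs, model_distances):
    print(f"{a} vs {b}: {dist:.3f}")
print(f"Spearman rho: {rho:.3f}")
```

The toy values were chosen so that the homonymous pair (mammal vs. equipment) ends up farther apart than the two related senses, which mirrors the pattern reported in the abstract without reproducing any of its data.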

The video of this talk cannot be embedded. You can watch it here:

https://underline.io/lecture/6479-contextualized-word-embeddings-encode-aspects-of-human-like-word-sense-knowledge

The talk and the corresponding paper were published at the COLING Workshops 2020 virtual conference.

Similar Papers

02/02/2021

Have We Solved The Hard Problem? It’s Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence

Wenqiang Lei, Yisong Miao, Runpeng Xie, Bonnie Webber, Meichun Liu, Tat-Seng Chua, Nancy F. Chen

18:55

04/07/2020

Cross-Linguistic Syntactic Evaluation of Word Prediction Models

Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou, Natalia Talmina, Tal Linzen

Keywords: Cross-Linguistic Syntax, Syntax, Cross-Linguistic Models, neural models

10:48

08/12/2020

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti, Anna Korhonen, Goran Glavaš

13:01

02/02/2021

Conceptualized and Contextualized Gaussian Embedding

Chen Qian, Fuli Feng, Lijie Wen, Tat-Seng Chua

14:47

01/07/2020

Are All Languages Created Equal in Multilingual BERT?

Shijie Wu, Mark Dredze

7:45

08/12/2020

Assessing Polyseme Sense Similarity through Co-predication Acceptability and Contextualised Embedding Distance

Janosch Haber, Massimo Poesio

14:07

19/04/2021

Machine translationese: Effects of algorithmic bias on linguistic complexity in machine translation

Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam

11:19

01/07/2020

Syntactic Parsing in Humans and Machines

Paola Merlo

44:12

02/02/2021

Matching on Sets: Conquer Occluded Person Re-identification Without Alignment

Mengxi Jia, Xinhua Cheng, Yunpeng Zhai, Shijian Lu, Siwei Ma, Yonghong Tian, Jian Zhang

15:02

06/12/2021

Can fMRI reveal the representation of syntactic structure in the brain?

Aniketh Janardhan Reddy, Leila Wehbe

Keywords: neuroscience, graph learning

15:02

16/11/2020

Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings

Vaibhav Kumar, Tenzin Bhotia, Vaibhav Kumar, Tanmoy Chakraborty

Keywords: word embeddings, semantic words, coreference resolution, post-processing methods

11:56

16/11/2020

Speakers Fill Lexical Semantic Gaps with Context

Tiago Pimentel, Rowan Hall Maudslay, Damian Blasi, Ryan Cotterell

Keywords: bert-based ambiguity, human annotation, lexical ambiguity, ambiguous words

10:05

16/11/2020

Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA

Ieva Staliūnaitė, Ignacio Iacobacci

Keywords: nlp tasks, conversational task, semantic labeling, contextualized embeddings

11:23

01/07/2020

Joint Training with Semantic Role Labeling for Better Generalization in Natural Language Inference

Cemil Cengiz, Deniz Yuret

4:38

04/07/2020

Why is penguin more similar to polar bear than to sea gull? Analyzing conceptual knowledge in distributional models

Pia Sommerauer

Keywords: word ing, distributional models, BERT, ELMO

11:17

06/12/2020

Modeling Task Effects on Meaning Representation in the Brain via Zero-Shot MEG Prediction

Mariya Toneva, Otilia Stretcu, Barnabas Poczos, Leila Wehbe, Tom Mitchell

3:24

02/02/2021

DecAug: Augmenting HOI Detection via Decomposition

Hao-Shu Fang, Yichen Xie, Dian Shao, Yong-Lu Li, Cewu Lu

9:02

02/02/2021

Circles are like Ellipses, or Ellipses are like Circles? Measuring the Degree of Asymmetry of Static and Contextual Word Embeddings and the Implications to Representation Learning

Wei Zhang, Murray Campbell, Yang Yu, Sadhana Kumaravel

13:34

04/07/2020

Recurrent Neural Network Language Models Always Learn English-Like Relative Clause Attachment

Forrest Davis, Marten van Schijndel

Keywords: production, Recurrent Always, language models, RNN LMs

7:48

16/11/2020

Analyzing Individual Neurons in Pre-trained Language Models

Nadir Durrani, Hassan Sajjad, Fahim Dalvi, Yonatan Belinkov

Keywords: neuron-level analysis, linguistic tasks, deep models, pre-trained models

13:36

19/04/2021

Stereotype and skew: Quantifying gender bias in pre-trained and fine-tuned language models

Daniel Vassimon Manela, David Errington, Thomas Fisher, Boris Breugel, Pasquale Minervini

6:54

08/12/2020

Understanding the effects of word-level linguistic annotations in under-resourced neural machine translation

Víctor M. Sánchez-Cartagena, Juan Antonio Pérez-Ortiz, Felipe Sánchez-Martínez

14:59

19/08/2021

Feature Space Targeted Attacks by Statistic Alignment

Lianli Gao, Yaya Cheng, Qilong Zhang, Xing Xu, Jingkuan Song

Keywords: Computer Vision, 2D and 3D Computer Vision, Recognition, Adversarial Machine Learning

12:17

02/11/2020

Multi-task regularization based on infrequent classes for audio captioning

Emre Çakır, Konstantinos Drossos, Tuomas Virtanen

16:13

07/09/2020

From Saturation to Zero-Shot Visual Relationship Detection Using Local Context

Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos

Keywords: Visual Relationship Detection, Scene Graph Generation, Zero-shot Classification, Local Context, Language Bias

7:17

06/12/2021

Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement

Aming WU, Suqi Zhao, Cheng Deng, Wei Liu

Keywords: machine learning, vision

9:04

04/07/2020

LINSPECTOR: Multilingual Probing Tasks for Word Representations

Gözde Gül Sahin, Clara Vania, Ilia Kuznetsov, Iryna Gurevych

Keywords: Word Representations, NLP, classification tasks, probing tasks

11:51

14/06/2020

Robust Partial Matching for Person Search in the Wild

Yingji Zhong, Xiaoyu Wang, Shiliang Zhang

Keywords: person search, person re-identification, occlusion handling, bounding box refinement, feature alignment, partial matching, pedestrian detection, person retrieval, image retrieval, computer vision

1:01

07/06/2021

Discovering and Categorising Language Biases in Reddit

Xavier Ferrer, Tom Van Nuenen, Jose M. Such, Natalia Criado

Keywords: Qualitative and quantitative studies of social media, Social network analysis, communities identification, expertise and authority discovery, Subjectivity in textual data, sentiment analysis, polarity/opinion identification and extraction, linguistic analy

8:03

16/11/2020

With More Contexts Comes Better Performance: Contextualized Sense Embeddings for All-Round Word Sense Disambiguation

Bianca Scarlini, Tommaso Pasini, Roberto Navigli

Keywords: natural processing, english task, word-in-context task, contextualized embeddings

12:11

04/07/2020

Gender Bias in Multilingual Embeddings and Cross-Lingual Transfer

Jieyu Zhao, Subhabrata Mukherjee, Saghar Hosseini, Kai-Wei Chang, Ahmed Hassan Awadallah

Keywords: cross-lingual transfer, multilingual embeddings, NLP applications, bias analysis

11:42

03/05/2021

On Learning Universal Representations Across Languages

Xiangpeng Wei, Rongxiang Weng, Yue Hu, Luxi Xing, Heng Yu, Weihua Luo

Keywords: hierarchical contrastive learning, cross-lingual pretraining, universal representation learning

3:51

03/05/2021

Probing BERT in Hyperbolic Spaces

Boli Chen, Yao Fu, Guangwei Xu, Pengjun Xie, Chuanqi Tan, Mosha Chen, Liping Jing

Keywords: Sentiment, Syntax, Probe, BERT, Hyperbolic

5:10

08/12/2020

An analysis of language models for metaphor recognition

Arthur Neidlein, Philip Wiesenbach, Katja Markert

13:52

16/11/2020

Investigating representations of verb bias in neural language models

Robert Hawkins, Takateru Yamakoshi, Thomas Griffiths, Adele Goldberg

Keywords: grammatical construction, dais, neural models, transformer architectures

7:06

19/08/2021

Boundary Knowledge Translation based Reference Semantic Segmentation

Lechao Cheng, Zunlei Feng, Xinchao Wang, Ya Jie Liu, Jie Lei, Mingli Song

Keywords: Computer Vision, 2D and 3D Computer Vision, Deep Learning, Transfer, Adaptation, Multi-task Learning

14:22

16/11/2020

Detecting Fine-Grained Cross-Lingual Semantic Divergences without Supervision by Learning to Rank

Eleftheria Briakou, Marine Carpuat

Keywords: detecting content, cross-lingual nlp, machine problem, annotation

11:06

06/12/2021

PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Yi Ren, Jinglin Liu, Zhou Zhao

Keywords: generative model

10:15

16/11/2020

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis

Yao-Hung Hubert Tsai, Martin Ma, Muqiao Yang, Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords: human-centric tasks, sentiment analysis, emotion recognition, multimodal learning

10:54

02/02/2021

Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks

Maurício Gruppi, Pin-Yu Chen, Sibel Adali

19:35