Disentangling syntax and semantics in the brain with deep networks

Abstract: The activations of language transformers like GPT-2 have been shown to linearly map onto brain activity during speech comprehension. However, the nature of these activations remains largely unknown and presumably conflate distinct linguistic classes. Here, we propose a taxonomy to factorize the high-dimensional activations of language models into four combinatorial classes: lexical, compositional, syntactic, and semantic representations. We then introduce a statistical method to decompose, through the lens of GPT-2's activations, the brain activity of 345 subjects recorded with functional magnetic resonance imaging (fMRI) during the listening of ~4.6 hours of narrated text. The results highlight two findings. First, compositional representations recruit a more widespread cortical network than lexical ones, and encompass the bilateral temporal, parietal and prefrontal cortices. Second, contrary to previous claims, syntax and semantics are not associated with separated modules, but, instead, appear to share a common and distributed neural substrate. Overall, this study introduces a versatile framework to isolate, in the brain activity, the distributed representations of linguistic constructs.

06/12/2021

Disentangling syntax and semantics in the brain with deep networks

Charlotte Caucheteux, Alexandre Gramfort, Jean-Remi King

Comments

Similar Papers

Can fMRI reveal the representation of syntactic structure in the brain?

Aniketh Janardhan Reddy, Leila Wehbe

Keywords Abstract Paper

neuroscience, graph learning

On the Linguistic Representational Power of Neural Machine Translation Models

Yonatan Belinkov, Nadir Durrani, Fahim Dalvi and Hassan Sajjad, James Glass

Keywords Abstract Paper

Linguistic Models, natural processing, artificial intelligence, translating languages

Emergence of Separable Manifolds in Deep Language Representations

Jonathan Mamou, Hang Le, Miguel del Rio Fernandez and Cory Stephenson, Hanlin Tang, Yoon Kim, SueYeon Chung

Keywords Abstract Paper

Applications - Neuroscience, Cognitive Science, Biology and Health

Bidirectional Variational Inference for Non-Autoregressive Text-to-Speech

Yoonhyung Lee, Joongbo Shin, Kyomin Jung

Keywords Abstract Paper

VAE, non-autoregressive, speech synthesis, text-to-speech

Probing Pretrained Language Models for Lexical Semantics

Ivan Vulić, Edoardo Maria Ponti, Robert Litschko and Goran Glavaš, Anna Korhonen

Keywords Abstract Paper

lexical tasks, pretrained models, lms, lexical strategies

Compositional and Lexical Semantics in RoBERTa, BERT and DistilBERT: A Case Study on CoQA

Ieva Staliūnaitė, Ignacio Iacobacci

Keywords Abstract Paper

nlp tasks, conversational task, semantic labeling, contextualized embeddings

Low-dimensional Structure in the Space of Language Representations is Reflected in Brain Responses

Richard Antonello, Javier S Turek, Vy Vo, Alexander Huth

Keywords Abstract Paper

vision, language, transfer learning

SimulSpeech: End-to-End Simultaneous Speech to Text Translation

Yi Ren, Jinglin Liu, Xu Tan and Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu

Keywords Abstract Paper

simultaneous translation, simultaneous recognition, ASR, NMT

Understanding Adaptive, Multiscale Temporal Integration In Deep Speech Recognition Systems

Menoua Keshishian, Samuel Norman-Haignere, Nima Mesgarani

Keywords Abstract Paper

deep learning, machine learning

Interactive Speech and Noise Modeling for Speech Enhancement

Chengyu Zheng, Xiulian Peng, Yuan Zhang and Sriram Srinivasan, Yan Lu

Keywords Abstract Paper

PortaSpeech: Portable and High-Quality Generative Text-to-Speech

Yi Ren, Jinglin Liu, Zhou Zhao

Keywords Abstract Paper

generative model

Analyzing Individual Neurons in Pre-trained Language Models

Nadir Durrani, Hassan Sajjad, Fahim Dalvi, Yonatan Belinkov

Keywords Abstract Paper

neuron-level analysis, linguistic tasks, deep models, pre-trained models

Training effective neural CLIR by bridging the translation gap

Hamed Bonab, Sheikh Muhammad Sarwar, James Allan

Keywords Abstract Paper

cross-lingual word embedding, cross-lingual information retrieval, neural clir, translation gap

A Tale of Two Perplexities: Sensitivity of Neural Language Models to Lexical Retrieval Deficits in Dementia of the Alzheimer’s Type

Trevor Cohen, Serguei Pakhomov

Keywords Abstract Paper

Lexical Deficits, diagnostic classification, Neural Models, computational methods

Syntactic Parsing in Humans and Machines

Paola Merlo

Keywords Abstract Paper

Mapping the Timescale Organization of Neural Language Models

Hsiang-Yun Sherry Chien, Jinhan Zhang, Christopher Honey

Keywords Abstract Paper

natural language processing, hierarchy, temporal context, timescale, LSTM

DialogXL: All-in-One XLNet for Multi-Party Conversation Emotion Recognition

Weizhou Shen, Junqing Chen, Xiaojun Quan, Zhixian Xie

Keywords Abstract Paper

How Self-Attention Improves Rare Class Performance in a Question-Answering Dialogue Agent

Adam Stiff, Qi Song, Eric Fosler-Lussier

Keywords Abstract Paper

Speech-T: Transducer for Text to Speech and Beyond

Jiawei Chen, Xu Tan, Yichong Leng and Jin Xu, Guihua Wen, Tao Qin, Tie-Yan Liu

Keywords Abstract Paper

transformers

How much complexity does an RNN architecture need to learn syntax-sensitive dependencies?

Gantavya Bhatt, Hritik Bansal, Rishubh Singh, Sumeet Agarwal

Keywords Abstract Paper

linguistic tasks, unsupervised setting, sentence grammaticality, language tasks

Keywords Paper

Yonatan Belinkov, Nadir Durrani, Fahim Dalvi and
Hassan Sajjad, James Glass

Keywords Paper

Jonathan Mamou, Hang Le, Miguel del Rio Fernandez and
Cory Stephenson, Hanlin Tang, Yoon Kim, SueYeon Chung

Keywords Paper

Keywords Paper

Ivan Vulić, Edoardo Maria Ponti, Robert Litschko and
Goran Glavaš, Anna Korhonen

Keywords Paper

Keywords Paper

Keywords Paper

Yi Ren, Jinglin Liu, Xu Tan and
Chen Zhang, Tao Qin, Zhou Zhao, Tie-Yan Liu

Keywords Paper

Keywords Paper

Chengyu Zheng, Xiulian Peng, Yuan Zhang and
Sriram Srinivasan, Yan Lu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Jiawei Chen, Xu Tan, Yichong Leng and
Jin Xu, Guihua Wen, Tao Qin, Tie-Yan Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yutai Hou, Sanyuan Chen, Wanxiang Che and
Cheng Chen, Ting Liu

Keywords Paper

Aaron Mueller, Garrett Nicolai, Panayiota Petrou-Zeniou and
Natalia Talmina, Tal Linzen

Keywords Paper

Zhuosheng Zhang, Kehai Chen, Rui Wang and
Masao Utiyama, Eiichiro Sumita, Zuchao Li, Hai Zhao

Keywords Paper

Mingda Li, Xinyue Liu, Weitong Ruan and
Luca Soldaini, Wael Hamza, Chengwei Su

Keywords Paper

Zhiqi Huang, Fenglin Liu, Xian Wu and
Shen Ge, Helin Wang, Wei Fan, Yuexian Zou

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yuntian Deng, Anton Bakhtin, Myle Ott and
Arthur Szlam, Marc'Aurelio Ranzato

Keywords Paper

Sang-Hoon Lee, Hyun-Wook Yoon, Hyeong-Rae Noh and
Ji-Hoon Kim, Seong-Whan Lee

Keywords Paper