Investigating representations of verb bias in neural language models

Abstract: Languages typically provide more than one grammatical construction to express certain types of messages. A speaker′s choice of construction is known to depend on multiple factors, including the choice of main verb -- a phenomenon known as verb bias. Here we introduce DAIS, a large benchmark dataset containing 50K human judgments for 5K distinct sentence pairs in the English dative alternation. This dataset includes 200 unique verbs and systematically varies the definiteness and length of arguments. We use this dataset, as well as an existing corpus of naturally occurring data, to evaluate how well recent neural language models capture human preferences. Results show that larger models perform better than smaller models, and transformer architectures (e.g. GPT-2) tend to out-perform recurrent architectures (e.g. LSTMs) even under comparable parameter and training settings. Additional analyses of internal feature representations suggest that transformers may better integrate specific lexical information with grammatical constructions.

04/07/2020

Investigating representations of verb bias in neural language models

Robert Hawkins, Takateru Yamakoshi, Thomas Griffiths, Adele Goldberg

Comments

Similar Papers

To compress or not to compress? A Finite-State approach to Nen verbal morphology

Saliha Muradoglu, Nicholas Evans, Hanna Suominen

Keywords Abstract Paper

Finite-State approach, verbal parser, Chunking, decomposition model

Residual Energy-Based Models for Text Generation

Yuntian Deng, Anton Bakhtin, Myle Ott and Arthur Szlam, Marc'Aurelio Ranzato

Keywords Abstract Paper

energy-based models, text generation

Leveraging Sentence Similarity in Natural Language Generation: Improving Beam Search using Range Voting

Sebastian Borgeaud, Guy Emerson

Keywords Abstract Paper

Improving Multilingual Models with Language-Clustered Vocabularies

Hyung Won Chung, Dan Garrette, Kiat Chuan Tan, Jason Riesa

Keywords Abstract Paper

massively applications, multilingual generation, cross-lingual sharing, multilingual models

MODE-LSTM: A Parameter-efficient Recurrent Network with Multi-Scale for Sentence Classification

Qianli Ma, Zhenxi Lin, Jiangyue Yan and Zipeng Chen, Liuhong Yu

Keywords Abstract Paper

sentence classification, extracting features, generalization, cnn models

Multi-task regularization based on infrequent classes for audio captioning

Emre Çakır, Konstantinos Drossos, Tuomas Virtanen

Keywords Abstract Paper

Have We Solved The Hard Problem? It’s Not Easy! Contextual Lexical Contrast as a Means to Probe Neural Coherence

Wenqiang Lei, Yisong Miao, Runpeng Xie and Bonnie Webber, Meichun Liu, Tat-Seng Chua, Nancy F. Chen

Keywords Abstract Paper

Multimodal Routing: Improving Local and Global Interpretability of Multimodal Language Analysis

Yao-Hung Hubert Tsai, Martin Ma, Muqiao Yang and Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Abstract Paper

human-centric tasks, sentiment analysis, emotion recognition, multimodal learning

Which transformer architecture fits my data? A vocabulary bottleneck in self-attention

Noam Wies, Yoav Levine, Daniel Jannai, Amnon Shashua

Keywords Abstract Paper

Theory, Deep learning Theory

Machine translationese: Effects of algorithmic bias on linguistic complexity in machine translation

Eva Vanmassenhove, Dimitar Shterionov, Matthew Gwilliam

Keywords Abstract Paper

Understanding Adaptive, Multiscale Temporal Integration In Deep Speech Recognition Systems

Menoua Keshishian, Samuel Norman-Haignere, Nima Mesgarani

Keywords Abstract Paper

deep learning, machine learning

Disambiguatory signals are stronger in word-initial positions

Tiago Pimentel, Ryan Cotterell, Brian Roark

Keywords Abstract Paper

Structured Pruning of Large Language Models

Ziheng Wang, Jeremy Wohlwend, Tao Lei

Keywords Abstract Paper

natural tasks, model compression, language tasks, pruning embeddings

Bayesian Methods for Semi-supervised Text Annotation

Kristian Miok, Gregor Pirs, Marko Robnik-Sikonja

Keywords Abstract Paper

Do you have the right scissors? Tailoring Pre-trained Language Models via Monte-Carlo Methods

Ning Miao, Yuxuan Song, Hao Zhou, Lei Li

Keywords Abstract Paper

over- problem, text tasks, Tailoring Models, Monte-Carlo Methods

ContraCAT: Contrastive Coreference Analytical Templates for Machine Translation

Dario Stojanovski, Benno Krojer, Denis Peskov, Alexander Fraser

Keywords Abstract Paper

C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling

Yutai Hou, Sanyuan Chen, Wanxiang Che and Cheng Chen, Ting Liu

Keywords Abstract Paper

Disentangling syntax and semantics in the brain with deep networks

Charlotte Caucheteux, Alexandre Gramfort, Jean-Remi King

Keywords Abstract Paper

Applications, Neuroscience and Cognitive Science

Document-Level Event Role Filler Extraction using Multi-Granularity Contextualized Encoding

Xinya Du, Claire Cardie

Keywords Abstract Paper

Document-Level Extraction, event extraction, extraction decisions, Multi-Granularity Encoding

Seq2Edits: Sequence Transduction Using Span-level Edit Operations

Felix Stahlberg, Shankar Kumar

Keywords Abstract Paper

sequence editing, natural tasks, nlp tasks, text normalization

Deep subjecthood: Higher-order grammatical features in multilingual BERT

Isabel Papadimitriou, Ethan A. Chi, Richard Futrell, Kyle Mahowald

Keywords Abstract Paper

On Importance Sampling-Based Evaluation of Latent Language Models

Keywords Paper

Yuntian Deng, Anton Bakhtin, Myle Ott and
Arthur Szlam, Marc'Aurelio Ranzato

Keywords Paper

Keywords Paper

Keywords Paper

Qianli Ma, Zhenxi Lin, Jiangyue Yan and
Zipeng Chen, Liuhong Yu

Keywords Paper

Keywords Paper

Wenqiang Lei, Yisong Miao, Runpeng Xie and
Bonnie Webber, Meichun Liu, Tat-Seng Chua, Nancy F. Chen

Keywords Paper

Yao-Hung Hubert Tsai, Martin Ma, Muqiao Yang and
Ruslan Salakhutdinov, Louis-Philippe Morency

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yutai Hou, Sanyuan Chen, Wanxiang Che and
Cheng Chen, Ting Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Changsheng Zhao, Ting Hua, Yilin Shen and
Qian Lou, Hongxia Jin

Keywords Paper

Jennifer Hu, Jon Gauthier, Peng Qian and
Ethan Wilcox, Roger Levy

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Ivan Vulić, Edoardo Maria Ponti, Robert Litschko and
Goran Glavaš, Anna Korhonen

Keywords Paper

Sumanth Dathathri, Andrea Madotto, Janice Lan and
Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, Rosanne Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Xisen Jin, Zhongyu Wei, Junyi Du and
Xiangyang Xue, Xiang Ren

Keywords Paper