Compositional Generalization by Factorizing Alignment and Translation

Abstract: Standard methods in deep learning for natural language processing fail to capture the compositional structure of human language that allows for systematic generalization outside of the training distribution. However, human learners readily generalize in this way, e.g. by applying known grammatical rules to novel words. Inspired by work in cognitive science suggesting a functional distinction between systems for syntactic and semantic processing, we implement a modification to an existing approach in neural machine translation, imposing an analogous separation between alignment and translation. The resulting architecture substantially outperforms standard recurrent networks on the SCAN dataset, a compositional generalization task, without any additional supervision. Our work suggests that learning to align and to translate in separate modules may be a useful heuristic for capturing compositional structure.

26/04/2020

Compositional Generalization by Factorizing Alignment and Translation

Jacob Russin, Jason Jo, Randall O'Reilly, Yoshua Bengio

Comments

Similar Papers

Permutation Equivariant Models for Compositional Generalization in Language

Jonathan Gordon, David Lopez-Paz, Marco Baroni, Diane Bouchacourt

Keywords Abstract Paper

Compositionality, Permutation Equivariance, Language Processing

Emergent Communication Pretraining for Few-Shot Machine Translation

Yaoyiran Li, Edoardo Maria Ponti, Ivan Vulić, Anna Korhonen

Keywords Abstract Paper

Linguistic Features for Readability Assessment

Tovly Deutsch, Masoud Jasbi, Stuart Shieber

Keywords Abstract Paper

Generating syntactically controlled paraphrases without using annotated parallel pairs

Kuan-Hao Huang, Kai-Wei Chang

Keywords Abstract Paper

On the Linguistic Representational Power of Neural Machine Translation Models

Yonatan Belinkov, Nadir Durrani, Fahim Dalvi and Hassan Sajjad, James Glass

Keywords Abstract Paper

Linguistic Models, natural processing, artificial intelligence, translating languages

Learning to Recombine and Resample Data For Compositional Generalization

Ekin Akyürek, Afra Feyza Akyürek, Jacob Andreas

Keywords Abstract Paper

sequence models, language processing, compositional generalization, data augmentation, generative modeling

Named Entity Recognition Only from Word Embeddings

Ying Luo, Hai Zhao, Junlang Zhan

Keywords Abstract Paper

named recognition, entity detection, type prediction, deep models

Quality estimation without human-labeled data

Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala and Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

Keywords Abstract Paper

Encodings of Source Syntax: Similarities in NMT Representations Across Target Languages

Tyler A. Chang, Anna Rafferty

Keywords Abstract Paper

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data

Michael Cogswell, Jiasen Lu, Rishabh Jain and Stefan Lee, Devi Parikh, Dhruv Batra

Keywords Abstract Paper

A Latent Morphology Model for Open-Vocabulary Neural Machine Translation

Duygu Ataman, Wilker Aziz, Alexandra Birch

Keywords Abstract Paper

neural machine translation, low-resource languages, latent-variable models

Uncertainty-Aware Semantic Augmentation for Neural Machine Translation

Xiangpeng Wei, Heng Yu, Yue Hu and Rongxiang Weng, Luxi Xing, Weihua Luo

Keywords Abstract Paper

sequence-to-sequence task, nmt, inference, translation tasks

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords Abstract Paper

Natural Inference, data augmentation, Robustifying Models, deep models

Learning Spoken Language Representations with Neural Lattice Language Modeling

Chao-Wei Huang, Yun-Nung Chen

Keywords Abstract Paper

NLP tasks, spoken tasks, intent detection, Spoken Representations

Neural Machine Translation Models with Back-Translation for the Extremely Low-Resource Indigenous Language Bribri

Isaac Feldman, Rolando Coto-Solano

Keywords Abstract Paper

A Sequence-to-Set Network for Nested Named Entity Recognition

Zeqi Tan, Yongliang Shen, Shuai Zhang and Weiming Lu, Yueting Zhuang

Keywords Abstract Paper

Natural Language Processing, Information Extraction, Named Entities

Sequence-Level Mixed Sample Data Augmentation

Demi Guo, Yoon Kim, Alexander Rush

Keywords Abstract Paper

sequence-to-sequence problems, scan, semantic parsing, neural networks

Structured Reordering for Modeling Latent Alignments in Sequence Transduction

bailin wang, Mirella Lapata, Ivan Titov

Keywords Abstract Paper

language

It's Easier to Translate out of English than into it: Measuring Neural Translation Difficulty by Cross-Mutual Information

Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos and Ryan Cotterell, Naoaki Okazaki

Keywords Abstract Paper

Measuring Difficulty, generation, asymmetric difficulty, machine difficulty

Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

Pengyu Cheng, Martin Renqiang Min, Dinghan Shen and Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

Keywords Abstract Paper

Learning language, NLP tasks, conditional generation, style transfer

An exploratory study on multilingual quality estimation

Shuo Sun, Marina Fomicheva, Frédéric Blain and Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala, Francisco Guzmán, Lucia Specia

Keywords Abstract Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yonatan Belinkov, Nadir Durrani, Fahim Dalvi and
Hassan Sajjad, James Glass

Keywords Paper

Keywords Paper

Keywords Paper

Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala and
Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

Keywords Paper

Keywords Paper

Michael Cogswell, Jiasen Lu, Rishabh Jain and
Stefan Lee, Devi Parikh, Dhruv Batra

Keywords Paper

Keywords Paper

Xiangpeng Wei, Heng Yu, Yue Hu and
Rongxiang Weng, Luxi Xing, Weihua Luo

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zeqi Tan, Yongliang Shen, Shuai Zhang and
Weiming Lu, Yueting Zhuang

Keywords Paper

Keywords Paper

Keywords Paper

Emanuele Bugliarello, Sabrina J. Mielke, Antonios Anastasopoulos and
Ryan Cotterell, Naoaki Okazaki

Keywords Paper

Pengyu Cheng, Martin Renqiang Min, Dinghan Shen and
Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

Keywords Paper

Shuo Sun, Marina Fomicheva, Frédéric Blain and
Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala, Francisco Guzmán, Lucia Specia

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Xi Ye, Qiaochu Chen, Xinyu Wang and
Isil Dillig, Greg Durrett

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yuwei Fang, Shuohang Wang, Zhe Gan and
Siqi Sun, Jingjing Liu

Keywords Paper

Rahma Chaabouni, Eugene Kharitonov, Diane Bouchacourt and
Emmanuel Dupoux, Marco Baroni

Keywords Paper

Keywords Paper

Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao and
Dong Cao, Zhen Lei, Stan Z. Li

Keywords Paper

Keywords Paper