Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data

Abstract: Can we develop visually grounded dialog agents that can efficiently adapt to new tasks without forgetting how to talk to people? Such agents could leverage a larger variety of existing data to generalize to a new task, minimizing expensive data collection and annotation. In this work, we study a setting we call "Dialog without Dialog", which requires agents to develop visually grounded dialog models that can adapt to new tasks without language level supervision. By factorizing intention and language, our model minimizes linguistic drift after fine-tuning for new tasks. We present qualitative results, automated metrics, and human studies that all show our model can adapt to new tasks and maintain language quality. Baselines either fail to perform well at new tasks or experience language drift, becoming unintelligible to humans. Code has been made available at: https://github.com/mcogswell/dialog_without_dialog.

06/12/2021

Dialog without Dialog Data: Learning Visual Dialog Agents from VQA Data

Michael Cogswell, Jiasen Lu, Rishabh Jain, Stefan Lee, Devi Parikh, Dhruv Batra

Comments

Similar Papers

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and Xisen Jin, Xiang Ren

Keywords Abstract Paper

machine learning, fairness, language

Modular Meta-Learning with Shrinkage

Yutian Chen, Abe Friesen, Feryal Behbahani and Arnaud Doucet, David Budden, Matthew Hoffman, Nando de Freitas

Keywords Abstract Paper

Do Response Selection Models Really Know What’s Next? Utterance Manipulation Strategies for Multi-turn Response Selection

Taesun Whang, Dongyub Lee, Dongsuk Oh and Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

Keywords Abstract Paper

Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision

Hao Tan, Mohit Bansal

Keywords Abstract Paper

speaking, writing, text-only self-supervision, pure-language tasks

Learning Spoken Language Representations with Neural Lattice Language Modeling

Chao-Wei Huang, Yun-Nung Chen

Keywords Abstract Paper

NLP tasks, spoken tasks, intent detection, Spoken Representations

Compositional Generalization by Factorizing Alignment and Translation

Jacob Russin, Jason Jo, Randall O'Reilly, Yoshua Bengio

Keywords Abstract Paper

Compositional Generalization, Translation, natural processing, cognitive science

Shaping Visual Representations with Language for Few-Shot Classification

Jesse Mu, Percy Liang, Noah Goodman

Keywords Abstract Paper

Few-Shot Classification, human learning, supervision, machine models

Semantic Role Labeling Guided Multi-turn Dialogue ReWriter

Kun Xu, Haochen Tan, Linfeng Song and Han Wu, Haisong Zhang, Linqi Song, Dong Yu

Keywords Abstract Paper

multi-turn rewriting, attentive models, semantic labeling, semantic

Self-Supervised Meta-Learning for Few-Shot Natural Language Classification Tasks

Trapit Bansal, Rishikesh Jha, Tsendsuren Munkhdalai, Andrew McCallum

Keywords Abstract Paper

nlp applications, fine-tuning, meta-learning problem, supervised tasks

Multilingual and cross-lingual document classification: A meta-learning approach

Niels Heijden, Helen Yannakoudakis, Pushkar Mishra, Ekaterina Shutova

Keywords Abstract Paper

Learning to Recombine and Resample Data For Compositional Generalization

Ekin Akyürek, Afra Feyza Akyürek, Jacob Andreas

Keywords Abstract Paper

sequence models, language processing, compositional generalization, data augmentation, generative modeling

Visually Grounded Continual Learning of Compositional Phrases

Xisen Jin, Junyi Du, Arka Sadhu and Ram Nevatia, Xiang Ren

Keywords Abstract Paper

visually task, continual phrases, visually-grounded task, compositional generalization

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Abstract Paper

Revisiting Mahalanobis Distance for Transformer-Based Out-of-Domain Detection

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and Ekaterina Artemova, Irina Piontkovskaya

Keywords Abstract Paper

Plug and Play Language Models: A Simple Approach to Controlled Text Generation

Sumanth Dathathri, Andrea Madotto, Janice Lan and Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, Rosanne Liu

Keywords Abstract Paper

controlled text generation, generative models, conditional generative models, language modeling, transformer

C2C-GenDA: Cluster-to-Cluster Generation for Data Augmentation of Slot Filling

Yutai Hou, Sanyuan Chen, Wanxiang Che and Cheng Chen, Ting Liu

Keywords Abstract Paper

Permutation Equivariant Models for Compositional Generalization in Language

Jonathan Gordon, David Lopez-Paz, Marco Baroni, Diane Bouchacourt

Keywords Abstract Paper

Compositionality, Permutation Equivariance, Language Processing

Linguistic Features for Readability Assessment

Tovly Deutsch, Masoud Jasbi, Stuart Shieber

Keywords Abstract Paper

Less is Better: A cognitively inspired unsupervised model for language segmentation

Jinbiao Yang, Stefan L. Frank, Antal van den Bosch

Keywords Abstract Paper

Leveraging User Paraphrasing Behavior In Dialog Systems To Automatically Collect Annotations For Long-Tail Utterances

Tobias Falke, Markus Boese, Daniil Sorokin and Caglar Tirkaz, Patrick Lehnen

Keywords Abstract Paper

Supervised Seeded Iterated Learning for Interactive Language Learning

Yuchen Lu, Soumye Singhal, Florian Strub and Olivier Pietquin, Aaron Courville

Keywords Abstract Paper

language drift, language-drift game, language models, word-based agents

Viewmaker Networks: Learning Views for Unsupervised Representation Learning

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

Yutian Chen, Abe Friesen, Feryal Behbahani and
Arnaud Doucet, David Budden, Matthew Hoffman, Nando de Freitas

Keywords Paper

Taesun Whang, Dongyub Lee, Dongsuk Oh and
Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Kun Xu, Haochen Tan, Linfeng Song and
Han Wu, Haisong Zhang, Linqi Song, Dong Yu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Xisen Jin, Junyi Du, Arka Sadhu and
Ram Nevatia, Xiang Ren

Keywords Paper

Keywords Paper

Alexander Podolskiy, Dmitry Lipin, Andrey Bout and
Ekaterina Artemova, Irina Piontkovskaya

Keywords Paper

Sumanth Dathathri, Andrea Madotto, Janice Lan and
Jane Hung, Eric Frank, Piero Molino, Jason Yosinski, Rosanne Liu

Keywords Paper

Yutai Hou, Sanyuan Chen, Wanxiang Che and
Cheng Chen, Ting Liu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Tobias Falke, Markus Boese, Daniil Sorokin and
Caglar Tirkaz, Patrick Lehnen

Keywords Paper

Yuchen Lu, Soumye Singhal, Florian Strub and
Olivier Pietquin, Aaron Courville

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Edoardo Barba, Luigi Procopio, Caterina Lacerra and
Tommaso Pasini, Roberto Navigli

Keywords Paper

Keywords Paper

Keywords Paper

Heng Gong, Yawei Sun, Xiaocheng Feng and
Bing Qin, Wei Bi, Xiaojiang Liu, Ting Liu

Keywords Paper

Shuo Sun, Marina Fomicheva, Frédéric Blain and
Vishrav Chaudhary, Ahmed El-Kishky, Adithya Renduchintala, Francisco Guzmán, Lucia Specia

Keywords Paper

Keywords Paper

Zibo Lin, Deng Cai, Yan Wang and
Xiaojiang Liu, Haitao Zheng, Shuming Shi

Keywords Paper

Keywords Paper

Keywords Paper

Yi-Lin Tuan, Ahmed El-Kishky, Adithya Renduchintala and
Vishrav Chaudhary, Francisco Guzmán, Lucia Specia

Keywords Paper

Laura Ruis, Jacob Andreas, Marco Baroni and
Diane Bouchacourt, Brenden Lake

Keywords Paper

Keywords Paper