Supervised Seeded Iterated Learning for Interactive Language Learning

Abstract: Language drift has been one of the major obstacles to train language models through interaction. When word-based conversational agents are trained towards completing a task, they tend to invent their language rather than leveraging natural language. In recent literature, two general methods partially counter this phenomenon: Supervised Selfplay (S2P) and Seeded Iterated Learning (SIL). While S2P jointly trains interactive and supervised losses to counter the drift, SIL changes the training dynamics to prevent language drift from occurring. In this paper, we first highlight their respective weaknesses, i.e., late-stage training collapses and higher negative likelihood when evaluated on human corpus. Given these observations, we introduce Supervised Seeded Iterated Learning (SSIL) to combine both methods to minimize their respective weaknesses. We then show the effectiveness of in the language-drift translation game.

12/07/2020

Supervised Seeded Iterated Learning for Interactive Language Learning

Yuchen Lu, Soumye Singhal, Florian Strub, Olivier Pietquin, Aaron Courville

Comments

Similar Papers

Countering Language Drift with Seeded Iterated Learning

Yuchen Lu, Soumye Singhal, Florian Strub and Aaron Courville, Olivier Pietquin

Keywords Abstract Paper

Deep Learning - Algorithms

Leveraging adversarial training in self-learning for cross-lingual text classification

Xin Dong, Yaxin Zhu, Yupeng Zhang and Zuohui Fu, Dongkuan Xu, Sen Yang, Gerard Melo

Keywords Abstract Paper

multilingual, semantics, text classification, cross-lingual

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and Xisen Jin, Xiang Ren

Keywords Abstract Paper

machine learning, fairness, language

On the interaction between supervision and self-play in emergent communication

Ryan Lowe*, Abhinav Gupta*, Jakob Foerster and Douwe Kiela, Joelle Pineau

Keywords Abstract Paper

multi-agent communication, self-play, emergent languages

Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings?

Xuancheng Ren, Xu Sun, Houfeng Wang, Qun Liu

Keywords Abstract Paper

Modelling Hierarchical Structure between Dialogue Policy and Natural Language Generator with Option Framework for Task-oriented Dialogue System

Jianhong Wang, Yuan Zhang, Tae-Kyun Kim, Yunjie Gu

Keywords Abstract Paper

Task-oriented Dialogue System, Hierarchical Reinforcement Learning, Policy Optimization, Natural Language Processing

Meta-Learning of Structured Task Distributions in Humans and Machines

Sreejan Kumar, Ishita Dasgupta, Jonathan Cohen and Nathaniel Daw, Thomas L Griffiths

Keywords Abstract Paper

reinforcement learning, compositionality, human cognition, meta-learning

Learning Rewards From Linguistic Feedback

Theodore R. Sumers, Mark K. Ho, Robert D. Hawkins and Karthik Narasimhan, Thomas L. Griffiths

Keywords Abstract Paper

Multi-agent Communication meets Natural Language: Synergies between Functional and Structural Language Learning

Angeliki Lazaridou, Anna Potapenko, Olivier Tieleman

Keywords Abstract Paper

Multi-agent Communication, natural learning, visual task, Functional Learning

Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision

Hao Tan, Mohit Bansal

Keywords Abstract Paper

speaking, writing, text-only self-supervision, pure-language tasks

Does typological blinding impede cross-lingual sharing?

Johannes Bjerva, Isabelle Augenstein

Keywords Abstract Paper

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Abstract Paper

Learning to summarize with human feedback

Nisan Stiennon, Long Ouyang, Jeffrey Wu and Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Keywords Abstract Paper

Linguistic Features for Readability Assessment

Tovly Deutsch, Masoud Jasbi, Stuart Shieber

Keywords Abstract Paper

Contrastive Learning with Adversarial Perturbations for Conditional Text Generation

Seanie Lee, Dong Bok Lee, Sung Ju Hwang

Keywords Abstract Paper

contrastive learning, conditional text generation

Shaping Visual Representations with Language for Few-Shot Classification

Jesse Mu, Percy Liang, Noah Goodman

Keywords Abstract Paper

Few-Shot Classification, human learning, supervision, machine models

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Abstract Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

End-to-End Bias Mitigation by Modelling Biases in Corpora

Rabeeh Karimi Mahabadi, Yonatan Belinkov, James Henderson

Keywords Abstract Paper

End-to-End Mitigation, real-world scenarios, training, large-scale benchmarks

On Negative Interference in Multilingual Models: Findings and A Meta-Learning Treatment

Zirui Wang, Zachary C. Lipton, Yulia Tsvetkov

Keywords Abstract Paper

multilingual models, meta-learning algorithm, multilingual representations, negative interference

Latent Template Induction with Gumbel-CRFs

Yao Fu, Chuanqi Tan, Bin Bi and Mosha Chen, Yansong Feng, Alexander Rush

Keywords Abstract Paper

Enhancing Answer Boundary Detection for Multilingual Machine Reading Comprehension

Fei Yuan, Linjun Shou, Xuanyu Bai and Ming Gong, Yaobo Liang, Nan Duan, Yan Fu, Daxin Jiang

Keywords Abstract Paper

Yuchen Lu, Soumye Singhal, Florian Strub and
Aaron Courville, Olivier Pietquin

Keywords Paper

Xin Dong, Yaxin Zhu, Yupeng Zhang and
Zuohui Fu, Dongkuan Xu, Sen Yang, Gerard Melo

Keywords Paper

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

Ryan Lowe, Abhinav Gupta, Jakob Foerster and
Douwe Kiela, Joelle Pineau

Keywords Paper

Keywords Paper

Keywords Paper

Sreejan Kumar, Ishita Dasgupta, Jonathan Cohen and
Nathaniel Daw, Thomas L Griffiths

Keywords Paper

Theodore R. Sumers, Mark K. Ho, Robert D. Hawkins and
Karthik Narasimhan, Thomas L. Griffiths

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Nisan Stiennon, Long Ouyang, Jeffrey Wu and
Daniel Ziegler, Ryan Lowe, Chelsea Voss, Alec Radford, Dario Amodei, Paul Christiano

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yao Fu, Chuanqi Tan, Bin Bi and
Mosha Chen, Yansong Feng, Alexander Rush

Keywords Paper

Fei Yuan, Linjun Shou, Xuanyu Bai and
Ming Gong, Yaobo Liang, Nan Duan, Yan Fu, Daxin Jiang

Keywords Paper

Keywords Paper

Keywords Paper

Ruijian Xu, Chongyang Tao, Daxin Jiang and
Xueliang Zhao, Dongyan Zhao, Rui Yan

Keywords Paper

Ante Wang, Linfeng Song, Hui Jiang and
Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords Paper

Mohamed Afham Mohamed Aflal, Salman Khan, Muhammad Haris Khan and
Muzammal Naseer, Fahad Shahbaz Khan

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yannis Kalantidis, Mert Bulent Sariyildiz, Noe Pion and
Philippe Weinzaepfel, Diane Larlus

Keywords Paper

Keywords Paper

Keywords Paper

Siddharth Desai, Ishan Durugkar, Haresh Karnan and
Garrett Warnell, Josiah Hanna, Peter Stone

Keywords Paper