RTFM: Generalising to New Environment Dynamics via Reading

26/04/2020

RTFM: Generalising to New Environment Dynamics via Reading

Victor Zhong, Tim Rocktäschel, Edward Grefenstette

Keywords: reinforcement learning, policy learning, reading comprehension, generalisation

Abstract Paper Similar Papers

Abstract: Obtaining policies that can generalise to new environments in reinforcement learning is challenging. In this work, we demonstrate that language understanding via a reading policy learner is a promising vehicle for generalisation to new environments. We propose a grounded policy learning problem, Read to Fight Monsters (RTFM), in which the agent must jointly reason over a language goal, relevant dynamics described in a document, and environment observations. We procedurally generate environment dynamics and corresponding language descriptions of the dynamics, such that agents must read to understand new environment dynamics instead of memorising any particular information. In addition, we propose txt2π, a model that captures three-way interactions between the goal, document, and observations. On RTFM, txt2π generalises to new environments with dynamics not seen during training via reading. Furthermore, our model outperforms baselines such as FiLM and language-conditioned CNNs on RTFM. Through curriculum learning, txt2π produces policies that excel on complex RTFM tasks requiring several reasoning and coreference steps.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Counterfactual Vision-and-Language Navigation: Unravelling the Unseen

Amin Parvaneh, Ehsan Abbasnejad, Damien Teney and
Javen Qinfeng Shi, Anton van den Hengel

Keywords Paper

0

0

0

0

3:17

14/06/2020

Object Relational Graph With Teacher-Recommended Learning for Video Captioning

Ziqi Zhang, Yaya Shi, Chunfeng Yuan and
Bing Li, Peijin Wang, Weiming Hu, Zheng-Jun Zha

Keywords Paper

vison and language, video captioning, seq2seq learning, object relational graph, teacher-recommended learning, gcn, visual relational reasoning, external language model, knowledge distillation, long-tailed problem

0

0

0

0

1:05

16/11/2020

Visually Grounded Continual Learning of Compositional Phrases

Xisen Jin, Junyi Du, Arka Sadhu and
Ram Nevatia, Xiang Ren

Keywords Paper

visually task, continual phrases, visually-grounded task, compositional generalization

0

0

0

0

10:50

25/07/2020

Balancing reinforcement learning training experiences in interactive information retrieval

Limin Chen, Zhiwen Tang, Grace Hui Yang

Keywords Paper

deep reinforcement learning, interactive IR, dynamic search

0

0

0

0

10:22

03/05/2021

Pre-training Text-to-Text Transformers for Concept-centric Common Sense

Wangchunshu Zhou, Dong-Ho Lee, Ravi Kiran Selvam and
Seyeon Lee, Xiang Ren

Keywords Paper

Self-supervised Learning, Commonsense Reasoning, Language Model Pre-training

0

0

0

0

4:56

03/05/2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Boxin Wang, Shuohang Wang, Yu Cheng and
Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

Keywords Paper

adversarial training, QA, NLI, BERT, information theory, adversarial robustness

0

0

0

0

5:21

03/05/2021

Grounding Language to Autonomously-Acquired Skills via Goal Generation

Ahmed Akakzia, Cédric Colas, Pierre-Yves Oudeyer and
Mohamed CHETOUANI, Olivier Sigaud

Keywords Paper

intrinsic motivations, Deep reinforcement learning, autonomous learning, symbolic representations

0

0

0

0

5:01

08/12/2020

Sequence-to-Sequence Networks Learn the Meaning of Reflexive Anaphora

Robert Frank, Jackson Petty

Keywords Paper

0

0

0

0

15:28

03/05/2021

Monte-Carlo Planning and Learning with Language Action Value Estimates

Youngsoo Jang, Seokin Seo, Jongmin Lee, Kee-Eung Kim

Keywords Paper

reinforcement learning, interactive fiction, Monte-Carlo tree search, natural language processing

0

0

0

0

4:57

06/12/2021

Grounding Spatio-Temporal Language with Transformers

Tristan Karch, Laetitia Teodorescu, Katja Hofmann and
Clément Moulin-Frier, Pierre-Yves Oudeyer

Keywords Paper

transformers

0

0

0

0

7:25

06/12/2021

Learning Knowledge Graph-based World Models of Textual Environments

Prithviraj Ammanabrolu, Mark Riedl

Keywords Paper

reinforcement learning and planning, transformers, graph learning, language

0

0

0

0

15:32

04/07/2020

Distilling Knowledge Learned in BERT for Text Generation

Yen-Chun Chen, Zhe Gan, Yu Cheng and
Jingzhou Liu, Jingjing Liu

Keywords Paper

Text Generation, language tasks, language generation, generation tasks

0

0

0

0

10:41

02/02/2021

Towards Semantics-Enhanced Pre-Training: Can Lexicon Definitions Help Learning Sentence Meanings?

Xuancheng Ren, Xu Sun, Houfeng Wang, Qun Liu

Keywords Paper

0

0

0

0

16:04

05/12/2020

Grounded PCFG induction with images

Lifeng Jin, William Schuler

Keywords Paper

0

0

0

0

15:07

08/12/2020

Bayesian Methods for Semi-supervised Text Annotation

Kristian Miok, Gregor Pirs, Marko Robnik-Sikonja

Keywords Paper

0

0

0

0

11:18

04/07/2020

Gated Convolutional Bidirectional Attention-based Model for Off-topic Spoken Response Detection

Yefei Zha, Ruobing Li, Hui Lin

Keywords Paper

Off-topic Detection, automated system, real-world applications, detecting responses

0

0

0

0

10:12

16/11/2020

Feature Adaptation of Pre-Trained Language Models across Languages and Domains with Robust Self-Training

Hai Ye, Qingyu Tan, Ruidan He and
Juntao Li, Hwee Tou Ng, Lidong Bing

Keywords Paper

unsupervised adaptation, self-training, pre-trained models, bert

0

0

0

0

10:33

16/11/2020

Supervised Seeded Iterated Learning for Interactive Language Learning

Yuchen Lu, Soumye Singhal, Florian Strub and
Olivier Pietquin, Aaron Courville

Keywords Paper

language drift, language-drift game, language models, word-based agents

0

0

0

0

6:56

04/07/2020

Semantic Parsing for English as a Second Language

Yuanyuan Zhao, Weiwei Sun, Junjie Cao, Xiaojun Wan

Keywords Paper

semantic parsing, second acquisition, Semantic Parsing, ESL

0

0

0

0

11:04

02/02/2021

Curriculum-Meta Learning for Order-Robust Continual Relation Extraction

Tongtong Wu, Xuekai Li, Yuan-Fang Li and
Gholamreza Haffari, Guilin Qi, Yujin Zhu, Guoqiang Xu

Keywords Paper

0

0

0

0

11:33

03/05/2021

Grounded Language Learning Fast and Slow

Felix Hill, Olivier Tieleman, Tamara von Glehn and
Nathaniel Wong, Hamza Merzic, Stephen Clark

Keywords Paper

memory, meta-learning, word-learning, grounding, fast-mapping, language, cognition

0

0

0

0

11:44

02/02/2021

Learning an Effective Context-Response Matching Model with Self-Supervised Tasks for Retrieval-based Dialogues

Ruijian Xu, Chongyang Tao, Daxin Jiang and
Xueliang Zhao, Dongyan Zhao, Rui Yan

Keywords Paper

0

0

0

1

16:40

04/07/2020

Temporally-Informed Analysis of Named Entity Recognition

Shruti Rijhwani, Daniel Preotiuc-Pietro

Keywords Paper

named recognition, NLP tasks, Natural models, language use

0

0

0

0

11:30

06/12/2021

Grad2Task: Improved Few-shot Text Classification Using Gradients for Task Representation

Jixuan Wang, Kuan-Chieh Wang, Frank Rudzicz, Michael Brudno

Keywords Paper

machine learning, transformers, meta learning, language, transfer learning

0

0

0

0

14:45

02/02/2021

Learning Contextual Representations for Semantic Parsing with Generation-Augmented Pre-Training

Peng Shi, Patrick Ng, Zhiguo Wang and
Henghui Zhu, Alexander Hanbo Li, Jun Wang, Cicero Nogueira dos Santos, Bing Xiang

Keywords Paper

0

0

0

0

15:15

25/07/2020

Improving contextual language models for response retrieval in multi-turn conversation

Junyu Lu, Xiancong Ren, Yazhou Ren and
Ao Liu, Zenglin Xu

Keywords Paper

pre-trained language model, augmentation, response retrieval

0

0

0

0

8:08

04/07/2020

Enhancing Cross-target Stance Detection with Transferable Semantic-Emotion Knowledge

Bowen Zhang, Min Yang, Xutao Li and
Yunming Ye, Xiaofei Xu, Kuai Dai

Keywords Paper

Cross-target Detection, Stance detection, knowledge transfer, stance classifier

0

0

0

0

11:57

16/11/2020

Learning to Represent Image and Text with Denotation Graph

Bowen Zhang, Hexiang Hu, Vihan Jain and
Eugene Ie, Fei Sha

Keywords Paper

cross-modal retrieval, referring expression, compositional recognition, pre-training

0

0

0

0

10:59

14/06/2020

Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks

Fengda Zhu, Yi Zhu, Xiaojun Chang, Xiaodan Liang

Keywords Paper

computer vision, vision language navigation, reinforcement learning

0

0

0

0

4:25

02/02/2021

Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines

Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi and
Pushkar Shukla, Sadhana Kumaravel, Gerald Tesauro, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell

Keywords Paper

0

0

0

0

17:44

04/07/2020

Exploiting the Syntax-Model Consistency for Neural Relation Extraction

Amir Pouran Ben Veyseh, Franck Dernoncourt, Dejing Dou, Thien Huu Nguyen

Keywords Paper

Neural Extraction, Relation Extraction, RE, syntactic injection

0

0

0

0

11:03

16/11/2020

Vokenization: Improving Language Understanding with Contextualized, Visual-Grounded Supervision

Hao Tan, Mohit Bansal

Keywords Paper

speaking, writing, text-only self-supervision, pure-language tasks

0

0

0

0

11:59

19/04/2021

TrNews: Heterogeneous user-interest transfer learning for news recommendation

Guangneng Hu, Qiang Yang

Keywords Paper

0

0

0

0

11:51

04/07/2020

Shaping Visual Representations with Language for Few-Shot Classification

Jesse Mu, Percy Liang, Noah Goodman

Keywords Paper

Few-Shot Classification, human learning, supervision, machine models

0

0

0

0

6:59

03/05/2021

Rethinking Positional Encoding in Language Pre-training

Guolin Ke, Di He, Tie-Yan Liu

Keywords Paper

Natural Language Processing, Pre-training

0

0

0

0

4:49

04/07/2020

BabyWalk: Going Farther in Vision-and-Language Navigation by Taking Baby Steps

Wang Zhu, Hexiang Hu, Jiacheng Chen and
Zhiwei Deng, Vihan Jain, Eugene Ie, Fei Sha

Keywords Paper

Vision-and-Language Navigation, vision-and-language VLN, VLN, navigation tasks

0

0

0

0

11:25

14/06/2020

Vision-Dialog Navigation by Exploring Cross-Modal Memory

Yi Zhu, Fengda Zhu, Zhaohuan Zhan and
Bingqian Lin, Jianbin Jiao, Xiaojun Chang, Xiaodan Liang

Keywords Paper

vision-dialog navigation, cross-modal reasoning, memory network.

0

0

0

0

1:04

14/06/2020

Cross-Domain Face Presentation Attack Detection via Multi-Domain Disentangled Representation Learning

Guoqing Wang, Hu Han, Shiguang Shan, Xilin Chen

Keywords Paper

face presentation attack detection, face anti-spoofing, cross-domain, disentangled representation learning, multi-domain learning.

0

0

0

0

1:01

02/02/2021

Do Response Selection Models Really Know What’s Next? Utterance Manipulation Strategies for Multi-turn Response Selection

Taesun Whang, Dongyub Lee, Dongsuk Oh and
Chanhee Lee, Kijong Han, Dong-hun Lee, Saebyeok Lee

Keywords Paper

0

0

0

0

17:37

04/07/2020

Cross-Modality Relevance for Reasoning on Language and Vision

Chen Zheng, Quan Guo, Parisa Kordjamshidi

Keywords Paper

Cross-Modality Relevance, Language Vision, visual answering, VQA

0

0

0

0

10:59