Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning

18/07/2021

Grounding Language to Entities and Dynamics for Generalization in Reinforcement Learning

Austin W. Hanjie, Victor Zhong, Karthik Narasimhan

Keywords: Reinforcement Learning and Planning, Deep RL

Abstract Paper Similar Papers

Abstract: We investigate the use of natural language to drive the generalization of control policies and introduce the new multi-task environment Messenger with free-form text manuals describing the environment dynamics. Unlike previous work, Messenger does not assume prior knowledge connecting text and state observations — the control policy must simultaneously ground the game manual to entity symbols and dynamics in the environment. We develop a new model, EMMA (Entity Mapper with Multi-modal Attention) which uses an entity-conditioned attention module that allows for selective focus over relevant descriptions in the manual for each entity in the environment. EMMA is end-to-end differentiable and learns a latent grounding of entities and dynamics from text to observations using only environment rewards. EMMA achieves successful zero-shot generalization to unseen games with new dynamics, obtaining a 40% higher win rate compared to multiple baselines. However, win rate on the hardest stage of Messenger remains low (10%), demonstrating the need for additional work in this direction.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

04/07/2020

Towards Open Domain Event Trigger Identification using Adversarial Domain Adaptation

Aakanksha Naik, Carolyn Rose

Keywords Paper

Open Identification, trigger identification, Adversarial Adaptation, supervised models

0

0

0

0

7:00

16/11/2020

ALICE: Active Learning with Contrastive Natural Language Explanations

Weixin Liang, James Zou, Zhou Yu

Keywords Paper

active learning, data learning, learning, visual tasks

0

0

0

0

10:26

26/04/2020

Graph Constrained Reinforcement Learning for Natural Language Action Spaces

Prithviraj Ammanabrolu, Matthew Hausknecht

Keywords Paper

natural language generation, deep reinforcement learning, knowledge graphs, interactive fiction

0

0

0

0

5:13

22/11/2021

Unsupervised Domain Adaptation of Black-Box Source Models

Haojian Zhang, Yabin Zhang, Kui Jia, Lei Zhang

Keywords Paper

domain adaptation, black box, unsupervised, noisy label, iterative

0

0

0

0

2:57

16/11/2020

Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning

Xiaoxiao Guo, Mo Yu, Yupeng Gao and
Chuang Gan, Murray Campbell, Shiyu Chang

Keywords Paper

language techniques, language challenges, action generation, if solving

0

0

0

0

7:31

19/04/2021

Jointly improving language understanding and generation with quality-weighted weak supervision of automatic labeling

Ernie Chang, Vera Demberg, Alex Marin

Keywords Paper

0

0

0

0

7:28

19/08/2021

Correlation-Guided Representation for Multi-Label Text Classification

Qian-Wen Zhang, Ximing Zhang, Zhao Yan and
Ruifang Liu, Yunbo Cao, Min-Ling Zhang

Keywords Paper

Machine Learning, Multi-instance; Multi-label; Multi-view learning, Classification, Text Classification

0

0

0

0

11:13

08/12/2020

Detecting Urgency Status of Crisis Tweets: A Transfer Learning Approach for Low Resource Languages

Efsun Sarioglu Kayi, Linyong Nan, Bohan Qu and
Mona Diab, Kathleen McKeown

Keywords Paper

0

0

0

0

14:37

06/12/2021

Safe Reinforcement Learning with Natural Language Constraints

Tsung-Yen Yang, Michael Y Hu, Yinlam Chow and
Peter J Ramadge, Karthik Narasimhan

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:32

16/11/2020

Bootstrapped Q-learning with Context Relevant Observation Pruning to Generalize in Text-based Games

Subhajit Chaudhury, Daiki Kimura, Kartik Talamadupula and
Michiaki Tatsubori, Asim Munawar, Ryuki Tachibana

Keywords Paper

irrelevant removal, generalization, observation pruning, reinforcement methods

0

0

0

0

7:02

19/08/2021

A Sequence-to-Set Network for Nested Named Entity Recognition

Zeqi Tan, Yongliang Shen, Shuai Zhang and
Weiming Lu, Yueting Zhuang

Keywords Paper

Natural Language Processing, Information Extraction, Named Entities

0

0

0

0

10:38

02/02/2021

Token-Aware Virtual Adversarial Training in Natural Language Understanding

Linyang Li, Xipeng Qiu

Keywords Paper

0

0

0

0

12:49

04/07/2020

Adversarial and Domain-Aware BERT for Cross-Domain Sentiment Analysis

Chunning Du, Haifeng Sun, Jingyu Wang and
Qi Qi, Jianxin Liao

Keywords Paper

Cross-Domain Analysis, Cross-domain classification, unsupervised adaptation, transferring knowledge

0

0

0

0

10:54

06/12/2021

Curriculum Offline Imitating Learning

Minghuan Liu, Hanye Zhao, Zhengyu Yang and
Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

Keywords Paper

reinforcement learning and planning

0

0

0

0

14:28

16/11/2020

An Empirical Study on Large-Scale Multi-Label Text Classification Including Few and Zero-Shot Labels

Ilias Chalkidis, Manos Fergadiotis, Sotiris Kotitsas and
Prodromos Malakasiotis, Nikolaos Aletras, Ion Androutsopoulos

Keywords Paper

flat classification, hierarchical approaches, zero-shot learning, few learning

0

0

0

0

12:21

16/11/2020

Simple Data Augmentation with the Mask Token Improves Domain Adaptation for Dialog Act Tagging

Semih Yavuz, Kazuma Hashimoto, Wenhao Liu and
Nitish Shirish Keskar, Richard Socher, Caiming Xiong

Keywords Paper

da tagging, da, da taggers, maskaugment

0

0

0

0

6:55

16/11/2020

Discriminative Nearest Neighbor Few-Shot Intent Detection by Transferring Natural Language Inference

Jianguo Zhang, Kazuma Hashimoto, Wenhao Liu and
Chien-Sheng Wu, Yao Wan, Philip Yu, Richard Socher, Caiming Xiong

Keywords Paper

intent detection, detecting intents, oos detection, large-scale task

0

0

0

0

11:43

02/02/2021

A Free Lunch for Unsupervised Domain Adaptive Object Detection without Source Data

Xianfeng Li, Weijie Chen, Di Xie and
Shicai Yang, Peng Yuan, Shiliang Pu, Yueting Zhuang

Keywords Paper

0

0

0

0

19:06

16/11/2020

Effective Unsupervised Domain Adaptation with Adversarially Trained Language Models

Thuy-Trang Vu, Dinh Phung, Gholamreza Haffari

Keywords Paper

combinatorial problem, unsupervised tasks, named recognition, broad-coverage models

0

0

0

0

11:57

03/05/2021

Behavioral Cloning from Noisy Demonstrations

Fumihiro Sasaki, Ryota Yamashina

Keywords Paper

Inverse Reinforcement Learning, Noisy Demonstrations, Imitation Learning

0

0

0

0

9:36

16/11/2020

Keep CALM and Explore: Language Models for Action Generation in Text-based Games

Shunyu Yao, Rohan Rao, Matthew Hausknecht, Karthik Narasimhan

Keywords Paper

autonomous agents, contextual model, calm, language models

0

0

0

0

12:09

19/10/2020

Falcon 2.0: An entity and relation linking tool over wikidata

Ahmad Sakor, Kuldeep Singh, Anery Patel, Maria-Esther Vidal

Keywords Paper

background knowledge, entity linking, nlp, dbpedia, english morphology, relation linking, wikidata

0

0

0

0

10:00

04/07/2020

Programming in Natural Language with fuSE: Synthesizing Methods from Spoken Utterances Using Deep Natural Language Understanding

Sebastian Weigelt, Vanessa Steurer, Tobias Hey, Walter F. Tichy

Keywords Paper

intelligent systems, information retrieval, Deep Understanding, end-user programming

0

0

0

0

11:41

16/11/2020

Joint Constrained Learning for Event-Event Relation Extraction

Haoyu Wang, Muhao Chen, Hongming Zhang, Dan Roth

Keywords Paper

understanding language, temporal extraction, event construction, inducing complexes

0

0

0

0

12:04

06/12/2021

Fast Pure Exploration via Frank-Wolfe

Po-An Wang, Ruo-Chun Tzeng, Alexandre Proutiere

Keywords Paper

theory, optimization, reinforcement learning and planning, bandits

0

0

0

0

9:36

06/12/2021

MST: Masked Self-Supervised Transformer for Visual Representation

Zhaowen Li, Zhiyang Chen, Fan Yang and
Wei Li, Yousong Zhu, Chaoyang Zhao, Rui Deng, Liwei Wu, Rui Zhao, Ming Tang, Jinqiao Wang

Keywords Paper

self-supervised learning, transformers, vision, language

0

0

0

0

7:13

08/12/2020

Specializing Unsupervised Pretraining Models for Word-Level Semantic Similarity

Anne Lauscher, Ivan Vulić, Edoardo Maria Ponti and
Anna Korhonen, Goran Glavaš

Keywords Paper

0

0

0

0

13:01

16/11/2020

Unsupervised Natural Language Inference via Decoupled Multimodal Contrastive Learning

Wanyun Cui, Guangyu Zheng, Wei Wang

Keywords Paper

natural problem, plain inference, task-agnostic pretraining, multimodal learning

0

0

0

0

11:25

03/05/2021

FOCAL: Efficient Fully-Offline Meta-Reinforcement Learning via Distance Metric Learning and Behavior Regularization

Lanqing Li, Rui Yang, Dijun Luo

Keywords Paper

distance metric learning, offline/batch reinforcement learning, meta-reinforcement learning, contrastive learning, multi-task reinforcement learning

1

0

0

0

6:21

19/04/2021

Zero-shot generalization in dialog state tracking through generative question answering

Shuyang Li, Jin Cao, Mukund Sridhar and
Henghui Zhu, Shang-Wen Li, Wael Hamza, Julian McAuley

Keywords Paper

0

0

0

1

11:13

19/08/2021

Enhance Image as You Like with Unpaired Learning

Xiaopeng Sun, Muxingzi Li, Tianyu He, Lubin Fan

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Applications of Unsupervised Learning

0

0

0

0

11:20

14/06/2020

Noise-Aware Fully Webly Supervised Object Detection

Yunhang Shen, Rongrong Ji, Zhiwei Chen and
Xiaopeng Hong, Feng Zheng, Jianzhuang Liu, Mingliang Xu, Qi Tian

Keywords Paper

webly supervised object detection, weakly supervised object detection, object detection

0

0

0

0

1:01

16/11/2020

Named Entity Recognition Only from Word Embeddings

Ying Luo, Hai Zhao, Junlang Zhan

Keywords Paper

named recognition, entity detection, type prediction, deep models

0

0

0

0

9:54

04/07/2020

SenseBERT: Driving Some Sense into BERT

Yoav Levine, Barak Lenz, Or Dagan and
Ori Ram, Dan Padnos, Or Sharir, Shai Shalev-Shwartz, Amnon Shashua, Yoav Shoham

Keywords Paper

natural understanding, lexical understanding, SemEval Disambiguation, task

0

0

0

0

10:53

14/06/2020

Phase Consistent Ecological Domain Adaptation

Yanchao Yang, Dong Lao, Ganesh Sundaramoorthi, Stefano Soatto

Keywords Paper

domain adaptation, unsupervised, semantic consistency, semantic segmentation, ecological statistics, scene compatibility, phase consistency, image transformation, synthetic to real, self-supervised

0

0

0

0

1:01

06/12/2021

Towards Context-Agnostic Learning Using Synthetic Data

Charles Jin, Martin Rinard

Keywords Paper

machine learning, vision

0

0

0

0

14:20

06/12/2021

Object-Aware Regularization for Addressing Causal Confusion in Imitation Learning

Jongjin Park, Younggyo Seo, Chang Liu and
Li Zhao, Tao Qin, Jinwoo Shin, Tie-Yan Liu

Keywords Paper

reinforcement learning and planning, causality

0

0

0

0

12:12

06/12/2021

Mitigating Covariate Shift in Imitation Learning via Offline Data With Partial Coverage

Jonathan Chang, Masatoshi Uehara, Dhruv Sreenivas and
Rahul Kidambi, Wen Sun

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

8:37

06/12/2021

On Model Calibration for Long-Tailed Object Detection and Instance Segmentation

Tai-Yu Pan, Cheng Zhang, Yandong Li and
Hexiang Hu, Dong Xuan, Soravit Changpinyo, Boqing Gong, Wei-Lun Chao

Keywords Paper

machine learning, vision

0

0

0

0

11:49

06/12/2021

MobILE: Model-Based Imitation Learning From Observation Alone

Rahul Kidambi, Jonathan Chang, Wen Sun

Keywords Paper

theory, reinforcement learning and planning, bandits

0

0

0

0

14:38