Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks

19/08/2021

Look Wide and Interpret Twice: Improving Performance on Interactive Instruction-following Tasks

Van-Quang Nguyen, Masanori Suganuma, Takayuki Okatani

Keywords: Computer Vision, Language and Vision

Abstract Paper Similar Papers

Abstract: There is a growing interest in the community in making an embodied AI agent perform a complicated task while interacting with an environment following natural language directives. Recent studies have tackled the problem using ALFRED, a well-designed dataset for the task, but achieved only very low accuracy. This paper proposes a new method, which outperforms the previous methods by a large margin. It is based on a combination of several new ideas. One is a two-stage interpretation of the provided instructions. The method first selects and interprets an instruction without using visual information, yielding a tentative action sequence prediction. It then integrates the prediction with the visual information etc., yielding the final prediction of an action and an object. As the object's class to interact is identified in the first stage, it can accurately select the correct object from the input image. Moreover, our method considers multiple egocentric views of the environment and extracts essential information by applying hierarchical attention conditioned on the current instruction. This contributes to the accurate prediction of actions for navigation. A preliminary version of the method won the ALFRED Challenge 2020. The current version achieves the unseen environment's success rate of 4.45% with a single view, which is further improved to 8.37% with multiple views.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at IJCAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

03/05/2021

The role of Disentanglement in Generalisation

Milton Montero, Casimir JH Ludwig, Rui Ponte Costa and
Gaurav Malhotra, Jeffrey Bowers

Keywords Paper

generalisation, compositional generalization, generative models, compositionality, variational autoencoders, disentanglement

0

0

0

0

4:16

12/07/2020

Generalization to New Actions in Reinforcement Learning

Ayush Jain, Andrew Szot, Joseph Lim

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:01

14/09/2020

A Generic and Model-Agnostic Exemplar Synthetization Framework for Explainable AI

Antonio Barbalau, Adrian Cosma, Radu Tudor Ionescu, Marius Popescu

Keywords Paper

explainable ai, black-box, generative modelling, evolutionary algorithm, prototype synthetization, exemplar generation

0

0

0

0

10:08

19/10/2020

ALEX: Active learning based enhancement of a classification model’s EXplainability

Ishani Mondal, Debasis Ganguly

Keywords Paper

image classification, model interpretability, active learning

0

0

0

0

5:00

06/12/2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Weili Nie, Zhiding Yu, Lei Mao and
Ankit Patel, Yuke Zhu, Anima Anandkumar

Keywords Paper

0

0

0

0

3:23

06/12/2021

Searching Parameterized AP Loss for Object Detection

Tao Chenxin, Zizhang Li, Xizhou Zhu and
Gao Huang, Yong Liu, jifeng dai

Keywords Paper

machine learning, vision

0

0

0

0

6:13

06/12/2020

Unsupervised Learning of Object Landmarks via Self-Training Correspondence

Dimitrios Mallis, Enrique Sanchez, Matthew Bell, Georgios Tzimiropoulos

Keywords Paper

Applications -> Privacy, Anonymity, and Security, Algorithms -> Adversarial Learning

0

0

0

0

3:16

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

02/02/2021

Inverse Reinforcement Learning with Natural Language Goals

Li Zhou, Kevin Small

Keywords Paper

0

0

0

0

19:59

16/11/2020

Robust Policies via Mid-Level Visual Representations: An Experimental Study in Manipulation and Navigation

Bryan Chen, Alexander Sax, Francis Lewis and
Iro Armeni, Silvio Savarese, Amir Zamir, Jitendra Malik, Lerrel Pinto

Keywords Paper

0

0

0

0

5:06

16/11/2020

Few-Shot Complex Knowledge Base Question Answering via Meta Reinforcement Learning

Yuncheng Hua, Yuan-Fang Li, Gholamreza Haffari and
Guilin Qi, Tongtong Wu

Keywords Paper

program induction, meta-training, cqa, neural approach

0

0

0

0

12:41

22/09/2020

Making neural networks interpretable with attribution: Application to implicit signals prediction

Darius Afchar, Romain Hennequin

Keywords Paper

Implicit Recommender System, Interpretable machine learning

0

0

0

0

2:28

03/05/2021

Task-Agnostic Morphology Evolution

Donald Hejna III, Pieter Abbeel, Lerrel Pinto

Keywords Paper

evolution, morphology, empowerment, unsupervised, information theory

0

0

0

0

3:59

02/02/2021

Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition

Siteng Huang, Min Zhang, Yachen Kang, Donglin Wang

Keywords Paper

0

0

0

0

17:04

05/01/2021

Deep Template-Based Object Instance Detection

Jean-Philippe Mercier, Mathieu Garon, Philippe Giguere, Jean-Francois Lalonde

Keywords Paper

0

0

0

0

5:01

05/01/2021

Few-Shot Learning via Feature Hallucination With Variational Inference

Qinxuan Luo, Lingfeng Wang, Jingguo Lv and
Shiming Xiang, Chunhong Pan

Keywords Paper

0

0

0

0

4:56

06/12/2021

SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection

Muhammad Akhtar Munir, Muhammad Haris Khan, M. Sarfraz, Mohsen Ali

Keywords Paper

deep learning, vision, domain adaptation

0

0

0

0

7:50

06/12/2021

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

Lin Guan, Mudit Verma, Suna (Sihang) Guo and
Ruohan Zhang, Subbarao Kambhampati

Keywords Paper

reinforcement learning and planning, machine learning

0

0

0

0

13:41

18/07/2021

Data-efficient Hindsight Off-policy Option Learning

Markus Wulfmeier, Dushyant Rao, Roland Hafner and
Thomas Lampe, Abbas Abdolmaleki, Tim Hertweck, Michael Neunert, Dhruva Tirumala Bukkapatnam, Noah Siegel, Nicolas Heess, Martin Riedmiller

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:09

03/05/2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou and
Albert Yu, Nicholas Rhinehart, Sergey Levine

Keywords Paper

reinforcement learning, imitation learning

0

0

0

0

14:21

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate and
Tom Melham, Daniel Kroening

Keywords Paper

0

0

0

0

15:45

29/06/2020

Embedding java classes with Code2vec: Improvements from variable obfuscation

Rhys Compton, Eibe Frank, Panos Patros, Abigail Koay

Keywords Paper

code2vec, machine learning, code obfuscation, source code, neural networks

0

0

0

0

14:20

03/05/2021

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

Ossama Ahmed, Frederik Träuble, Anirudh Goyal and
Alexander Neitz, Manuel Wuthrich, Yoshua Bengio, Bernhard Schoelkopf, Stefan Bauer

Keywords Paper

reinforcement learning, transfer learning, robotics, domain adaptation, generalization, causality, sim2real transfer

0

0

0

0

5:03

18/07/2021

Backdoor Scanning for Deep Neural Networks through K-Arm Optimization

Guangyu Shen, Yingqi Liu, Guanhong Tao and
Shengwei An, Qiuling Xu, Siyuan Cheng, Shiqing Ma, Xiangyu Zhang

Keywords Paper

Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

5:12

03/05/2021

Grounded Language Learning Fast and Slow

Felix Hill, Olivier Tieleman, Tamara von Glehn and
Nathaniel Wong, Hamza Merzic, Stephen Clark

Keywords Paper

memory, meta-learning, word-learning, grounding, fast-mapping, language, cognition

0

0

0

0

11:44

07/09/2020

Unsupervised Domain Adaptation for Spatio-Temporal Action Localization

Nakul Agarwal, Yi-Ting Chen, Behzad Dariush, Ming-Hsuan Yang

Keywords Paper

Spatio-Temporal Action Localization, Unsupervised Domain Adaptation, Adversarial Learning, Video Analysis, Deep Learning

0

0

0

0

9:28

12/07/2020

Learning and Evaluating Contextual Embedding of Source Code

Aditya Kanade, Petros Maniatis, Gogul Balakrishnan, Kensen Shi

Keywords Paper

Representation Learning

0

0

0

0

12:51

05/01/2021

Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization

Sadbhavana Babar, Sukhendu Das

Keywords Paper

0

0

0

0

5:01

19/08/2021

Meta-Reinforcement Learning by Tracking Task Non-stationarity

Riccardo Poiani, Andrea Tirinzoni, Marcello Restelli

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Reinforcement Learning, Transfer, Adaptation, Multi-task Learning

0

0

0

0

13:23

16/11/2020

Chaining Behaviors from Data with Model-Free Reinforcement Learning

Avi Singh, Albert Yu, Jonathan Yang and
Jesse Zhang, Aviral Kumar, Sergey Levine

Keywords Paper

0

0

0

0

5:01

06/12/2020

Restoring Negative Information in Few-Shot Object Detection

Yukuan Yang, Fangyun Wei, Miaojing Shi, Guoqi Li

Keywords Paper

0

0

0

0

3:24

12/07/2020

Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills

Victor Campos, Alexander Trott, Caiming Xiong and
Richard Socher, Xavier Giro-i-Nieto, Jordi Torres

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:28

16/11/2020

Learning a natural-language to LTL executable semantic parser for grounded robotics

Christopher Wang, Candace Ross, Yen-Ling Kuo and
Boris Katz, Andrei Barbu

Keywords Paper

0

0

0

0

5:01

03/05/2021

Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning

Valerie Chen, Abhinav Gupta, Kenny Marino

Keywords Paper

0

0

0

0

5:04

04/07/2020

TAG : Type Auxiliary Guiding for Code Comment Generation

Ruichu Cai, Zhihao Liang, Boyan Xu and
zijian li, Yuexing Hao, Yao Chen

Keywords Paper

Code Generation, code task, adaptive code, TAG

0

0

0

0

11:22

03/05/2021

Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples

Ziang Yan, Yiwen Guo, Jian Liang, Changshui Zhang

Keywords Paper

hard-label attack, adversarial attack, black-box attack, reinforcement learning

0

0

0

0

4:55

06/12/2021

Automatic Data Augmentation for Generalization in Reinforcement Learning

Roberta Raileanu, Maxwell Goldstein, Denis Yarats and
Ilya Kostrikov, Rob Fergus

Keywords Paper

reinforcement learning and planning, machine learning

0

0

0

0

14:26

06/12/2021

Grounding Spatio-Temporal Language with Transformers

Tristan Karch, Laetitia Teodorescu, Katja Hofmann and
Clément Moulin-Frier, Pierre-Yves Oudeyer

Keywords Paper

transformers

0

0

0

0

7:25

03/05/2021

Generating Adversarial Computer Programs using Optimized Obfuscations

Shashank Srikant, Sijia Liu, Tamara Mitrovska and
Shiyu Chang, Quanfu Fan, Gaoyuan Zhang, Una-May O'Reilly

Keywords Paper

Models for code, Differentiable program generator, Combinatorial optimization, Program obfuscation, Adversarial computer programs, Machine Learning (ML) for Programming Languages (PL)/Software Engineering (SE)

0

0

0

0

6:27

07/09/2020

BriNet: Towards Bridging the Intra-class and Inter-class Gaps in One-Shot Segmentation

Xianghui Yang, Bairun Wang, Xinchi Zhou and
Kaige Chen, Shuai Yi, Wanli Ouyang, Luping Zhou

Keywords Paper

Few-shot Semantic Segmentation, Few-shot learning, Semantic Segmentation

0

0

0

0

8:26