16/11/2020

Auxiliary Tasks Speed Up Learning PointGoal Navigation

Joel Ye, Dhruv Batra, Erik Wijmans, Abhishek Das (Georgia Tech & Facebook AI Research)

Abstract: PointGoal Navigation is an embodied task that requires agents to navigate to a specified point in an unseen environment. Wijmans et al. showed that this task is solvable in simulation but their method is computationally prohibitive – requiring 2.5 billion frames of experience and 180 GPU-days. We develop a method to significantly improve sample efficiency in learning PointNav using self-supervised auxiliary tasks (e.g. predicting the action taken between two egocentric observations, predicting the distance between two observations from a trajectory, etc.). We find that naively combining multiple auxiliary tasks improves sample efficiency, but only provides marginal gains beyond a point. To overcome this, we use attention to combine representations from individual auxiliary tasks. Our best agent is 5.5x faster to match the performance of the previous state-of-the-art, DD-PPO, at 40M frames, and improves on DD-PPO’s performance at 40M frames by 0.16 SPL. Our code is publicly available at github.com/joel99/habitat-pointnav-aux.
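To make the auxiliary objectives and the attention-based fusion concrete, below is a minimal PyTorch sketch. It is not the authors' implementation (see the linked repository for that): the class names (InverseDynamicsHead, TemporalDistanceHead, AttentiveFusion), the single-layer heads, and the assumption that the policy's encoder yields one fixed-size embedding per step are illustrative simplifications.

```python
# Minimal sketch of the two self-supervised auxiliary tasks and the
# attention-based fusion described above. Hypothetical class names;
# not the authors' exact architecture.
import torch
import torch.nn as nn
import torch.nn.functional as F


class InverseDynamicsHead(nn.Module):
    """Predicts the action taken between two consecutive egocentric observations."""

    def __init__(self, embed_dim: int, num_actions: int):
        super().__init__()
        self.classifier = nn.Linear(2 * embed_dim, num_actions)

    def loss(self, obs_t, obs_tp1, actions):
        # obs_t, obs_tp1: (batch, embed_dim); actions: (batch,) integer labels
        logits = self.classifier(torch.cat([obs_t, obs_tp1], dim=-1))
        return F.cross_entropy(logits, actions)


class TemporalDistanceHead(nn.Module):
    """Regresses the (normalized) number of steps separating two observations."""

    def __init__(self, embed_dim: int):
        super().__init__()
        self.regressor = nn.Linear(2 * embed_dim, 1)

    def loss(self, obs_a, obs_b, gap):
        # gap: (batch,) float target, e.g. step offset divided by max offset
        pred = self.regressor(torch.cat([obs_a, obs_b], dim=-1)).squeeze(-1)
        return F.mse_loss(pred, gap)


class AttentiveFusion(nn.Module):
    """Combines per-task representations with attention instead of naive concatenation."""

    def __init__(self, embed_dim: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(embed_dim, num_heads=1, batch_first=True)
        self.query = nn.Parameter(torch.randn(1, 1, embed_dim))

    def forward(self, task_beliefs):
        # task_beliefs: (batch, num_tasks, embed_dim), one vector per auxiliary task
        q = self.query.expand(task_beliefs.size(0), -1, -1)
        fused, _ = self.attn(q, task_beliefs, task_beliefs)
        return fused.squeeze(1)  # (batch, embed_dim)
```

In a full agent of this kind, each auxiliary loss would be added to the reinforcement-learning objective with a weighting coefficient, and the fused vector would feed the actor-critic heads.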

This talk and the accompanying paper were presented at the CoRL 2020 virtual conference.
