Learning Object-conditioned Exploration using Distributed Soft Actor Critic

Abstract: Object navigation is defined as navigating to an object of a given label in a complex, unexplored environment. In its general form, this problem poses several challenges for Robotics: semantic exploration of unknown environments in search of an object and low-level control. In this work we study object-guided exploration and low-level control, and present an end-to-end trained navigation policy achieving a success rate of 0.68 and SPL of 0.58 on unseen, visually complex scans of real homes. We propose a highly scalable implementation of an off-policy Reinforcement Learning algorithm, distributed Soft Actor Critic, which allows the system to utilize 98M experience steps in 24 hours on 8 GPUs. Our system learns to control a differential drive mobile base in simulation from a stack of high dimensional observations commonly used on robotic platforms. The learned policy is capable of object-guided exploratory behaviors and low-level control learned from pure experiences in realistic environments.

03/05/2021

Andy Zeng, Pete Florence, Jonathan Tompson and
Stefan Welker, Jonathan Chien, Maria Attarian, Travis Armstrong, Ivan Krasin, Dan Duong, Vikas Sindhwani, Johnny Lee

Keywords Paper

5:01

16/11/2020

Sim-to-Real Transfer for Vision-and-Language Navigation

Peter Anderson, Ayush Shrivastava, Joanne Truong and
Arjun Majumdar, Devi Parikh Georgia Tech &, Facebook AI Research, Dhruv Batra Georgia Tech &, Facebook AI Research, Stefan Lee

Keywords Paper

6:45

07/09/2020

POMP: Pomcp-based Online Motion Planning for active visual search in indoor environments

Yiming Wang, Francesco Giuliari, Riccardo Berra and
Alberto Castellini, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Francesco Setti

gan, game, simulation, video generation, memory, disentangle, interactive

1:01

23/08/2020

Ossama Ahmed, Frederik Träuble, Anirudh Goyal and
Alexander Neitz, Manuel Wuthrich, Yoshua Bengio, Bernhard Schoelkopf, Stefan Bauer

robotics, grasping, 6d pose, grasp pose, manipulation, dataset, pick and place, bin picking

1:01

12/08/2020

Yevgen Chebotar, Karol Hausman, Yao Lu and
Ted Xiao, Dmitry Kalashnikov, Jacob Varley, Alex Irpan, Benjamin Eysenbach, Ryan C Julian, Chelsea Finn, Sergey Levine

Samyak Datta, Oleksandr Maksymets, Judy Hoffman and
Stefan Lee, Dhruv Batra Georgia Tech &, Facebook AI Research, Devi Parikh Georgia Tech &, Facebook AI Research