Latent World Models For Intrinsically Motivated Exploration

06/12/2020

Aleksandr Ermolov, Nicu Sebe

Abstract: In this work we consider partially observable environments with sparse rewards. We present a self-supervised representation learning method for image-based observations, which arranges embeddings so that they respect the temporal distance between observations. This representation is empirically robust to stochasticity and suitable for novelty detection from the error of a predictive forward model. We use both episodic and life-long uncertainty to guide exploration. We propose to estimate the missing information about the environment with a world model that operates in the learned latent space. To motivate the method, we analyse the exploration problem in a tabular Partially Observable Labyrinth. We demonstrate the method on image-based hard exploration environments from the Atari benchmark and report a significant improvement with respect to prior work. The source code of the method and all the experiments is available at https://github.com/htdt/lwm.

The talk and the respective paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate, Tom Melham, Daniel Kroening

15:45

06/12/2021

Dynamic Bottleneck for Robust Self-Supervised Exploration

Chenjia Bai, Lingxiao Wang, Lei Han, Animesh Garg, Jianye Hao, Peng Liu, Zhaoran Wang

Keywords: reinforcement learning and planning

6:17

26/04/2020

Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations

Xiao Ma, Peter Karkus, David Hsu, Wee Sun Lee, Nan Ye

Keywords: Reinforcement Learning, Partial Observability, Differentiable Particle Filtering

5:08

03/05/2021

Data-Efficient Reinforcement Learning with Self-Predictive Representations

Max Schwarzer, Ankesh Anand, Rishab Goel, R Devon Hjelm, Aaron Courville, Philip Bachman

Keywords: Representation Learning, Self-Supervised Learning, Reinforcement Learning, Sample Efficiency

10:04

18/07/2021

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

Scott Fujimoto, David Meger, Doina Precup

Keywords: Reinforcement Learning and Planning, Deep RL

3:50

14/06/2020

Background Data Resampling for Outlier-Aware Classification

Yi Li, Nuno Vasconcelos

Keywords: out-of-distribution detection, anomaly detection, dataset resampling

1:00

26/04/2020

Disagreement-Regularized Imitation Learning

Kiante Brantley, Wen Sun, Mikael Henaff

Keywords: imitation learning, reinforcement learning, uncertainty

4:53

06/12/2020

Novelty Search in Representational Space for Sample Efficient Exploration

David Tao, Vincent Francois-Lavet, Joelle Pineau

3:04

06/12/2021

Rebooting ACGAN: Auxiliary Classifier GANs with Stable Training

Minguk Kang, Woohyeon Shim, Minsu Cho, Jaesik Park

Keywords: generative model

9:03

03/05/2021

Unsupervised Object Keypoint Learning using Local Spatial Predictability

Anand Gopalakrishnan, Sjoerd van Steenkiste, Jürgen Schmidhuber

Keywords: unsupervised representation learning, visual saliency, object-keypoint representations

9:45

03/05/2021

Learning and Evaluating Representations for Deep One-Class Classification

Kihyuk Sohn, Chun-Liang Li, Jinsung Yoon, Minho Jin, Tomas Pfister

Keywords: self-supervised learning, deep one-class classification

5:13

02/02/2021

Augmenting Policy Learning with Routines Discovered from a Single Demonstration

Zelin Zhao, Chuang Gan, Jiajun Wu, Xiaoxiao Guo, Joshua B. Tenenbaum

14:48

03/05/2021

Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments

Daochen Zha, Wenye Ma, Lei Yuan, Xia Hu, Ji Liu

Keywords: Exploration, Reinforcement Learning, Self-Imitation, Generalization of Reinforcement Learning

5:10

14/06/2020

Adaptive Subspaces for Few-Shot Learning

Christian Simon, Piotr Koniusz, Richard Nock, Mehrtash Harandi

Keywords: subspace, few, shot, meta, learning, classification

1:01

06/12/2020

Restoring Negative Information in Few-Shot Object Detection

Yukuan Yang, Fangyun Wei, Miaojing Shi, Guoqi Li

3:24

06/12/2020

RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning

Caglar Gulcehre, Ziyu Wang, Alexander Novikov, Thomas Paine, Sergio Gómez, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matthew Hoffman, Nicolas Heess, Nando de Freitas

3:25

02/02/2021

Visualization of Supervised and Self-Supervised Neural Networks via Attribution Guided Factorization

Shir Gur, Ameen Ali, Lior Wolf

14:14

06/12/2020

Grasp Proposal Networks: An End-to-End Solution for Visual Learning of Robotic Grasps

Chaozheng Wu, Jian Chen, Qiaoyu Cao, Jianchi Zhang, Yunxin Tai, Lin Sun, Kui Jia

3:19

18/07/2021

Prior Image-Constrained Reconstruction using Style-Based Generative Models

Varun A. Kelkar, Mark Anastasio

Keywords: Algorithms, Kernel Methods, Theory, Frequentist Statistics, Algorithms, Sparsity and Compressed Sensing

5:56

18/07/2021

Emphatic Algorithms for Deep Reinforcement Learning

Ray Jiang, Tom Zahavy, Zhongwen Xu, Adam White, Matteo Hessel, Charles Blundell, Hado van Hasselt

Keywords: Reinforcement Learning and Planning, Deep RL

5:21

06/12/2021

On Calibration and Out-of-Domain Generalization

Yoav Wald, Amir Feder, Daniel Greenfeld, Uri Shalit

Keywords: machine learning, domain adaptation, causality

11:00

14/06/2020

Density-Based Clustering for 3D Object Detection in Point Clouds

Syeda Mariam Ahmed, Chee Meng Chew

Keywords: 3d object detection, edge-aware pointnet, instance segmentation, unsupervised clustering, cascaded modules, semantic segmentation, amodal bounding box detection

0:51

14/06/2020

Multi-Path Learning for Object Pose Estimation Across Domains

Martin Sundermeyer, Maximilian Durner, En Yen Puang, Zoltan-Csaba Marton, Narunas Vaskevicius, Kai O. Arras, Rudolph Triebel

Keywords: object pose estimation, encodings, multi object, synthetic data, symmetries, autoencoder, embedding, 6d object detection, t-less, relative pose estimation

1:01

14/06/2020

Multi-Scale Interactive Network for Salient Object Detection

Youwei Pang, Xiaoqi Zhao, Lihe Zhang, Huchuan Lu

Keywords: saliency detection, salient object detection, feature interaction strategy, scale-insensitive loss, multi-scale features, multi-level features, fully convolutional network, deep learning

1:01

06/12/2020

Unsupervised Learning of Object Landmarks via Self-Training Correspondence

Dimitrios Mallis, Enrique Sanchez, Matthew Bell, Georgios Tzimiropoulos

Keywords: Applications -> Privacy, Anonymity, and Security, Algorithms -> Adversarial Learning

3:16

02/02/2021

GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal Babbling

Rohan Chitnis, Tom Silver, Joshua B. Tenenbaum, Leslie Pack Kaelbling, Tomás Lozano-Pérez

19:55

06/12/2020

Stationary Activations for Uncertainty Calibration in Deep Learning

Lassi Meronen, Christabella Irwanto, Arno Solin

3:15

07/09/2020

POMP: Pomcp-based Online Motion Planning for active visual search in indoor environments

Yiming Wang, Francesco Giuliari, Riccardo Berra, Alberto Castellini, Alessio Del Bue, Alessandro Farinelli, Marco Cristani, Francesco Setti

Keywords: Object Recognition, Active Visual Search, Partially Observable Markov Decision Process, Monte Carlo Tree Search

5:10

06/12/2020

Munchausen Reinforcement Learning

Nino Vieillard, Olivier Pietquin, Matthieu Geist

3:19

18/07/2021

Self-Paced Context Evaluation for Contextual Reinforcement Learning

Theresa Eimer, André Biedenkapp, Frank Hutter, Marius Lindauer

Keywords: Reinforcement Learning and Planning

5:25

19/08/2021

Cross-Domain Few-Shot Classification via Adversarial Task Augmentation

Haoqing Wang, Zhi-Hong Deng

Keywords: Computer Vision, Recognition, Adversarial Machine Learning, Deep Learning

10:39

05/01/2021

Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization

Sadbhavana Babar, Sukhendu Das

5:01

18/07/2021

Understanding Invariance via Feedforward Inversion of Discriminatively Trained Classifiers

Piotr Teterwak, Chiyuan Zhang, Dilip Krishnan, Mike Mozer

Keywords: Deep Learning

4:52

06/12/2020

Fine-Grained Dynamic Head for Object Detection

Lin Song, Yanwei Li, Zhengkai Jiang, Zeming Li, Hongbin Sun, Jian Sun, Nanning Zheng

Keywords: Applications -> Computational Biology and Bioinformatics; Applications -> Health; Applications -> Time Series Analysis; Neurosc, Neuroscience and Cognitive Science -> Brain Imaging

3:19

14/06/2020

Learning From Noisy Anchors for One-Stage Object Detection

Hengduo Li, Zuxuan Wu, Chen Zhu, Caiming Xiong, Richard Socher, Larry S. Davis

Keywords: object detection, noisy label, ground-truth assignment, anchor, one-stage detector, training, deep learning

1:01

26/04/2020

Never Give Up: Learning Directed Exploration Strategies

Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi, Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martin Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell

Keywords: deep reinforcement learning, exploration, intrinsic motivation

5:30

03/05/2021

Evolving Reinforcement Learning Algorithms

John Co-Reyes, Yingjie Miao, Daiyi Peng, Esteban Real, Quoc V Le, Sergey Levine, Honglak Lee, Aleksandra Faust

Keywords: reinforcement learning, genetic programming, meta-learning, evolutionary algorithms

13:59

06/12/2021

Towards Context-Agnostic Learning Using Synthetic Data

Charles Jin, Martin Rinard

Keywords: machine learning, vision

14:20

05/01/2021

Proposal Learning for Semi-Supervised Object Detection

Peng Tang, Chetan Ramaiah, Yan Wang, Ran Xu, Caiming Xiong

4:51

06/12/2020

Non-Crossing Quantile Regression for Distributional Reinforcement Learning

Fan Zhou, Jianing Wang, Xingdong Feng

3:11