Episodic Reinforcement Learning with Associative Memory

26/04/2020

Episodic Reinforcement Learning with Associative Memory

Guangxiang Zhu, Zichuan Lin, Guangwen Yang, Chongjie Zhang

Keywords: Deep Reinforcement Learning, Episodic Control, Episodic Memory, Associative Memory, Non-Parametric Method, Sample Efficiency

Abstract Paper Similar Papers

Abstract: Sample efficiency has been one of the major challenges for deep reinforcement learning. Non-parametric episodic control has been proposed to speed up parametric reinforcement learning by rapidly latching on previously successful policies. However, previous work on episodic reinforcement learning neglects the relationship between states and only stored the experiences as unrelated items. To improve sample efficiency of reinforcement learning, we propose a novel framework, called Episodic Reinforcement Learning with Associative Memory (ERLAM), which associates related experience trajectories to enable reasoning effective strategies. We build a graph on top of states in memory based on state transitions and develop a reverse-trajectory propagation strategy to allow rapid value propagation through the graph. We use the non-parametric associative memory as early guidance for a parametric reinforcement learning model. Results on navigation domain and Atari games show our framework achieves significantly higher sample efficiency than state-of-the-art episodic reinforcement learning models.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Efficient Learning of Discrete-Continuous Computation Graphs

David Friede, Mathias Niepert

Keywords Paper

deep learning, reinforcement learning and planning, graph learning

0

0

0

0

12:31

03/05/2021

Return-Based Contrastive Representation Learning for Reinforcement Learning

Guoqing Liu, Chuheng Zhang, Li Zhao and
Tao Qin, Jinhua Zhu, Li Jian, Nenghai Yu, Tie-Yan Liu

Keywords Paper

reinforcement learning, auxiliary task, contrastive learning, representation learning

0

0

0

0

5:20

12/07/2020

Closed Loop Neural-Symbolic Learning via Integrating Neural Perception, Grammar Parsing, and Symbolic Reasoning

Qing Li, Siyuan Huang, Yining Hong and
Yixin Chen, Ying Nian Wu, Song-Chun Zhu

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:01

06/12/2021

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Hung Le, Thommen Karimpanal George, Majid Abdolshah and
Truyen Tran, Svetha Venkatesh

Keywords Paper

reinforcement learning and planning

0

0

0

0

15:06

18/07/2021

Representation Matters: Offline Pretraining for Sequential Decision Making

Mengjiao Yang, Ofir Nachum

Keywords Paper

Reinforcement Learning and Planning

1

0

0

0

5:06

06/12/2021

Efficient Training of Retrieval Models using Negative Cache

Erik Lindgren, Sashank Reddi, Ruiqi Guo, Sanjiv Kumar

Keywords Paper

deep learning, machine learning

0

0

0

0

10:41

06/12/2020

Untangling tradeoffs between recurrence and self-attention in artificial neural networks

Giancarlo Kerg, bhargav104 Kanuparthi, Anirudh Goyal ALIAS PARTH GOYAL and
Kyle Goyette, Yoshua Bengio, Guillaume Lajoie

Keywords Paper

0

0

0

0

3:20

03/05/2021

Data-Efficient Reinforcement Learning with Self-Predictive Representations

Max Schwarzer, Ankesh Anand, Rishab Goel and
R Devon Hjelm, Aaron Courville, Philip Bachman

Keywords Paper

Representation Learning, Self-Supervised Learning, Reinforcement Learning, Sample Efficiency

0

0

0

1

10:04

06/12/2020

Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning

Younggyo Seo, Kimin Lee, Ignasi Clavera Gilaberte and
Thanard Kurutach, Jinwoo Shin, Pieter Abbeel

Keywords Paper

0

0

0

0

3:20

06/12/2021

Provable Guarantees for Self-Supervised Deep Learning with Spectral Contrastive Loss

Jeff Z. HaoChen, Colin Wei, Adrien Gaidon, Tengyu Ma

Keywords Paper

theory, deep learning, self-supervised learning, graph learning, contrastive learning, representation learning

0

0

0

0

20:43

19/08/2021

Solving Continuous Control with Episodic Memory

Igor Kuznetsov, Andrey Filchenkov

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Reinforcement Learning

1

0

0

0

12:13

18/07/2021

Accelerating Safe Reinforcement Learning with Constraint-mismatched Baseline Policies

Jimmy Yang, Justinian Rosca, Karthik Narasimhan, Peter Ramadge

Keywords Paper

Algorithms, Adversarial Learning, Applications, Computer Vision; Deep Learning, Adversarial Networks; Deep Learning, Generative Models, Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:20

02/02/2021

Self-Supervised Hypergraph Convolutional Networks for Session-based Recommendation

Xin Xia, Hongzhi Yin, Junliang Yu and
Qinyong Wang, Lizhen Cui, Xiangliang Zhang

Keywords Paper

0

0

0

0

21:04

06/12/2020

Adaptive Discretization for Model-Based Reinforcement Learning

Sean Sinclair, Tianyu Wang, Gauri Jain and
Sid Banerjee, Christina Yu

Keywords Paper

0

0

0

0

3:12

03/05/2021

Meta-Learning with Neural Tangent Kernels

Yufan Zhou, Zhenyi Wang, Jiayi Xian and
Changyou Chen, Jinhui Xu

Keywords Paper

neural tangent kernel, meta-learning

0

0

0

0

3:54

06/12/2020

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

Keywords Paper

0

0

0

0

2:42

06/12/2021

Learning Markov State Abstractions for Deep Reinforcement Learning

Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris

Keywords Paper

reinforcement learning and planning, contrastive learning, representation learning

0

0

0

0

12:31

06/12/2021

Compositional Reinforcement Learning from Logical Specifications

Kishor Jothimurugan, Suguman Bansal, Osbert Bastani, Rajeev Alur

Keywords Paper

deep learning, reinforcement learning and planning, graph learning

0

0

0

0

14:50

06/12/2020

An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay

Scott Fujimoto, David Meger, Doina Precup

Keywords Paper

0

0

0

0

2:53

06/12/2020

Reward Propagation Using Graph Convolutional Networks

Martin Klissarov, Doina Precup

Keywords Paper

0

0

0

0

3:06

18/07/2021

The Impact of Record Linkage on Learning from Feature Partitioned Data

Richard Nock, Stephen J Hardy, Wilko Henecka and
Hamish Ivey-Law, Jakub Nabaglo, Giorgio Patrini, Guillaume Smith, Brian Thorne

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

6:02

02/02/2021

Semi-supervised Sequence Classification through Change Point Detection

Nauman Ahad, Mark A. Davenport

Keywords Paper

0

0

0

0

14:21

18/07/2021

DriftSurf: Stable-State / Reactive-State Learning under Concept Drift

Ashraf Tahmasbi, Ellango Jothimurugesan, Srikanta Tirthapura, Phil Gibbons

Keywords Paper

Algorithms, Online Learning Algorithms

0

0

0

0

5:07

13/04/2021

List learning with attribute noise

Mahdi Cheraghchi, Elena Grigorescu, Brendan Juba and
Karl Wimmer, Ning Xie

Keywords Paper

0

0

0

0

2:51

06/12/2020

A Local Temporal Difference Code for Distributional Reinforcement Learning

Pablo Tano, Peter Dayan, Alexandre Pouget

Keywords Paper

0

0

0

0

3:24

06/12/2021

Continual Auxiliary Task Learning

Matthew McLeod, Chunlok Lo, Matthew Schlegel and
Andrew Jacobsen, Raksha Kumaraswamy, Martha White, Adam White

Keywords Paper

reinforcement learning and planning

0

0

0

0

5:36

26/04/2020

Maxmin Q-learning: Controlling the Estimation Bias of Q-learning

Qingfeng Lan, Yangchen Pan, Alona Fyshe, Martha White

Keywords Paper

reinforcement learning, bias and variance reduction

0

0

0

0

4:27

19/08/2021

Hindsight Trust Region Policy Optimization

Hanbo Zhang, Site Bai, Xuguang Lan and
David Hsu, Nanning Zheng

Keywords Paper

Machine Learning, Deep Reinforcement Learning, Reinforcement Learning

0

0

0

0

13:14

12/07/2020

Sequential Transfer in Reinforcement Learning with a Generative Model

Andrea Tirinzoni, Riccardo Poiani, Marcello Restelli

Keywords Paper

Reinforcement Learning - General

0

0

0

0

10:54

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

26/04/2020

Reinforced Genetic Algorithm Learning for Optimizing Computation Graphs

Aditya Paliwal, Felix Gimeno, Vinod Nair and
Yujia Li, Miles Lubin, Pushmeet Kohli, Oriol Vinyals

Keywords Paper

reinforcement learning, learning to optimize, combinatorial optimization, computation graphs, model parallelism, learning for systems

0

0

0

0

4:21

06/12/2021

Learning State Representations from Random Deep Action-conditional Predictions

Zeyu Zheng, Vivek Veeriah, Risto Vuorio and
Richard L Lewis, Satinder Singh

Keywords Paper

reinforcement learning and planning, representation learning

0

0

0

0

11:44

03/08/2020

Robust Collective Classification against Structural Attacks

Kai Zhou, Yevgeniy Vorobeychik

Keywords Paper

0

0

0

0

8:42

26/04/2020

Learning Self-Correctable Policies and Value Functions from Demonstrations with Negative Sampling

Yuping Luo, Huazhe Xu, Tengyu Ma

Keywords Paper

imitation learning, model-based imitation learning, model-based RL, behavior cloning, covariate shift

0

0

0

0

4:38

19/08/2021

Ordering-Based Causal Discovery with Reinforcement Learning

Xiaoqiang Wang, Yali Du, Shengyu Zhu and
Liangjun Ke, Zhitang Chen, Jianye Hao, Jun Wang

Keywords Paper

Machine Learning Applications, Applications of Reinforcement Learning, Bayesian Networks

0

0

0

0

8:44

06/12/2021

Structural Credit Assignment in Neural Networks using Reinforcement Learning

Dhawal Gupta, Gabor Mihucz, Matthew Schlegel and
James Kostas, Philip S. Thomas, Martha White

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

7:15

26/04/2020

Causal Discovery with Reinforcement Learning

Shengyu Zhu, Ignavier Ng, Zhitang Chen

Keywords Paper

causal discovery, structure learning, reinforcement learning, directed acyclic graph

0

0

0

0

12:51

13/04/2021

Adaptive approximate policy iteration

Botao Hao, Nevena Lazic, Yasin Abbasi-Yadkori and
Pooria Joulani, Csaba Szepesvari

Keywords Paper

0

0

0

0

3:01

18/07/2021

Generalizable Episodic Memory for Deep Reinforcement Learning

Hao Hu, Jianing Ye, Guangxiang Zhu and
Zhizhou Ren, Chongjie Zhang

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

4:51

03/05/2021

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Keywords Paper

reinforcement learning, imitation learning, reward learning

0

0

0

0

5:09