Solving Continuous Control with Episodic Memory

Abstract: Episodic memory lets reinforcement learning algorithms remember and exploit promising past experience to improve agent performance. Previous work on memory mechanisms has shown that episodic data structures improve sample efficiency on discrete-action problems. Applying episodic memory to continuous control with a large action space is not trivial. Our study aims to answer the question: can episodic memory be used to improve an agent's performance in continuous control? Our proposed algorithm combines episodic memory with an Actor-Critic architecture by modifying the critic's objective. We further improve performance by introducing episodic-based replay buffer prioritization. We evaluate our algorithm on OpenAI Gym domains and show greater sample efficiency compared with state-of-the-art model-free off-policy algorithms.

06/12/2021

Deep Reinforcement Learning, Episodic Control, Episodic Memory, Associative Memory, Non-Parametric Method, Sample Efficiency

06/12/2020

Igor Kuznetsov, Andrey Filchenkov

Similar Papers

Model-Based Episodic Memory Induces Dynamic Hybrid Controls

Hung Le, Thommen Karimpanal George, Majid Abdolshah, Truyen Tran, Svetha Venkatesh

reinforcement learning and planning

Learning to Sample with Local and Global Contexts in Experience Replay Buffer

Youngmin Oh, Kimin Lee, Jinwoo Shin, Eunho Yang, Sung Ju Hwang

reinforcement learning, off-policy RL, experience replay buffer

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

Episodic Reinforcement Learning with Associative Memory

Guangxiang Zhu*, Zichuan Lin*, Guangwen Yang, Chongjie Zhang

Deep Reinforcement Learning, Episodic Control, Episodic Memory, Associative Memory, Non-Parametric Method, Sample Efficiency

An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay

Scott Fujimoto, David Meger, Doina Precup

Improving Computational Efficiency in Visual Reinforcement Learning via Stored Embeddings

Lili Chen, Kimin Lee, Aravind Srinivas, Pieter Abbeel

deep learning, reinforcement learning and planning

Rewriting History with Inverse RL: Hindsight Inference for Policy Improvement

Benjamin Eysenbach, Xinyang Geng, Sergey Levine, Russ Salakhutdinov

Optimization -> Non-Convex Optimization, Theory -> Statistical Physics of Learning

Learning Markov State Abstractions for Deep Reinforcement Learning

Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris

reinforcement learning and planning, contrastive learning, representation learning

A New Representation of Successor Features for Transfer across Dissimilar Environments

Majid Abdolshah, Hung Le, Thommen Karimpanal George, Sunil Gupta, Santu Rana, Svetha Venkatesh

Reinforcement Learning and Planning

Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting

Sayna Ebrahimi, Suzanne Petryk, Akash Gokul, William Gan, Joseph E Gonzalez, Marcus Rohrbach, Trevor Darrell

Explainability, Catastrophic Forgetting, Continual Learning, XAI, Lifelong Learning

Regret Minimization Experience Replay in Off-Policy Reinforcement Learning

Xu-Hui Liu, Zhenghai Xue, Jingcheng Pang, Shengyi Jiang, Feng Xu, Yang Yu

theory, reinforcement learning and planning

Neurally Augmented ALISTA

Freya Behrens, Jonathan Sauder, Peter Jung

learned ISTA, unrolled algorithms, compressed sensing, sparse reconstruction

Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees

Gregory Dexter, Kevin Bello, Jean Honorio

theory, reinforcement learning and planning

Revisiting Fundamentals of Experience Replay

William Fedus, Prajit Ramachandran, Rishabh Agarwal, Yoshua Bengio, Hugo Larochelle, Mark Rowland, Will Dabney

Reinforcement Learning - Deep RL

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

reinforcement learning, imitation learning, reward learning

Encoding Human Domain Knowledge to Warm Start Reinforcement Learning

Andrew Silva, Matthew Gombolay

Dynamic Automaton-Guided Reward Shaping for Monte Carlo Tree Search

Alvaro Velasquez, Brett Bissey, Lior Barak, Andre Beckus, Ismail Alkhouri, Daniel Melcer, George Atia

Adaptive Discretization for Model-Based Reinforcement Learning

Sean Sinclair, Tianyu Wang, Gauri Jain, Sid Banerjee, Christina Yu

Meta-Q-Learning

Rasool Fakoor, Pratik Chaudhari, Stefano Soatto, Alexander J. Smola

meta reinforcement learning, propensity estimation, off-policy

Provably Efficient Algorithms for Multi-Objective Competitive RL

Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra

Theory, RL, Decisions and Control Theory

Learning Routines for Effective Off-Policy Reinforcement Learning
