Momentum in Reinforcement Learning

26/08/2020

Momentum in Reinforcement Learning

Nino Vieillard, Bruno Scherrer, Olivier Pietquin, Matthieu Geist

Keywords:

Abstract Paper Similar Papers

Abstract: We adapt the optimization's concept of momentum to reinforcement learning. Seeing the state-action value functions as an anlog to the gradients in optimization, we interpret momentum as an average of consecutive $q$-functions. We derive Momentum Value Iteration (MoVI), a variation of Value iteration that incorporates this momentum idea. Our analysis shows that this allows MoVI to average errors over successive iterations. We show that the proposed approach can be readily extended to deep learning. Specifically,we propose a simple improvement on DQN based on MoVI, and experiment it on Atari games.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AISTATS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

The Value-Improvement Path: Towards Better Representations for Reinforcement Learning

Will Dabney, André Barreto, Mark Rowland and
Robert Dadashi, John Quan, Marc G. Bellemare, David Silver

Keywords Paper

0

0

0

0

20:06

02/02/2021

Bayesian Distributional Policy Gradients

Luchen Li, A. Aldo Faisal

Keywords Paper

1

0

0

0

18:06

06/12/2020

An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay

Scott Fujimoto, David Meger, Doina Precup

Keywords Paper

0

0

0

0

2:53

18/07/2021

Convex Regularization in Monte-Carlo Tree Search

Tuan Q Dam, Carlo D'Eramo, Jan Peters, Joni Pajarinen

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

4:52

13/04/2021

Logistic q-learning

Joan Bas-Serrano, Sebastian Curi, Andreas Krause, Gergely Neu

Keywords Paper

0

0

0

0

2:44

06/12/2020

A new convergent variant of Q-learning with linear function approximation

Diogo Carvalho, Francisco S. Melo, Pedro A. Santos

Keywords Paper

0

0

0

0

2:30

26/04/2020

Fast Task Inference with Variational Intrinsic Successor Features

Steven Hansen, Will Dabney, Andre Barreto and
David Warde-Farley, Tom Van de Wiele, Volodymyr Mnih

Keywords Paper

Reinforcement Learning, Variational Intrinsic Control, Successor Features

0

0

0

0

14:47

26/04/2020

Meta-Q-Learning

Rasool Fakoor, Pratik Chaudhari, Stefano Soatto, Alexander J. Smola

Keywords Paper

meta reinforcement learning, propensity estimation, off-policy

0

0

0

0

15:50

18/07/2021

Spectral Normalisation for Deep Reinforcement Learning: An Optimisation Perspective

Florin Gogianu, Tudor Berariu, Mihaela Rosca and
Claudia Clopath, Lucian Busoniu, Razvan Pascanu

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:04

03/05/2021

Evolving Reinforcement Learning Algorithms

John Co-Reyes, Yingjie Miao, Daiyi Peng and
Esteban Real, Quoc V Le, Sergey Levine, Honglak Lee, Aleksandra Faust

Keywords Paper

reinforcement learning, genetic programming, meta-learning, evolutionary algorithms

0

0

0

0

13:59

06/12/2020

Discovering Reinforcement Learning Algorithms

Junhyuk Oh, Matteo Hessel, Wojciech Czarnecki and
Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver

Keywords Paper

0

0

0

0

3:21

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate and
Tom Melham, Daniel Kroening

Keywords Paper

0

0

0

0

15:45

06/12/2020

Meta-Gradient Reinforcement Learning with an Objective Discovered Online

Zhongwen Xu, Hado van Hasselt, Matteo Hessel and
Junhyuk Oh, Satinder Singh, David Silver

Keywords Paper

0

0

0

0

3:24

06/12/2021

Decision Transformer: Reinforcement Learning via Sequence Modeling

Lili Chen, Kevin Lu, Aravind Rajeswaran and
Kimin Lee, Aditya Grover, Misha Laskin, Pieter Abbeel, Aravind Srinivas, Igor Mordatch

Keywords Paper

deep learning, reinforcement learning and planning, transformers, generative model

0

0

0

0

6:51

18/07/2021

Muesli: Combining Improvements in Policy Optimization

Matteo Hessel, Ivo Danihelka, Fabio Viola and
Arthur Guez, Simon Schmitt, Laurent Sifre, Theo Weber, David Silver, Hado van Hasselt

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:16

06/12/2020

Munchausen Reinforcement Learning

Nino Vieillard, Olivier Pietquin, Matthieu Geist

Keywords Paper

0

0

0

0

3:19

06/12/2021

Heuristic-Guided Reinforcement Learning

Ching-An Cheng, Andrey Kolobov, Adith Swaminathan

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

8:39

06/12/2021

An online passive-aggressive algorithm for difference-of-squares classification

Lawrence Saul

Keywords Paper

machine learning, online learning

0

0

0

0

14:00

18/07/2021

Putting the ``Learning" into Learning-Augmented Algorithms for Frequency Estimation

Elbert Du, Franklyn Wang, Michael Mitzenmacher

Keywords Paper

Applications, Hardware and Systems

0

0

0

0

5:17

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

02/02/2021

Strategy and Benchmark for Converting Deep Q-Networks to Event-Driven Spiking Neural Networks

Weihao Tan, Devdhar Patel, Robert Kozma

Keywords Paper

0

0

0

0

18:09

02/02/2021

Harmonized Dense Knowledge Distillation Training for Multi-Exit Architectures

Xinglu Wang, Yingming Li

Keywords Paper

0

0

0

0

15:12

12/07/2020

Taylor Expansion Policy Optimization

Yunhao Tang, Michal Valko, Remi Munos

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:05

12/07/2020

Off-Policy Actor-Critic with Shared Experience Replay

Simon Schmitt, Matteo Hessel, Karen Simonyan

Keywords Paper

Reinforcement Learning - Deep RL

1

0

0

1

14:38

26/04/2020

On the Variance of the Adaptive Learning Rate and Beyond

Liyuan Liu, Haoming Jiang, Pengcheng He and
Weizhu Chen, Xiaodong Liu, Jianfeng Gao, Jiawei Han

Keywords Paper

warmup, adam, adaptive learning rate, variance

0

0

0

0

4:38

02/02/2021

Value-Decomposition Multi-Agent Actor-Critics

Jianyu Su, Stephen Adams, Peter Beling

Keywords Paper

0

0

0

0

19:21

03/05/2021

EigenGame: PCA as a Nash Equilibrium

Ian Gemp, Brian McWilliams, Claire Vernade, Thore Graepel

Keywords Paper

singular value decomposition, svd, eigendecomposition, nash, principal components analysis, pca, games

0

0

0

0

14:56

18/07/2021

Linear Transformers Are Secretly Fast Weight Programmers

Imanol Schlag, Kazuki Irie, Jürgen Schmidhuber

Keywords Paper

Deep Learning

0

0

0

0

5:18

06/12/2020

Modeling and Optimization Trade-off in Meta-learning

Katelyn Gao, Ozan Sener

Keywords Paper

0

0

0

0

3:21

26/04/2020

Never Give Up: Learning Directed Exploration Strategies

Adrià Puigdomènech Badia, Pablo Sprechmann, Alex Vitvitskyi and
Daniel Guo, Bilal Piot, Steven Kapturowski, Olivier Tieleman, Martin Arjovsky, Alexander Pritzel, Andrew Bolt, Charles Blundell

Keywords Paper

deep reinforcement learning, exploration, intrinsic motivation

0

0

0

0

5:30

19/01/2020

Proving Expected Sensitivity of Probabilistic Programs with Randomized Variable-Dependent Termination Time

Peixin Wang, Hongfei Fu, Krishnendu Chatterjee and
Yuxin Deng, Ming Xu

Keywords Paper

Martingales, Expected Sensitivity, Probabilistic Programs

0

0

0

0

21:04

06/12/2021

Model Selection for Bayesian Autoencoders

Ba-Hien Tran, Simone Rossi, Dimitrios Milios and
Pietro Michiardi, Edwin Bonilla, Maurizio Filippone

Keywords Paper

optimization, self-supervised learning, generative model, representation learning

0

0

0

0

10:49

06/12/2021

Regret Minimization Experience Replay in Off-Policy Reinforcement Learning

Xu-Hui Liu, Zhenghai Xue, Jingcheng Pang and
Shengyi Jiang, Feng Xu, Yang Yu

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

14:06

06/12/2020

RL Unplugged: A Collection of Benchmarks for Offline Reinforcement Learning

CAGLAR Gulcehre, Ziyu Wang, Alexander Novikov and
Thomas Paine, Sergio Gómez, Konrad Zolna, Rishabh Agarwal, Josh Merel, Daniel Mankowitz, Cosmin Paduraru, Gabriel Dulac-Arnold, Jerry Li, Mohammad Norouzi, Matthew Hoffman, Nicolas Heess, Nando de Freitas

Keywords Paper

0

0

0

0

3:25

06/12/2021

Offline Reinforcement Learning as One Big Sequence Modeling Problem

Michael Janner, Qiyang Li, Sergey Levine

Keywords Paper

reinforcement learning and planning, transformers, language

0

0

0

0

9:48

06/12/2020

Direct Policy Gradients: Direct Optimization of Policies in Discrete Action Spaces

Guy Lorberbom, Chris J. Maddison, Nicolas Heess and
Tamir Hazan, Daniel Tarlow

Keywords Paper

0

0

0

0

3:16

18/07/2021

Bilevel Optimization: Convergence Analysis and Enhanced Design

Kaiyi Ji, Junjie Yang, Yingbin LIANG

Keywords Paper

Optimization, Non-Convex Optimization

0

0

0

0

5:02

18/07/2021

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee and
Sergey Levine, Shixiang Gu

Keywords Paper

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

0

0

0

0

5:16

12/07/2020

Randomized Block-Diagonal Preconditioning for Parallel Learning

Celestine Mendler-Dünner, Aurelien Lucchi

Keywords Paper

Optimization - Large Scale, Parallel and Distributed

0

0

0

0

12:57

12/07/2020

Sub-Goal Trees -- a Framework for Goal-Based Reinforcement Learning

Tom Jurgenson, Or Avner, Edward Groshev, Aviv Tamar

Keywords Paper

Reinforcement Learning - General

0

0

0

0

15:04