Learning the Arrow of Time for Problems in Reinforcement Learning

26/04/2020

Nasim Rahaman, Steffen Wolf, Anirudh Goyal, Roman Remme, Yoshua Bengio

Keywords: Arrow of Time, Reinforcement Learning, AI-Safety

Abstract: We humans have an innate understanding of the asymmetric progression of time, which we use to efficiently and safely perceive and manipulate our environment. Drawing inspiration from that, we approach the problem of learning an arrow of time in a Markov (Decision) Process. We illustrate how a learned arrow of time can capture salient information about the environment, which in turn can be used to measure reachability, detect side effects, and obtain an intrinsic reward signal. Finally, we propose a simple yet effective algorithm to parameterize the problem at hand and learn an arrow of time with a function approximator (here, a deep neural network). Our empirical results span a selection of discrete and continuous environments, and demonstrate for a class of stochastic processes that the learned arrow of time agrees reasonably well with a well-known notion of an arrow of time due to Jordan, Kinderlehrer and Otto (1998).

The talk and the corresponding paper were published at the ICLR 2020 virtual conference.

Similar Papers

06/12/2021

Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees

Gregory Dexter, Kevin Bello, Jean Honorio

Keywords: theory, reinforcement learning and planning

14:49

03/05/2021

Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers

Ben Eysenbach, Shreyas Chaudhari, Swapnil Asawa, Sergey Levine, Ruslan Salakhutdinov

Keywords: reinforcement learning, domain adaptation, transfer learning

4:31

18/07/2021

Inverse Decision Modeling: Learning Interpretable Representations of Behavior

Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Keywords: Reinforcement Learning and Planning, Others

16:11

19/08/2021

Probabilistic Sufficient Explanations

Eric Wang, Pasha Khosravi, Guy Van den Broeck

Keywords: Machine Learning, Explainable/Interpretable Machine Learning, Explainability, Exact Probabilistic Inference

12:13

06/12/2021

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

Nathan Grinsztajn, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

Keywords: reinforcement learning and planning

14:31

06/12/2020

ICE-BeeM: Identifiable Conditional Energy-Based Deep Models Based on Nonlinear ICA

Ilyes Khemakhem, Ricardo Monti, Diederik P. Kingma, Aapo Hyvarinen

3:02

06/12/2021

Time-series Generation by Contrastive Imitation

Daniel Jarrett, Ioana Bica, Mihaela van der Schaar

Keywords: generative model

8:47

07/08/2020

Learning to Ask Medical Questions using Reinforcement Learning

Uri Shaham, Tom Zahavy, Cesar Caraballo, Shiwani Mahajan, Daisy Massey, Harlan Krumholz

3:11

06/12/2020

The Advantage of Conditional Meta-Learning for Biased Regularization and Fine Tuning

Giulia Denevi, Massimiliano Pontil, Carlo Ciliberto

3:24

06/12/2020

First Order Constrained Optimization in Policy Space

Yiming Zhang, Quan Vuong, Keith Ross

3:15

06/12/2021

Neural Algorithmic Reasoners are Implicit Planners

Andreea-Ioana Deac, Petar Veličković, Ognjen Milinkovic, Pierre-Luc Bacon, Jian Tang, Mladen Nikolic

Keywords: deep learning, reinforcement learning and planning, self-supervised learning, generative model, graph learning

13:10

26/10/2020

Symbolic Plans as High-Level Instructions for Reinforcement Learning

León Illanes, Xi Yan, Rodrigo Toro Icarte, Sheila A. McIlraith

Keywords: Planning, Reinforcement Learning, Sparse rewards, Sample efficiency, High-level instructions

9:06

26/04/2020

Frequency-based Search-control in Dyna

Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand

Keywords: Model-based reinforcement learning, search-control, Dyna, frequency of a signal

4:32

02/02/2021

Towards Trustworthy Predictions from Deep Neural Networks with Fast Adversarial Calibration

Christian Tomani, Florian Buettner

15:26

06/12/2020

Deep Reinforcement and InfoMax Learning

Bogdan Mazoure, Remi Tachet des Combes, Thang Doan, Philip Bachman, R Devon Hjelm

3:15

26/08/2020

Truly Batch Model-Free Inverse Reinforcement Learning about Multiple Intentions

Giorgia Ramponi, Amarildo Likmeta, Alberto Maria Metelli, Andrea Tirinzoni, Marcello Restelli

9:41

06/12/2021

Gradient Starvation: A Learning Proclivity in Neural Networks

Mohammad Pezeshki, Oumar Kaba, Yoshua Bengio, Aaron Courville, Doina Precup, Guillaume Lajoie

Keywords: theory, deep learning, optimization, robustness

10:52

14/06/2020

ROAM: Recurrently Optimizing Tracking Model

Tianyu Yang, Pengfei Xu, Runbo Hu, Hua Chai, Antoni B. Chan

Keywords: resizable tracking model, recurrent neural optimizer, meta learning, random filter scaling, visual tracking

1:01

03/05/2021

Regularized Inverse Reinforcement Learning

Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Keywords: reinforcement learning, regularized markov decision processes, reward learning, inverse reinforcement learning

9:50

06/12/2021

Self-Supervised Learning with Data Augmentations Provably Isolates Content from Style

Julius von Kügelgen, Yash Sharma, Luigi Gresele, Wieland Brendel, Bernhard Schölkopf, Michel Besserve, Francesco Locatello

Keywords: theory, self-supervised learning, representation learning

16:02

06/12/2021

Learning Markov State Abstractions for Deep Reinforcement Learning

Cameron Allen, Neev Parikh, Omer Gottesman, George Konidaris

Keywords: reinforcement learning and planning, contrastive learning, representation learning

12:31

18/07/2021

Improved OOD Generalization via Adversarial Training and Pretraining

Mingyang Yi, Lu Hou, Jiacheng Sun, Lifeng Shang, Xin Jiang, Qun Liu, Zhiming Ma

Keywords: Theory, Deep learning Theory

4:11

18/07/2021

Prediction-Centric Learning of Independent Cascade Dynamics from Partial Observations

Mateusz Wilinski, Andrey Lokhov

Keywords: Probabilistic Methods, Approximate Inference

6:26

05/01/2021

Optimistic Agent: Accurate Graph-Based Value Estimation for More Successful Visual Navigation

Mahdi Kazemi Moghaddam, Qi Wu, Ehsan Abbasnejad, Javen Shi

4:58

02/02/2021

An LP-Based Approach for Goal Recognition as Planning

Luísa R. A. Santos, Felipe Meneguzzi, Ramon Fraga Pereira, André Grahl Pereira

19:54

12/07/2020

Optimization and Analysis of the pAp@k Metric for Recommender Systems

Gaurush Hiranandani, Warut Vijitbenjaronk, Sanmi Koyejo, Prateek Jain

Keywords: Learning Theory

16:11

19/10/2020

Deep behavior tracing with multi-level temporality preserved embedding

Runze Wu, Hao Deng, Jianrong Tao, Changjie Fan, Qi Liu, Liang Chen

Keywords: game scene preloading, periodicity, self-attention, behavior tracing, temporal irregularity

8:03

06/12/2020

Adaptation Properties Allow Identification of Optimized Neural Codes

Luke Rast, Jan Drugowitsch

3:17

23/08/2020

Adversarial infidelity learning for model interpretation

Jian Liang, Bing Bai, Yuren Cao, Kun Bai, Fei Wang

Keywords: infidelity, model interpretation, adversarial learning, black-box explanations

5:34

12/07/2020

Sequential Transfer in Reinforcement Learning with a Generative Model

Andrea Tirinzoni, Riccardo Poiani, Marcello Restelli

Keywords: Reinforcement Learning - General

10:54

06/12/2021

SmoothMix: Training Confidence-calibrated Smoothed Classifiers for Certified Robustness

Jongheon Jeong, Sejun Park, Minkyu Kim, Heung-Chang Lee, Do-Guk Kim, Jinwoo Shin

Keywords: deep learning, machine learning, robustness, adversarial robustness and security

12:23

06/12/2020

The Value Equivalence Principle for Model-Based Reinforcement Learning

Christopher Grimm, Andre Barreto, Satinder Singh, David Silver

3:19

06/12/2020

On Efficiency in Hierarchical Reinforcement Learning

Zheng Wen, Doina Precup, Morteza Ibrahimi, Andre Barreto, Benjamin Van Roy, Satinder Singh

3:05

02/02/2021

UNIPoint: Universally Approximating Point Processes Intensities

Alexander Soen, Alexander Mathews, Daniel Grixti-Cheng, Lexing Xie

18:32

03/05/2021

Trusted Multi-View Classification

Zongbo Han, Changqing Zhang, Huazhu Fu, Joey T Zhou

Keywords: Uncertainty Machine Learning, Multi-View Learning, Multi-Modal Learning

4:33

26/04/2020

Posterior sampling for multi-agent reinforcement learning: solving extensive games with imperfect information

Yichi Zhou, Jialian Li, Jun Zhu

12:55

19/08/2021

Sensitivity Direction Learning with Neural Networks Using Domain Knowledge as Soft Shape Constraints

Kazuyuki Wakasugi

Keywords: Machine Learning, Explainable/Interpretable Machine Learning, Constraints and Data Mining; Constraints and Machine Learning

14:52

26/04/2020

Meta Reinforcement Learning with Autonomous Inference of Subtask Dependencies

Sungryull Sohn, Hyunjae Woo, Jongwook Choi, Honglak Lee

Keywords: Meta reinforcement learning, subtask graph

5:26

13/04/2021

Sample elicitation

Jiaheng Wei, Zuyue Fu, Yang Liu, Xingyu Li, Zhuoran Yang, Zhaoran Wang

3:16

06/12/2020

Gaussian Process Bandit Optimization of the Thermodynamic Variational Objective

Vu Nguyen, Vaden Masrani, Rob Brekelmans, Michael A Osborne, Frank Wood

3:23