DeepAveragers: Offline Reinforcement Learning By Solving Derived Non-Parametric MDPs

Abstract: We study an approach to offline reinforcement learning (RL) based on optimally solving finitely-represented MDPs derived from a static dataset of experience. This approach can be applied on top of any learned representation and has the potential to easily support multiple solution objectives as well as zero-shot adjustment to changing environments and goals. Our main contribution is to introduce the Deep Averagers with Costs MDP (DAC-MDP) and to investigate its solutions for offline RL. DAC-MDPs are a non-parametric model that can leverage deep representations and account for limited data by introducing costs for exploiting under-represented parts of the model. In theory, we show conditions that allow for lower-bounding the performance of DAC-MDP solutions. We also investigate the empirical behavior in a number of environments, including those with image-based observations. Overall, the experiments demonstrate that the framework can work in practice and scale to large complex offline RL problems.
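
To make the idea concrete, the following is a minimal sketch, not the authors' implementation, of deriving a finite MDP from a static dataset and solving it by value iteration. It assumes a specific construction that the abstract does not spell out: the derived states are the observed next states, each state-action pair averages over its k nearest dataset transitions that used the same action, and each neighbor's reward is reduced by a cost proportional to its Euclidean distance. The names build_derived_mdp, solve, k, and cost are hypothetical, and raw state vectors stand in for a learned representation.

```python
import numpy as np

def build_derived_mdp(states, actions, rewards, next_states, n_actions, k=3, cost=1.0):
    """Build neighbor indices and cost-penalized rewards for a derived finite MDP.

    Assumption (not stated in the abstract): core state i is next_states[i],
    and taking action a in it averages over the k nearest dataset transitions
    that used action a.
    """
    n = len(next_states)
    neighbors = np.zeros((n, n_actions, k), dtype=int)
    penalized_r = np.zeros((n, n_actions, k))
    for a in range(n_actions):
        idx = np.flatnonzero(actions == a)  # dataset transitions that used action a
        if len(idx) < k:
            raise ValueError(f"action {a} has fewer than k={k} transitions")
        # Distance from every core state to the source state of each such transition.
        d = np.linalg.norm(next_states[:, None, :] - states[idx][None, :, :], axis=-1)
        order = np.argsort(d, axis=1)[:, :k]           # k nearest, per core state
        near_d = np.take_along_axis(d, order, axis=1)  # their distances
        neighbors[:, a, :] = idx[order]
        # Poorly supported (distant) neighbors are penalized more heavily.
        penalized_r[:, a, :] = rewards[idx][order] - cost * near_d
    return neighbors, penalized_r

def solve(neighbors, penalized_r, gamma=0.9, iters=200):
    """Value iteration on the derived MDP (uniform 1/k transition weights)."""
    v = np.zeros(neighbors.shape[0])
    q = penalized_r.mean(axis=2)
    for _ in range(iters):
        # Transition i leads to core state i, so v[neighbors] looks up successor values.
        q = (penalized_r + gamma * v[neighbors]).mean(axis=2)
        v = q.max(axis=1)
    return q, v
```

In this sketch, a larger cost can only lower the resulting values, which reflects the abstract's point that exploiting under-represented parts of the model is discouraged; how the cost and k should be chosen is left open here.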

06/12/2020

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

Aayam Shrestha, Stefan Lee, Prasad Tadepalli, Alan Fern

Similar Papers

Functional Regularization for Representation Learning: A Unified Theoretical Perspective

Siddhant Garg, Yingyu Liang

Active learning with maximum margin sparse gaussian processes

Weishi Shi, Qi Yu

Learnable Bernoulli Dropout for Bayesian Deep Learning

Shahin Boluki, Randy Ardywibowo, Siamak Zamani Dadaneh, Mingyuan Zhou, Xiaoning Qian

Learning explanations that are hard to vary

Giambattista Parascandolo, Alexander Neitz, Antonio Orvieto, Luigi Gresele, Bernhard Schoelkopf

invariances, gradient alignment, consistency

Structured Prediction with Partial Labelling through the Infimum Loss

Vivien Cabannes, Francis Bach, Alessandro Rudi

Sequential, Network, and Time-Series Modeling

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

Alekh Agarwal, Sham Kakade, Akshay Krishnamurthy, Wen Sun

Adversarial Intrinsic Motivation for Reinforcement Learning

Ishan Durugkar, Mauricio Tec, Scott Niekum, Peter Stone

reinforcement learning and planning, generative model

PerSim: Data-Efficient Offline Reinforcement Learning with Heterogeneous Agents via Personalized Simulators

Anish Agarwal, Abdullah Alomar, Varkey Alumootil, Devavrat Shah, Dennis Shen, Zhi Xu, Cindy Yang

deep learning, reinforcement learning and planning

COMBO: Conservative Offline Model-Based Policy Optimization

Tianhe Yu, Aviral Kumar, Rafael Rafailov, Aravind Rajeswaran, Sergey Levine, Chelsea Finn

deep learning, optimization, reinforcement learning and planning

A Distribution-dependent Analysis of Meta Learning

Mikhail Konobeev, Ilja Kuzborskij, Csaba Szepesvari

Theory, Statistical Learning Theory

Counterfactual Maximum Likelihood Estimation for Training Deep Networks

Xinyi Wang, Wenhu Chen, Michael Saxon, William Yang Wang

deep learning, domain adaptation, causality, language

Few-Shot Learning via Embedding Adaptation With Set-to-Set Functions

Han-Jia Ye, Hexiang Hu, De-Chuan Zhan, Fei Sha

few-shot learning, meta-learning, embedding learning, embedding adaptation, set-to-set

EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL

Seyed Kamyar Seyed Ghasemipour, Dale Schuurmans, Shixiang Gu

Reinforcement Learning and Planning

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

reinforcement learning and planning, robustness, representation learning

A theoretical characterization of semi-supervised learning with self-training for gaussian mixture models

Samet Oymak, Talha Cihad Gulcu

A Variational Approach for Learning from Positive and Unlabeled Data

Hui Chen, Fangqing Liu, Yin Wang, Liyue Zhao, Hao Wu

Variational Empowerment as Representation Learning for Goal-Conditioned Reinforcement Learning

Jongwook Choi, Archit Sharma, Honglak Lee, Sergey Levine, Shixiang Gu

Neuroscience and Cognitive Science, Neuroscience, Reinforcement Learning and Planning, Algorithms, Representation Learning; Algorithms, Sparse Coding and Dimensionality Expansion; Applications, Matrix and Ten

Implicit Rank-Minimizing Autoencoder

Li Jing, Jure Zbontar, Yann LeCun

Calibrated Surrogate Maximization of Linear-fractional Utility in Binary Classification

Han Bao, Masashi Sugiyama

Cross-Domain Few-Shot Classification via Adversarial Task Augmentation

Haoqing Wang, Zhi-Hong Deng

Computer Vision, Recognition, Adversarial Machine Learning, Deep Learning

Zero-Shot Recognition via Optimal Transport

Wenlin Wang, Hongteng Xu, Guoyin Wang, Wenqi Wang, Lawrence Carin

Adversarial Multi Class Learning under Weak Supervision with Performance Guarantees

Alessio Mazzetto, Cyrus Cousins, Dylan Sam, Stephen Bach, Eli Upfal
