Multiple-Environment Markov Decision Processes: Efficient Analysis and Applications

26/10/2020

Krishnendu Chatterjee, Martin Chmelík, Deep Karkhanis, Petr Novotný, Amélie Royer

Keywords: Markov decision processes, recommender systems, parameterized MDPs, partially observable MDPs

Abstract: Multiple-environment Markov decision processes (MEMDPs) are MDPs equipped with not one, but multiple probabilistic transition functions, which represent the various possible unknown environments. While previous research on MEMDPs focused on theoretical properties for long-run average payoff, we study them with discounted-sum payoff and focus on their practical advantages and applications. MEMDPs can be viewed as a special case of partially observable and mixed-observability MDPs: the state of the system is perfectly observable, but the environment is not. We show that the specific structure of MEMDPs allows for more efficient algorithmic analysis, in particular for faster belief updates. We experimentally demonstrate the applicability of MEMDPs in several domains, including contextual recommendation systems and parameterized Markov decision processes.
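
As a rough illustration of why belief updates can be cheap when only the environment is hidden: since the state is observed, the belief is a distribution over the finite set of candidate environments, and one Bayes step touches a single entry per environment. The sketch below is not the authors' implementation; the table layout, the function name `update_belief`, and the two toy environments are assumptions made purely for illustration.

```python
from typing import Dict, List, Tuple

# Hypothetical encoding: one transition table per environment,
# mapping (state, action) -> {next_state: probability}.
Transition = Dict[Tuple[str, str], Dict[str, float]]


def update_belief(belief: List[float], envs: List[Transition],
                  state: str, action: str, next_state: str) -> List[float]:
    """Bayes update of the belief over environments after one observed transition."""
    # Weight each environment by how likely it makes the observed transition.
    weights = [
        b * env.get((state, action), {}).get(next_state, 0.0)
        for b, env in zip(belief, envs)
    ]
    total = sum(weights)
    if total == 0.0:
        raise ValueError("observed transition is impossible under the current belief")
    # Normalise so the posterior sums to one.
    return [w / total for w in weights]


# Two toy environments that differ only in how action "a" behaves in state "s0".
env_a = {("s0", "a"): {"s1": 0.9, "s0": 0.1}}
env_b = {("s0", "a"): {"s1": 0.2, "s0": 0.8}}

# Start from a uniform prior and observe the transition s0 --a--> s1.
posterior = update_belief([0.5, 0.5], [env_a, env_b], "s0", "a", "s1")
print(posterior)
```

Under these assumptions the update costs one table lookup per environment, whereas a generic POMDP belief update would also have to sum over hidden states.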

The talk and the corresponding paper were published at the ICAPS 2020 virtual conference.

Similar Papers

06/12/2020

Multi-task Additive Models for Robust Estimation and Automatic Structure Discovery

Yingjie Wang, Hong Chen, Feng Zheng, Chen Xu, Tieliang Gong, Yanhong Chen

Keywords: Applications -> Time Series Analysis; Probabilistic Methods -> Variational Inference, Probabilistic Methods -> Causal Inference

3:00

12/07/2020

Minimax Weight and Q-Function Learning for Off-Policy Evaluation

Masatoshi Uehara, Jiawei Huang, Nan Jiang

Keywords: Reinforcement Learning - Theory

14:20

02/02/2021

Deep Recurrent Belief Propagation Network for POMDPs

Yuhui Wang, Xiaoyang Tan

15:15

13/04/2021

Sparse algorithms for Markovian Gaussian processes

William Wilkinson, Arno Solin, Vincent Adam

3:10

18/07/2021

A Discriminative Technique for Multiple-Source Adaptation

Corinna Cortes, Mehryar Mohri, Ananda Theertha Suresh, Ningshan Zhang

Keywords: Applications, Algorithms, Multitask, Transfer, and Meta Learning

4:49

06/12/2020

On Efficiency in Hierarchical Reinforcement Learning

Zheng Wen, Doina Precup, Morteza Ibrahimi, Andre Barreto, Benjamin Van Roy, Satinder Singh

3:05

06/12/2021

Twice regularized MDPs and the equivalence between robustness and regularization

Esther Derman, Matthieu Geist, Shie Mannor

Keywords: optimization, reinforcement learning and planning, robustness

14:19

03/05/2021

Calibration tests beyond classification

David Widmann, Fredrik Lindsten, Dave Zachariah

Keywords: uncertainty quantification, maximum mean discrepancy, integral probability metric, framework, calibration

6:05

06/12/2021

Asymptotically Exact Error Characterization of Offline Policy Evaluation with Misspecified Linear Models

Kohei Miyaguchi

Keywords: reinforcement learning and planning

9:06

06/12/2021

Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning

Hiroki Furuta, Tadashi Kozuno, Tatsuya Matsushima, Yutaka Matsuo, Shixiang (Shane) Gu

Keywords: reinforcement learning and planning

10:00

18/07/2021

Continuous-time Model-based Reinforcement Learning

Cagatay Yildiz, Markus Heinonen, Harri Lähdesmäki

Keywords: Reinforcement Learning and Planning

5:00

06/12/2021

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

Fan Bao, Guoqiang Wu, Chongxuan LI, Jun Zhu, Bo Zhang

Keywords: optimization

8:58

06/12/2020

Promoting Stochasticity for Expressive Policies via a Simple and Efficient Regularization Method

Qi Zhou, Yufei Kuang, Zherui Qiu, Houqiang Li, Jie Wang

3:10

06/12/2020

Approximate Cross-Validation for Structured Models

Soumya Ghosh, Will Stephenson, Stan Nguyen, Sameer Deshpande, Tamara Broderick

3:24

12/07/2020

The continuous categorical: a novel simplex-valued exponential family

Elliott Gordon-Rodriguez, Gabriel Loaiza-Ganem, John Cunningham

Keywords: Probabilistic Inference - Models and Probabilistic Programming

14:59

03/08/2020

Kernel Conditional Moment Test via Maximum Moment Restriction

Krikamol Muandet, Wittawat Jitkrittum, Jonas Kübler

6:26

18/07/2021

Efficient Statistical Tests: A Neural Tangent Kernel Approach

Sheng Jia, Ehsan Nezhadarya, Yuhuai Wu, Jimmy Ba

Keywords: Deep Learning

5:13

02/02/2021

Counterfactual Explanations for Oblique Decision Trees: Exact, Efficient Algorithms

Miguel Á. Carreira-Perpiñán, Suryabhan Singh Hada

16:16

14/09/2020

A General Machine Learning Framework for Survival Analysis

Andreas Bender, David Rügamer, Fabian Scheipl, Bernd Bischl

Keywords: survival analysis, gradient boosting, neural networks, competing risks, multi-state models

13:37

12/07/2020

Non-convex Learning via Replica Exchange Stochastic Gradient MCMC

Wei Deng, Qi Feng, Liyao Gao, Faming Liang, Guang Lin

Keywords: Probabilistic Inference - Approximate, Monte Carlo, and Spectral Methods

15:01

06/12/2021

Learning to Select Exogenous Events for Marked Temporal Point Process

Ping Zhang, Rishabh Iyer, Ashish Tendulkar, Gaurav Aggarwal, Abir De

12:27

04/07/2020

Towards Transparent and Explainable Attention Models

Akash Kumar Mohankumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran

Keywords: interpretability distributions, attention mechanisms, Human evaluations, Transparent Models

11:58

18/07/2021

On Estimation in Latent Variable Models

Guanhua Fang, Ping Li

Keywords: Optimization, Stochastic Optimization

4:55

06/12/2021

Structured Dropout Variational Inference for Bayesian Neural Networks

Son Nguyen, Duong Nguyen, Khai Nguyen, Khoat Than, Hung Bui, Nhat Ho

Keywords: deep learning, generative model

11:28

26/08/2020

On the Convergence Theory of Gradient-Based Model-Agnostic Meta-Learning Algorithms

Alireza Fallah, Aryan Mokhtari, Asuman Ozdaglar

15:02

06/12/2021

Continuous Latent Process Flows

Ruizhi Deng, Marcus Brubaker, Greg Mori, Andreas M Lehrmann

Keywords: generative model

14:54

02/02/2021

Sequential Generative Exploration Model for Partially Observable Reinforcement Learning

Haiyan Yin, Jianda Chen, Sinno Jialin Pan, Sebastian Tschiatschek

14:40

06/12/2021

Closing the Gap: Tighter Analysis of Alternating Stochastic Gradient Methods for Bilevel Problems

Tianyi Chen, Yuejiao Sun, Wotao Yin

Keywords: theory, optimization, reinforcement learning and planning, machine learning

7:27

02/02/2021

Progression Heuristics for Planning with Probabilistic LTL Constraints

Ian Mallett, Sylvie Thiebaux, Felipe Trevizan

18:23

19/08/2021

GSPL: A Succinct Kernel Model for Group-Sparse Projections Learning of Multiview Data

Danyang Wu, Jin Xu, Xia Dong, Meng Liao, Rong Wang, Feiping Nie, Xuelong Li

Keywords: Machine Learning, Learning Sparse Models, Multi-instance; Multi-label; Multi-view learning, Unsupervised Learning

11:48

06/12/2020

Leverage the Average: an Analysis of KL Regularization in Reinforcement Learning

Nino Vieillard, Tadashi Kozuno, Bruno Scherrer, Olivier Pietquin, Remi Munos, Matthieu Geist

3:25

13/04/2021

Adversarially robust estimate and risk analysis in linear regression

Yue Xing, Ruizhi Zhang, Guang Cheng

3:03

12/07/2020

Fast and Consistent Learning of Hidden Markov Models by Incorporating Non-Consecutive Correlations

Robert Mattila, Cristian Rojas, Eric Moulines, Vikram Krishnamurthy, Bo Wahlberg

Keywords: Sequential, Network, and Time-Series Modeling

13:37

06/12/2021

Stateful Strategic Regression

Keegan Harris, Hoda Heidari, Steven Wu

Keywords: optimization

14:02

26/08/2020

Adaptive, Distribution-Free Prediction Intervals for Deep Networks

Danijel Kivaranovic, Kory D. Johnson, Hannes Leeb

16:48

06/12/2020

A Local Temporal Difference Code for Distributional Reinforcement Learning

Pablo Tano, Peter Dayan, Alexandre Pouget

3:24

13/04/2021

Towards a theoretical understanding of the robustness of variational autoencoders

Alexander Camuto, Matthew Willetts, Stephen Roberts, Chris Holmes, Tom Rainforth

3:00

02/02/2021

Bayes DistNet - A Robust Neural Network for Algorithm Runtime Distribution Predictions

Jake Tuero, Michael Buro

18:39

06/12/2021

Why Generalization in RL is Difficult: Epistemic POMDPs and Implicit Partial Observability

Dibya Ghosh, Jad Rahme, Aviral Kumar, Amy Zhang, Ryan Adams, Sergey Levine

Keywords: reinforcement learning and planning

15:17

12/07/2020

Online metric algorithms with untrusted predictions

Antonios Antoniadis, Christian Coester, Marek Elias, Adam Polak, Bertrand Simon

Keywords: Trustworthy Machine Learning

15:15