Constrained Risk-Averse Markov Decision Processes

02/02/2021

Mohamadreza Ahmadi, Ugo Rosolia, Michel D. Ingham, Richard M. Murray, Aaron D. Ames

Abstract: We consider the problem of designing policies for Markov decision processes (MDPs) with dynamic coherent risk objectives and constraints. We begin by formulating the problem in a Lagrangian framework. Under the assumption that the risk objectives and constraints can be represented by a Markov risk transition mapping, we propose an optimization-based method to synthesize Markovian policies that lower-bound the constrained risk-averse problem. We demonstrate that the formulated optimization problems are in the form of difference convex programs (DCPs) and can be solved by the disciplined convex-concave programming (DCCP) framework. We show that these results generalize linear programs for constrained MDPs with total discounted expected costs and constraints. Finally, we illustrate the effectiveness of the proposed method with numerical experiments on a rover navigation problem involving conditional-value-at-risk (CVaR) and entropic-value-at-risk (EVaR) coherent risk measures.
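The two coherent risk measures named in the abstract can be illustrated on a small discrete cost distribution. The sketch below is not taken from the paper. It assumes the standard variational definitions with tail probability eps in (0, 1], namely CVaR_eps(c) = inf_z { z + E[(c - z)_+] / eps } and EVaR_eps(c) = inf_{z > 0} log(E[exp(z c)] / eps) / z. The function names, the example costs, and the grid used for EVaR are all hypothetical choices made for illustration.

```python
import math


def cvar(costs, probs, eps):
    """CVaR at tail probability eps in (0, 1] for a discrete cost distribution."""
    def objective(z):
        tail = sum(p * max(c - z, 0.0) for c, p in zip(costs, probs))
        return z + tail / eps

    # The objective is convex and piecewise linear in z with breakpoints at the
    # support points, so it is enough to evaluate it at each cost value.
    return min(objective(z) for z in costs)


def evar(costs, probs, eps, grid_size=2000):
    """Grid-search upper bound on EVaR (assumes strictly positive probabilities)."""
    def objective(z):
        # Log-sum-exp shift keeps exp() from overflowing for large z.
        shift = max(z * c for c in costs)
        mgf = sum(p * math.exp(z * c - shift) for c, p in zip(costs, probs))
        return (shift + math.log(mgf) - math.log(eps)) / z

    # Log-spaced grid over z in [1e-4, 1e4]; an assumption, not a tuned choice.
    zs = (10.0 ** (-4.0 + 8.0 * k / (grid_size - 1)) for k in range(grid_size))
    return min(objective(z) for z in zs)


if __name__ == "__main__":
    costs = [0.0, 1.0, 10.0]   # hypothetical stage costs
    probs = [0.5, 0.4, 0.1]    # hypothetical probabilities
    for eps in (1.0, 0.5, 0.1):
        print(eps, cvar(costs, probs, eps), evar(costs, probs, eps))
```

With eps = 1 both measures reduce to the expected cost, and as eps decreases both move toward the worst-case cost, with EVaR never smaller than CVaR at the same eps. Because the EVaR value comes from a finite grid, it is an upper bound on the true infimum rather than an exact value.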

The video of this talk is available at: https://slideslive.com/38948733

The talk and the corresponding paper were published at the AAAI 2021 virtual conference.

Similar Papers

06/12/2021

Risk Bounds and Calibration for a Smart Predict-then-Optimize Method

Heyuan Liu, Paul Grigas

Keywords: theory, optimization, machine learning

14:56

06/12/2020

Bayesian Optimization of Risk Measures

Sait Cakmak, Raul Astudillo Marban, Peter Frazier, Enlu Zhou

3:13

26/08/2020

Why Non-myopic Bayesian Optimization is Promising and How Far Should We Look-ahead? A Study via Rollout

Xubo Yue, Raed Al Kontar

13:38

02/02/2021

Wasserstein Distributionally Robust Inverse Multiobjective Optimization

Chaosheng Dong, Bo Zeng

14:45

03/05/2021

Fast convergence of stochastic subgradient method under interpolation

Huang Fang, Zhenan Fan, Michael Friedlander

Keywords: interpolation, stochastic subgradient method, convergence analysis, Optimization

4:42

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng, Ben Letham, Hongzi Mao, Eytan Bakshy

3:29

06/12/2020

Empirical Likelihood for Contextual Bandits

Nikos Karampatziakis, John Langford, Paul Mineiro

2:02

13/04/2021

Learning prediction intervals for regression: Generalization and calibration

Haoxian Chen, Ziyi Huang, Henry Lam, Huajie Qian, Haofeng Zhang

3:26

18/07/2021

Provably Correct Optimization and Exploration with Non-linear Policies

Fei Feng, Wotao Yin, Alekh Agarwal, Lin Yang

Keywords: Deep Learning, Adversarial Networks, Applications, Fairness, Accountability, and Transparency, Theory, RL, Decisions and Control Theory

5:03

06/12/2020

Geometric Exploration for Online Control

Orestis Plevrakis, Elad Hazan

3:21

06/12/2020

Distributionally Robust Federated Averaging

Yuyang Deng, Mohammad Mahdi Kamani, Mehrdad Mahdavi

3:11

03/05/2021

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

Keywords: reinforcement learning, model-predictive control

5:09

06/12/2020

IDEAL: Inexact DEcentralized Accelerated Augmented Lagrangian Method

Yossi Arjevani, Joan Bruna, Bugra Can, Mert Gurbuzbalaban, Stefanie Jegelka, Hongzhou Lin

3:24

06/12/2020

Variational Policy Gradient Method for Reinforcement Learning with General Utilities

Junyu Zhang, Alec Koppel, Amrit Bedi, Csaba Szepesvari, Mengdi Wang

3:20

12/07/2020

Class-Weighted Classification: Trade-offs and Robust Approaches

Ziyu Xu, Chen Dan, Justin Khim, Pradeep Ravikumar

Keywords: Learning Theory

11:49

06/12/2020

First-Order Methods for Large-Scale Market Equilibrium Computation

Yuan Gao, Christian Kroer

3:17

12/07/2020

Convex Calibrated Surrogates for the Multi-Label F-Measure

Mingyuan Zhang, Harish Guruprasad Ramaswamy, Shivani Agarwal

Keywords: Supervised Learning

16:09

26/08/2020

Mixed Strategies for Robust Optimization of Unknown Objectives

Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

14:13

06/12/2020

The Wasserstein Proximal Gradient Algorithm

Adil Salim, Anna Korba, Giulia Luise

3:14

13/04/2021

Logistic Q-Learning

Joan Bas-Serrano, Sebastian Curi, Andreas Krause, Gergely Neu

2:44

06/12/2020

One Ring to Rule Them All: Certifiably Robust Geometric Perception with Outliers

Heng Yang, Luca Carlone

3:24

19/08/2021

Variational Model-based Policy Optimization

Yinlam Chow, Brandon Cui, Moonkyung Ryu, Mohammad Ghavamzadeh

Keywords: Machine Learning, Reinforcement Learning

15:31

06/12/2021

Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System

Matthias Schultheis, Dominik Straub, Constantin Rothkopf

9:29

06/12/2020

Quantized Variational Inference

Amir Dib

2:28

06/12/2020

Kernel Alignment Risk Estimator: Risk Prediction from Training Data

Arthur Jacot, Berfin Simsek, Francesco Spadaro, Clement Hongler, Franck Gabriel

3:16

06/12/2021

Asymptotically Exact Error Characterization of Offline Policy Evaluation with Misspecified Linear Models

Kohei Miyaguchi

Keywords: reinforcement learning and planning

9:06

18/07/2021

Beyond Variance Reduction: Understanding the True Impact of Baselines on Policy Optimization

Wes Chung, Valentin Thomas, Marlos C. Machado, Nicolas Le Roux

Keywords: Reinforcement Learning and Planning

5:13

09/07/2020

Model-Based Reinforcement Learning with a Generative Model is Minimax Optimal

Alekh Agarwal, Sham Kakade, Lin Yang

Keywords: Reinforcement learning, Sampling algorithms

15:13

26/08/2020

Ordered SGD: A New Stochastic Optimization Framework for Empirical Risk Minimization

Kenji Kawaguchi, Haihao Lu

14:10

26/08/2020

Linear Convergence of Adaptive Stochastic Gradient Descent

Yuege Xie, Xiaoxia Wu, Rachel Ward

10:02

06/12/2021

Variance-Aware Off-Policy Evaluation with Linear Function Approximation

Yifei Min, Tianhao Wang, Dongruo Zhou, Quanquan Gu

Keywords: theory, reinforcement learning and planning

12:17

06/12/2021

Risk-averse Heteroscedastic Bayesian Optimization

Anastasia Makarova, Ilnura Usmanova, Ilija Bogunovic, Andreas Krause

Keywords: optimization, robustness, kernel methods

8:17

06/12/2020

Large-Scale Methods for Distributionally Robust Optimization

Daniel Levy, Yair Carmon, John Duchi, Aaron Sidford

3:11

02/02/2021

A Sample-Efficient Algorithm for Episodic Finite-Horizon MDP with Constraints

Krishna C. Kalagarla, Rahul Jain, Pierluigi Nuzzo

16:23

04/08/2021

Information-Theoretic Generalization Bounds for Stochastic Gradient Descent

Gergely Neu, Gintare Karolina Dziugaite, Mahdi Haghifam, Daniel M. Roy

18:01

12/07/2020

Discount Factor as a Regularizer in Reinforcement Learning

Ron Amit, Kamil Ciosek, Ron Meir

Keywords: Reinforcement Learning - General

14:46

03/05/2021

Tilted Empirical Risk Minimization

Tian Li, Ahmad Beirami, Maziar Sanjabi, Virginia Smith

Keywords: fairness, label noise robustness, models of learning and generalization, exponential tilting

5:11

18/07/2021

A Lower Bound for the Sample Complexity of Inverse Reinforcement Learning

Abi Komanduru, Jean Honorio

Keywords: Theory, Statistical Learning Theory

4:55

18/07/2021

Active Feature Acquisition with Generative Surrogate Models

Yang Li, Junier Oliva

Keywords: Deep Learning, Generative Models, Applications, Computational Biology and Bioinformatics, Reinforcement Learning and Planning, Deep RL

5:44

18/07/2021

Outside the Echo Chamber: Optimizing the Performative Risk

John Miller, Juan Perdomo, Tijana Zrnic

Keywords: Theory, Statistical Learning Theory

5:05