Characterizing Optimal Mixed Policies: Where to Intervene and What to Observe

06/12/2020

Characterizing Optimal Mixed Policies: Where to Intervene and What to Observe

Sanghack Lee, Elias Bareinboim

Keywords:

Abstract Paper Similar Papers

Abstract: Intelligent agents are continuously faced with the challenge of optimizing a policy based on what they can observe (see) and which actions they can take (do) in the environment where they are deployed. Most policy can be parametrized in terms of these two dimensions, i.e., as a function of what can be seen and done given a certain situation, which we call a \textit{mixed policy}. In this paper, we investigate several properties of the class of mixed policies and provide an efficient and effective characterization, including optimality and non-redundancy. Specifically, we introduce a graphical criterion to identify unnecessary contexts for a set of actions, leading to a natural characterization of non-redundancy of mixed policies. We then derive sufficient conditions under which one strategy can dominate the other with respect to their maximum achievable expected rewards (optimality). This characterization leads to a fundamental understanding of the space of mixed policies and a possible refinement of the agent's strategy so that it converges to the optimum faster and more robustly. One surprising result of the causal characterization is that the agent following a more standard approach --- intervening on all intervenable variables and observing all available contexts --- may be hurting itself, and will never achieve an optimal performance.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Alternative Microfoundations for Strategic Classification

Meena Jagadeesan, Celestine Mendler-Dünner, Moritz Hardt

Keywords Paper

Theory, Game Theory and Computational Economics

0

0

0

0

5:18

02/02/2021

Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory

Ran Tian, Liting Sun, Masayoshi Tomizuka

Keywords Paper

0

0

0

0

16:28

06/12/2020

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Aaron Sonabend, Junwei Lu, Leo Anthony Celi and
Tianxi Cai, Peter Szolovits

Keywords Paper

0

0

0

0

3:15

06/12/2021

Automated Dynamic Mechanism Design

Hanrui Zhang, Vincent Conitzer

Keywords Paper

0

0

0

0

14:35

08/07/2020

Obviously Strategyproof Single-Minded Combinatorial Auctions

Bart de Keijzer, Maria Kyropoulou, Carmine Ventre

Keywords Paper

OSP Mechanisms, Extensive-form Mechanisms, Single-minded Combinatorial Auctions, Greedy algorithms

0

0

0

0

24:42

18/07/2021

OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation

Jongmin Lee, Wonseok Jeon, Byung-Jun Lee and
Joelle Pineau, Kee-Eung Kim

Keywords Paper

Reinforcement Learning and Planning

1

0

0

1

5:15

03/05/2021

What are the Statistical Limits of Offline RL with Linear Function Approximation?

Ruosong Wang, Dean Foster, Sham M Kakade

Keywords Paper

batch reinforcement learning, representation, function approximation, lower bound

0

0

0

0

9:02

13/04/2021

Linear models are robust optimal under strategic behavior

Wei Tang, Chien-Ju Ho, Yang Liu

Keywords Paper

0

0

0

0

3:32

18/07/2021

Off-Belief Learning

Hengyuan Hu, Adam Lerer, Brandon Cui and
Luis Pineda, Noam Brown, Jakob Foerster

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:10

02/02/2021

On Fair Division under Heterogeneous Matroid Constraints

Amitay Dror, Michal Feldman, Erel Segal-Halevi

Keywords Paper

0

0

0

0

15:26

06/12/2020

High-Dimensional Contextual Policy Search with Unknown Context Rewards using Bayesian Optimization

Qing Feng , Ben Letham, Hongzi Mao, Eytan Bakshy

Keywords Paper

0

0

0

0

3:29

18/07/2021

Progressive-Scale Boundary Blackbox Attack via Projective Gradient Estimation

Jiawei Zhang, Linyi Li, Huichen Li and
Xiaolu Zhang, Shuang Yang, Bo Li

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

0

6:36

06/12/2021

Neural Algorithmic Reasoners are Implicit Planners

Andreea-Ioana Deac, Petar Veličković, Ognjen Milinkovic and
Pierre-Luc Bacon, Jian Tang, Mladen Nikolic

Keywords Paper

deep learning, reinforcement learning and planning, self-supervised learning, generative model, graph learning

0

0

0

0

13:10

12/07/2020

Performative Prediction

Juan Perdomo, Tijana Zrnic, Celestine Mendler-Dünner, University of California Moritz Hardt

Keywords Paper

Learning Theory

0

0

0

0

11:22

06/12/2021

Learning to Predict Trustworthiness with Steep Slope Loss

Yan Luo, Yongkang Wong, Mohan Kankanhalli, Qi Zhao

Keywords Paper

deep learning, machine learning, transformers

0

0

0

0

12:22

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords Paper

reinforcement learning and planning, robustness, representation learning

0

0

0

0

12:24

06/12/2021

Offline Constrained Multi-Objective Reinforcement Learning via Pessimistic Dual Value Iteration

Runzhe Wu, Yufeng Zhang, Zhuoran Yang, Zhaoran Wang

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:40

06/12/2020

Robust Multi-Agent Reinforcement Learning with Model Uncertainty

Kaiqing Zhang, TAO SUN, Yunzhe Tao and
Sahika Genc, Sunil Mallya, Tamer Basar

Keywords Paper

0

0

0

0

3:11

18/07/2021

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

Anuj Mahajan, Mikayel Samvelyan, Lei Mao and
Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Anima Anandkumar

Keywords Paper

Reinforcement Learning and Planning, Multi-Agent RL

0

0

0

0

5:16

06/12/2021

Control Variates for Slate Off-Policy Evaluation

Nikos Vlassis, Ashok Chandrashekar, Fernando Amat, Nathan Kallus

Keywords Paper

optimization, bandits

0

0

0

0

12:25

26/08/2020

Value Preserving State-Action Abstractions

David Abel, Nate Umbanhowar, Khimya Khetarpal and
Dilip Arumugam, Doina Precup, Michael Littman

Keywords Paper

0

0

0

0

14:52

18/07/2021

Provably Correct Optimization and Exploration with Non-linear Policies

Fei Feng, Wotao Yin, Alekh Agarwal, Lin Yang

Keywords Paper

Deep Learning, Adversarial Networks, Applications, Fairness, Accountability, and Transparency, Theory, RL, Decisions and Control Theory

0

0

0

0

5:03

02/02/2021

Present-Biased Optimization

Fedor V. Fomin, Pierre Fraigniaud, Petr A. Golovach

Keywords Paper

0

0

0

0

19:38

06/12/2021

When Is Generalizable Reinforcement Learning Tractable?

Dhruv Malik, Yuanzhi Li, Pradeep Ravikumar

Keywords Paper

reinforcement learning and planning, generative model, representation learning

0

0

0

0

12:38

13/04/2021

Provably eﬃcient actor-critic for risk-sensitive and robust adversarial RL: A linear-quadratic case

Yufeng Zhang, Zhuoran Yang, Zhaoran Wang

Keywords Paper

0

0

0

0

2:53

06/12/2021

Robust Allocations with Diversity Constraints

Zeyu Shen, Lodewijk Gelauff, Ashish Goel and
Aleksandra Korolova, Kamesh Munagala

Keywords Paper

robustness, graph learning, fairness

0

0

0

0

14:39

18/07/2021

Detecting Rewards Deterioration in Episodic Reinforcement Learning

Ido Greenberg, Shie Mannor

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:25

06/12/2020

Bayesian Robust Optimization for Imitation Learning

Daniel Brown, Scott Niekum, Marek Petrik

Keywords Paper

0

0

0

0

3:06

06/12/2020

Reinforcement Learning for Control with Multiple Frequencies

Jongmin Lee, Byung-Jun Lee, Kee-Eung Kim

Keywords Paper

Algorithms -> Multitask and Transfer Learning; Deep Learning -> Supervised Deep Networks; Theory -> Learning Theory; Theory -> , Deep Learning

0

0

0

0

3:21

06/12/2021

Towards Calibrated Model for Long-Tailed Visual Recognition from Prior Perspective

Zhengzhuo Xu, Zenghao Chai, Chun Yuan

Keywords Paper

theory, machine learning

0

0

0

0

4:23

18/07/2021

Strategic Classification in the Dark

Ganesh Ghalme, Vineet Nair, Itay Eilat and
Inbal Talgam-Cohen, Nir Rosenfeld

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

5:08

06/12/2020

Decisions, Counterfactual Explanations and Strategic Behavior

Stratis Tsirtsis, Manuel Gomez Rodriguez

Keywords Paper

0

0

0

0

3:24

12/07/2020

Learning From Strategic Agents: Accuracy, Improvement, and Causality

Yonadav Shavit, Benjamin Edelman, Brian Axelrod

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

12:37

12/07/2020

Adversarial Risk via Optimal Transport and Optimal Couplings

Muni Sreenivas Pydi, Varun Jog

Keywords Paper

Adversarial Examples

0

0

0

0

12:34

06/12/2021

Fair Classification with Adversarial Perturbations

L. Elisa Celis, Anay Mehrotra, Nisheeth Vishnoi

Keywords Paper

optimization, machine learning, fairness

0

0

0

0

15:03

06/12/2020

Consequences of Misaligned AI

Simon Zhuang, Dylan Hadfield-Menell

Keywords Paper

0

0

0

0

3:13

12/07/2020

Class-Weighted Classification: Trade-offs and Robust Approaches

Ziyu Xu, Chen Dan, Justin Khim, Pradeep Ravikumar

Keywords Paper

Learning Theory

0

0

0

0

11:49

06/12/2021

Reliable Post hoc Explanations: Modeling Uncertainty in Explainability

Dylan Slack, Anna Hilgard, Sameer Singh, Himabindu Lakkaraju

Keywords Paper

robustness, interpretability

0

0

0

0

15:06

06/12/2020

Provably Good Batch Off-Policy Reinforcement Learning Without Great Exploration

Yao Liu, Adith Swaminathan, Alekh Agarwal, Emma Brunskill

Keywords Paper

0

0

0

0

3:17

06/12/2021

Multi-Label Learning with Pairwise Relevance Ordering

Ming-Kun Xie, Sheng-Jun Huang

Keywords Paper

machine learning

0

0

0

0

3:56