Two steps to risk sensitivity

06/12/2021

Two steps to risk sensitivity

Christopher Gagne, Peter Dayan

Keywords: theory, reinforcement learning and planning

Abstract Paper Similar Papers

Abstract: Distributional reinforcement learning (RL) – in which agents learn about all the possible long-term consequences of their actions, and not just the expected value – is of great recent interest. One of the most important affordances of a distributional view is facilitating a modern, measured, approach to risk when outcomes are not completely certain. By contrast, psychological and neuroscientific investigations into decision making under risk have utilized a variety of more venerable theoretical models such as prospect theory that lack axiomatically desirable properties such as coherence. Here, we consider a particularly relevant risk measure for modeling human and animal planning, called conditional value-at-risk (CVaR), which quantifies worst-case outcomes (e.g., vehicle accidents or predation). We first adopt a conventional distributional approach to CVaR in a sequential setting and reanalyze the choices of human decision-makers in the well-known two-step task, revealing substantial risk aversion that had been lurking under stickiness and perseveration. We then consider a further critical property of risk sensitivity, namely time consistency, showing alternatives to this form of CVaR that enjoy this desirable characteristic. We use simulations to examine settings in which the various forms differ in ways that have implications for human and animal planning and behavior.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Detecting Rewards Deterioration in Episodic Reinforcement Learning

Ido Greenberg, Shie Mannor

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:25

06/12/2021

RMIX: Learning Risk-Sensitive Policies forCooperative Reinforcement Learning Agents

Wei Qiu, Xinrun Wang, Runsheng Yu and
Rundong Wang, Xu He, Bo An, Svetlana Obraztsova, Zinovi Rabinovich

Keywords Paper

reinforcement learning and planning

0

0

0

0

3:17

18/07/2021

Temporal Predictive Coding For Model-Based Planning In Latent Space

Tung Nguyen, Rui Shu, Tuan Pham and
Hung Bui, Stefano Ermon

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

5:19

12/07/2020

Performative Prediction

Juan Perdomo, Tijana Zrnic, Celestine Mendler-Dünner, University of California Moritz Hardt

Keywords Paper

Learning Theory

0

0

0

0

11:22

18/07/2021

Out-of-Distribution Generalization via Risk Extrapolation (REx)

David Krueger, Ethan Caballero, Jörn Jacobsen and
Amy Zhang, Jonathan Binas, Dinghuai Zhang, Remi Le Priol, Aaron Courville

Keywords Paper

Deep Learning

0

0

0

0

18:07

06/12/2020

Learning Robust Decision Policies from Observational Data

Muhammad Osama, Dave Zachariah, Peter Stoica

Keywords Paper

0

0

0

0

3:26

02/02/2021

Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory

Ran Tian, Liting Sun, Masayoshi Tomizuka

Keywords Paper

0

0

0

0

16:28

06/12/2020

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Aaron Sonabend, Junwei Lu, Leo Anthony Celi and
Tianxi Cai, Peter Szolovits

Keywords Paper

0

0

0

0

3:15

05/01/2021

Misclassification Risk and Uncertainty Quantification in Deep Classifiers

Murat Sensoy, Maryam Saleki, Simon Julier and
Reyhan Aydogan, John Reid

Keywords Paper

0

0

0

0

4:20

06/12/2021

Conservative Offline Distributional Reinforcement Learning

Yecheng Ma, Dinesh Jayaraman, Osbert Bastani

Keywords Paper

reinforcement learning and planning

1

0

0

0

13:54

06/12/2021

Uncertain Decisions Facilitate Better Preference Learning

Cassidy Laidlaw, Stuart Russell

Keywords Paper

theory, reinforcement learning and planning

0

0

0

0

15:03

06/12/2021

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Alicia Curth, Changhee Lee, Mihaela van der Schaar

Keywords Paper

deep learning, machine learning, domain adaptation, causality

0

0

0

0

13:43

06/12/2021

Collaborative Uncertainty in Multi-Agent Trajectory Forecasting

Bohan Tang, Yiqi Zhong, Ulrich Neumann and
Gang Wang, Siheng Chen, Ya Zhang

Keywords Paper

deep learning

0

0

0

0

7:15

26/04/2020

Diverse Trajectory Forecasting with Determinantal Point Processes

Ye Yuan, Kris M. Kitani

Keywords Paper

Diverse Inference, Generative Models, Trajectory Forecasting

0

0

0

0

5:07

19/08/2021

Hindsight Value Function for Variance Reduction in Stochastic Dynamic Environment

Jiaming Guo, Rui Zhang, Xishan Zhang and
Shaohui Peng, Qi Yi, Zidong Du, Xing Hu, Qi Guo, Yunji Chen

Keywords Paper

Machine Learning, Deep Learning, Deep Reinforcement Learning, Sequential Decision Making

0

0

0

0

14:36

06/12/2020

Towards Safe Policy Improvement for Non-Stationary MDPs

Yash Chandak, Scott Jordan, Georgios Theocharous and
Martha White, Philip Thomas

Keywords Paper

Applications -> Computer Vision; Deep Learning -> Attention Models, Deep Learning

0

0

0

0

3:13

06/12/2021

Continuous Mean-Covariance Bandits

Yihan Du, Siwei Wang, Zhixuan Fang, Longbo Huang

Keywords Paper

bandits

0

0

0

0

11:33

02/02/2021

GaussianPath:A Bayesian Multi-Hop Reasoning Framework for Knowledge Graph Reasoning

Guojia Wan, Bo Du

Keywords Paper

0

0

0

0

13:52

02/02/2021

Present-Biased Optimization

Fedor V. Fomin, Pierre Fraigniaud, Petr A. Golovach

Keywords Paper

0

0

0

0

19:38

18/07/2021

Off-Belief Learning

Hengyuan Hu, Adam Lerer, Brandon Cui and
Luis Pineda, Noam Brown, Jakob Foerster

Keywords Paper

Reinforcement Learning and Planning

0

0

0

0

5:10

06/12/2021

Counterexample Guided RL Policy Refinement Using Bayesian Optimization

Briti Gangopadhyay, Pallab Dasgupta

Keywords Paper

optimization, reinforcement learning and planning

0

0

0

0

12:49

02/02/2021

Apparently Irrational Choice as Optimal Sequential Decision Making

Haiyang Chen, Hyung Jin Chang, Andrew Howes

Keywords Paper

0

0

0

0

15:59

06/12/2021

Control Variates for Slate Off-Policy Evaluation

Nikos Vlassis, Ashok Chandrashekar, Fernando Amat, Nathan Kallus

Keywords Paper

optimization, bandits

0

0

0

0

12:25

12/07/2020

Learning From Strategic Agents: Accuracy, Improvement, and Causality

Yonadav Shavit, Benjamin Edelman, Brian Axelrod

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

12:37

02/02/2021

Foresee then Evaluate: Decomposing Value Estimation with Latent Future Prediction

Hongyao Tang, Zhaopeng Meng, Guangyong Chen and
Pengfei Chen, Chen Chen, Yaodong Yang, Luo Zhang, Wulong Liu, Jianye Hao

Keywords Paper

0

0

0

0

18:44

06/12/2021

Inverse Optimal Control Adapted to the Noise Characteristics of the Human Sensorimotor System

Matthias Schultheis, Dominik Straub, Constantin Rothkopf

Keywords Paper

0

0

0

0

9:29

18/07/2021

A Regret Minimization Approach to Iterative Learning Control

Naman Agarwal, Elad Hazan, Anirudha Majumdar, Karan Singh

Keywords Paper

Reinforcement Learning and Planning, Planning and Control

0

0

0

0

5:13

03/05/2021

Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions

Zhengxian Lin, Kin-Ho Lam, Alan Fern

Keywords Paper

Deep Reinforcement Learning, Explainable AI

0

0

0

0

14:19

14/06/2020

Imitative Non-Autoregressive Modeling for Trajectory Forecasting and Imputation

Mengshi Qi, Jie Qin, Yu Wu, Yi Yang

Keywords Paper

trajectory forecasting, sequence imputation, non-autoregressive modele, imitation learning, autoencoder, recurrent neural network

0

0

0

0

1:00

14/06/2020

A Stochastic Conditioning Scheme for Diverse Human Motion Prediction

Sadegh Aliakbarian, Fatemeh Sadat Saleh, Mathieu Salzmann and
Lars Petersson, Stephen Gould

Keywords Paper

human motion prediction, variational autoencoders, generative models, output diversity, stochastic motion prediction

0

0

0

0

1:00

18/07/2021

Outside the Echo Chamber: Optimizing the Performative Risk

John Miller, Juan Perdomo, Tijana Zrnic

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:05

12/07/2020

Cautious Adaptation For Reinforcement Learning in Safety-Critical Settings

Jesse Zhang, Brian Cheung, Chelsea Finn and
Sergey Levine, Dinesh Jayaraman

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

14:54

03/05/2021

Uncertainty Estimation and Calibration with Finite-State Probabilistic RNNs

Cheng Wang, Carolin Lawrence, Mathias Niepert

Keywords Paper

calibration, uncertainty estimation, RNN

0

0

0

0

4:25

13/04/2021

Linear models are robust optimal under strategic behavior

Wei Tang, Chien-Ju Ho, Yang Liu

Keywords Paper

0

0

0

0

3:32

12/07/2020

What can I do here? A Theory of Affordances in Reinforcement Learning

Khimya Khetarpal, Zafarali Ahmed, Gheorghe Comanici and
David Abel, Doina Precup

Keywords Paper

Reinforcement Learning - General

0

0

0

0

12:55

12/07/2020

Selective Dyna-style Planning Under Limited Model Capacity

Zaheer SM, Samuel Sokota, Erin Talvitie, Martha White

Keywords Paper

Reinforcement Learning - General

0

0

0

0

15:00

03/05/2021

Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

Mohak Bhardwaj, Sanjiban Choudhury, Byron Boots

Keywords Paper

reinforcement learning, model-predictive control

0

0

0

0

5:09

06/12/2021

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

Nathan Grinsztajn, Johan Ferret, Olivier Pietquin and
philippe preux, Matthieu Geist

Keywords Paper

reinforcement learning and planning

0

0

0

0

14:31

06/12/2020

Evidential Sparsification of Multimodal Latent Spaces in Conditional Variational Autoencoders

Masha Itkina, Boris Ivanovic, Ransalu Senanayake and
Mykel J Kochenderfer, Marco Pavone

Keywords Paper

0

0

0

0

3:39

12/07/2020

Class-Weighted Classification: Trade-offs and Robust Approaches

Ziyu Xu, Chen Dan, Justin Khim, Pradeep Ravikumar

Keywords Paper

Learning Theory

0

0

0

0

11:49