Reinforcement Learning Under Moral Uncertainty

18/07/2021

Adrien Ecoffet, Joel Lehman

Keywords: Social Aspects of Machine Learning, AI Safety

Abstract: An ambitious goal for machine learning is to create agents that behave ethically: The capacity to abide by human moral norms would greatly expand the context in which autonomous agents could be practically and safely deployed, e.g. fully autonomous vehicles will encounter charged moral decisions that complicate their deployment. While ethical agents could be trained by rewarding correct behavior under a specific moral theory (e.g. utilitarianism), there remains widespread disagreement about the nature of morality. Acknowledging such disagreement, recent work in moral philosophy proposes that ethical behavior requires acting under moral uncertainty, i.e. to take into account when acting that one's credence is split across several plausible ethical theories. This paper translates such insights to the field of reinforcement learning, proposes two training methods that realize different points among competing desiderata, and trains agents in simple environments to act under moral uncertainty. The results illustrate (1) how such uncertainty can help curb extreme behavior from commitment to single theories and (2) several technical complications arising from attempting to ground moral philosophy in RL (e.g. how can a principled trade-off between two competing but incomparable reward functions be reached). The aim is to catalyze progress towards morally-competent agents and highlight the potential of RL to contribute towards the computational grounding of moral philosophy.
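
The incomparability problem named in the abstract can be illustrated with a minimal sketch. This is not one of the paper's two training methods, which the abstract does not specify. It shows a naive credence-weighted sum of per-theory values, where the function name `credence_weighted_choice`, the two theories, and all numbers are hypothetical. Rescaling one theory's values leaves that theory's preferences unchanged but flips the chosen action, which suggests why a simple weighted sum is not a principled trade-off when theories lack a common unit.

```python
def credence_weighted_choice(values, credences):
    """Return the index of the action with the highest credence-weighted value.

    values[t][a] is the (hypothetical) value theory t assigns to action a;
    credences[t] is the credence placed in theory t.
    """
    n_actions = len(values[0])
    scores = [
        sum(c * theory_values[a] for c, theory_values in zip(credences, values))
        for a in range(n_actions)
    ]
    return max(range(n_actions), key=scores.__getitem__)


# Hypothetical setup: theory A prefers action 0, theory B prefers action 1.
values = [
    [1.0, 0.0],  # theory A
    [0.0, 1.0],  # theory B
]
credences = [0.6, 0.4]

choice_original = credence_weighted_choice(values, credences)

# Express theory B in units 10x larger: its ranking of actions is unchanged.
values_rescaled = [values[0], [10.0 * v for v in values[1]]]
choice_rescaled = credence_weighted_choice(values_rescaled, credences)

print(choice_original, choice_rescaled)
```

With these assumed numbers, the higher-credence theory A wins before rescaling and theory B wins after it, even though neither the credences nor either theory's ordering of the actions has changed.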

The talk and the corresponding paper were published at the ICML 2021 virtual conference.

Similar Papers

19/08/2021

Multi-Objective Reinforcement Learning for Designing Ethical Environments

Manel Rodriguez-Soto, Maite Lopez-Sanchez, Juan A. Rodriguez Aguilar

Keywords: AI Ethics, Trust, Fairness, Moral Decision Making, Reinforcement Learning

14:00

02/02/2021

SCRUPLES: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes

Nicholas Lourie, Ronan Le Bras, Yejin Choi

16:14

03/05/2021

Aligning AI With Shared Human Values

Dan Hendrycks, Collin Burns, Steven Basart, Andrew Critch, Jerry Li, Dawn Song, Jacob Steinhardt

Keywords: alignment, human preferences, value learning

5:38

02/02/2021

Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory

Ran Tian, Liting Sun, Masayoshi Tomizuka

16:28

26/04/2020

Intrinsic Motivation for Encouraging Synergistic Behavior

Rohan Chitnis, Shubham Tulsiani, Saurabh Gupta, Abhinav Gupta

Keywords: reinforcement learning, intrinsic motivation, synergistic, robot manipulation

5:02

06/12/2020

Bayesian Robust Optimization for Imitation Learning

Daniel Brown, Scott Niekum, Marek Petrik

3:06

06/12/2021

Safe Reinforcement Learning with Natural Language Constraints

Tsung-Yen Yang, Michael Y Hu, Yinlam Chow, Peter J Ramadge, Karthik Narasimhan

Keywords: reinforcement learning and planning

13:32

02/02/2021

WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning

Qisong Yang, Thiago D. Simão, Simon H Tindemans, Matthijs T. J. Spaan

17:28

18/07/2021

Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

Anuj Mahajan, Mikayel Samvelyan, Lei Mao, Viktor Makoviychuk, Animesh Garg, Jean Kossaifi, Shimon Whiteson, Yuke Zhu, Anima Anandkumar

Keywords: Reinforcement Learning and Planning, Multi-Agent RL

5:16

16/11/2020

Positive-Unlabeled Reward Learning

Danfei Xu, Misha Denil

5:04

12/09/2020

Dyadic Obligations over Complex Actions as Deontic Constraints in the Situation Calculus

Jens Claßen, James Delgrande

Keywords: Reasoning about actions and change, action languages-General, Nonmonotonic logics, default logics, conditional logics-General, Reasoning about knowledge, beliefs, and other mental attitudes-General

14:14

02/02/2021

Ethically Compliant Sequential Decision Making

Justin Svegliato, Samer B. Nashed, Shlomo Zilberstein

19:09

12/07/2020

Constrained Markov Decision Processes via Backward Value Functions

Harsh Satija, Philip Amortila, Joelle Pineau

Keywords: Reinforcement Learning - General

10:40

19/08/2021

Causal Learning for Socially Responsible AI

Lu Cheng, Ahmadreza Mosallanezhad, Paras Sheth, Huan Liu

Keywords: Humans and AI, General

15:29

08/12/2020

Exploring Morality in Argumentation

Jonathan Kobbe, Ines Rehbein, Ioana Hulpuș, Heiner Stuckenschmidt

14:59

06/12/2020

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Aaron Sonabend, Junwei Lu, Leo Anthony Celi, Tianxi Cai, Peter Szolovits

3:15

06/12/2021

Policy Learning Using Weak Supervision

Jingkang Wang, Hongyi Guo, Zhaowei Zhu, Yang Liu

Keywords: reinforcement learning and planning

13:57

02/02/2021

Ethical Dilemmas in Strategic Games

Pavel Naumov, Rui-Jie Yew

14:06

06/12/2021

An Axiomatic Theory of Provably-Fair Welfare-Centric Machine Learning

Cyrus Cousins

Keywords: theory, machine learning

14:57

03/05/2021

Regularized Inverse Reinforcement Learning

Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Keywords: reinforcement learning, regularized Markov decision processes, reward learning, inverse reinforcement learning

9:50

12/07/2020

Learning Fair Policies in Multi-Objective (Deep) Reinforcement Learning with Average and Discounted Rewards

Umer Siddique, Paul Weng, Matthieu Zimmer

Keywords: Reinforcement Learning - Deep RL

15:17

26/10/2020

Imitation Learning over Heterogeneous Agents with Restraining Bolts

Giuseppe De Giacomo, Marco Favorito, Luca Iocchi, Fabio Patrizi

Keywords: Restraining Bolts, Non-Markovian Rewards, Transfer Learning

7:50

09/07/2020

Pessimism About Unknown Unknowns Inspires Conservatism

Michael K Cohen, Marcus Hutter

Keywords: Reinforcement learning, Bayesian methods

15:02

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords: reinforcement learning and planning, robustness, representation learning

12:24

13/04/2021

Sample elicitation

Jiaheng Wei, Zuyue Fu, Yang Liu, Xingyu Li, Zhuoran Yang, Zhaoran Wang

3:16

06/12/2021

Risk-Aware Transfer in Reinforcement Learning using Successor Features

Michael Gimelfarb, Andre Barreto, Scott Sanner, Chi-Guhn Lee

Keywords: reinforcement learning and planning, representation learning, transfer learning

12:06

19/04/2021

Exploring supervised and unsupervised rewards in machine translation

Julia Ive, Zixu Wang, Marina Fomicheva, Lucia Specia

10:52

06/12/2021

Outcome-Driven Reinforcement Learning via Variational Inference

Tim G. J. Rudner, Vitchyr Pong, Rowan McAllister, Yarin Gal, Sergey Levine

Keywords: reinforcement learning and planning, generative model

12:21

03/05/2021

Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions

Zhengxian Lin, Kin-Ho Lam, Alan Fern

Keywords: Deep Reinforcement Learning, Explainable AI

14:19

06/12/2020

Inverse Rational Control with Partially Observable Continuous Nonlinear Dynamics

Minhae Kwon, Saurabh Daptardar, Paul R Schrater, Xaq Pitkow

3:40

02/02/2021

Right for Better Reasons: Training Differentiable Models by Constraining their Influence Functions

Xiaoting Shao, Arseny Skryagin, Wolfgang Stammer, Patrick Schramowski, Kristian Kersting

19:08

06/12/2021

Counterfactual Invariance to Spurious Correlations in Text Classification

Victor Veitch, Alexander D'Amour, Steve Yadlowsky, Jacob Eisenstein

Keywords: theory, machine learning, domain adaptation, causality

15:06

06/12/2021

Conservative Offline Distributional Reinforcement Learning

Yecheng Ma, Dinesh Jayaraman, Osbert Bastani

Keywords: reinforcement learning and planning

13:54

02/02/2021

Verifiable Machine Ethics in Changing Contexts

Louise A. Dennis, Martin Mose Bentzen, Felix Lindner, Michael Fisher

18:14

26/04/2020

Learning from Rules Generalizing Labeled Exemplars

Abhijeet Awasthi, Sabyasachi Ghosh, Rasna Goyal, Sunita Sarawagi

Keywords: Learning from Rules, Learning from limited labeled data, Weakly Supervised Learning

5:18

02/02/2021

Exploration by Maximizing Renyi Entropy for Reward-Free RL Framework

Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li

16:03

06/12/2020

End-to-End Learning and Intervention in Games

Jiayang Li, Jing Yu, Yu Nie, Zhaoran Wang

3:22

03/05/2021

Conservative Safety Critics for Exploration

Homanga Bharadhwaj, Aviral Kumar, Nicholas Rhinehart, Sergey Levine, Florian Shkurti, Animesh Garg

Keywords: Safe exploration, Reinforcement Learning

5:14

02/02/2021

Group Fairness by Probabilistic Modeling with Latent Fair Decisions

YooJung Choi, Meihua Dang, Guy Van den Broeck

19:30

06/12/2021

Counterexample Guided RL Policy Refinement Using Bayesian Optimization

Briti Gangopadhyay, Pallab Dasgupta

Keywords: optimization, reinforcement learning and planning

12:49