Multi-Objective Reinforcement Learning for Designing Ethical Environments

19/08/2021

Multi-Objective Reinforcement Learning for Designing Ethical Environments

Manel Rodriguez-Soto, Maite Lopez-Sanchez, Juan A. Rodriguez Aguilar

Keywords: AI Ethics, Trust, Fairness, Moral Decision Making, Reinforcement Learning

Abstract Paper Similar Papers

Abstract: AI research is being challenged with ensuring that autonomous agents learn to behave ethically, namely in alignment with moral values. A common approach, founded on the exploitation of Reinforcement Learning techniques, is to design environments that incentivise agents to behave ethically. However, to the best of our knowledge, current approaches do not theoretically guarantee that an agent will learn to behave ethically. Here, we make headway along this direction by proposing a novel way of designing environments wherein it is formally guaranteed that an agent learns to behave ethically while pursuing its individual objectives. Our theoretical results develop within the formal framework of Multi-Objective Reinforcement Learning to ease the handling of an agent's individual and ethical objectives. As a further contribution, we leverage on our theoretical results to introduce an algorithm that automates the design of ethical environments.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at IJCAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

18/07/2021

Reinforcement Learning Under Moral Uncertainty

Adrien Ecoffet, Joel Lehman

Keywords Paper

Social Aspects of Machine Learning, AI Safety

0

0

0

0

5:04

02/02/2021

SCRUPLES: A Corpus of Community Ethical Judgments on 32,000 Real-Life Anecdotes

Nicholas Lourie, Ronan Le Bras, Yejin Choi

Keywords Paper

0

0

0

0

16:14

03/05/2021

Aligning AI With Shared Human Values

Dan Hendrycks, Collin Burns, Steven Basart and
Andrew Critch, Jerry Li, Dawn Song, Jacob Steinhardt

Keywords Paper

alignment, human preferences, value learning

0

0

0

0

5:38

02/02/2021

Ethically Compliant Sequential Decision Making

Justin Svegliato, Samer B. Nashed, Shlomo Zilberstein

Keywords Paper

0

0

0

0

19:09

19/08/2021

Causal Learning for Socially Responsible AI

Lu Cheng, Ahmadreza Mosallanezhad, Paras Sheth, Huan Liu

Keywords Paper

Humans and AI, General, General, General, General

0

0

0

0

15:29

02/02/2021

Verifiable Machine Ethics in Changing Contexts

Louise A. Dennis, Martin Mose Bentzen, Felix Lindner, Michael Fisher

Keywords Paper

0

0

0

0

18:14

06/12/2021

Safe Reinforcement Learning with Natural Language Constraints

Tsung-Yen Yang, Michael Y Hu, Yinlam Chow and
Peter J Ramadge, Karthik Narasimhan

Keywords Paper

reinforcement learning and planning

0

0

0

0

13:32

06/12/2021

Outcome-Driven Reinforcement Learning via Variational Inference

Tim G. J. Rudner, Vitchyr Pong, Rowan McAllister and
Yarin Gal, Sergey Levine

Keywords Paper

reinforcement learning and planning, generative model

0

0

0

0

12:21

06/12/2020

Error Bounds of Imitating Policies and Environments

Tian Xu, Ziniu Li, Yang Yu

Keywords Paper

0

0

0

0

3:07

06/12/2021

Multi-Objective SPIBB: Seldonian Offline Policy Improvement with Safety Constraints in Finite MDPs

harsh satija, Philip S. Thomas, Joelle Pineau, Romain Laroche

Keywords Paper

reinforcement learning and planning

0

0

0

0

12:27

06/12/2021

Counterexample Guided RL Policy Refinement Using Bayesian Optimization

Briti Gangopadhyay, Pallab Dasgupta

Keywords Paper

optimization, reinforcement learning and planning

0

0

0

0

12:49

12/09/2020

Dyadic Obligations over Complex Actions as Deontic Constraints in the Situation Calculus

Jens Claßen, James Delgrande

Keywords Paper

Reasoning about actions and change, action languages-General, Nonmonotonic logics, default logics, conditional logics-General, Reasoning about knowledge, beliefs, and other mental attitudes-General

0

0

0

0

14:14

19/08/2021

Probabilistic Sufficient Explanations

Eric Wang, Pasha Khosravi, Guy Van den Broeck

Keywords Paper

Machine Learning, Explainable/Interpretable Machine Learning, Explainability, Exact Probabilistic Inference

0

0

0

0

12:13

06/12/2020

First Order Constrained Optimization in Policy Space

Yiming Zhang, Quan Vuong, Keith Ross

Keywords Paper

0

0

0

0

3:15

06/12/2020

Expert-Supervised Reinforcement Learning for Offline Policy Learning and Evaluation

Aaron Sonabend, Junwei Lu, Leo Anthony Celi and
Tianxi Cai, Peter Szolovits

Keywords Paper

0

0

0

0

3:15

12/07/2020

Constrained Markov Decision Processes via Backward Value Functions

Harsh Satija, Philip Amortila, Joelle Pineau

Keywords Paper

Reinforcement Learning - General

0

0

0

0

10:40

02/02/2021

Bounded Risk-Sensitive Markov Games: Forward Policy Design and Inverse Reward Learning with Iterative Reasoning and Cumulative Prospect Theory

Ran Tian, Liting Sun, Masayoshi Tomizuka

Keywords Paper

0

0

0

0

16:28

03/05/2021

Conservative Safety Critics for Exploration

Homanga Bharadhwaj, Aviral Kumar, Nicholas Rhinehart and
Sergey Levine, Florian Shkurti, Animesh Garg

Keywords Paper

Safe exploration, Reinforcement Learning

0

0

0

0

5:14

19/08/2021

Building Affordance Relations for Robotic Agents - A Review

Paola Ardón, Èric Pairet, Katrin S. Lohan and
Subramanian Ramamoorthy, Ron P. A. Petrick

Keywords Paper

Multidisciplinary topics and applications, General, General

0

0

0

0

11:26

06/12/2020

What Did You Think Would Happen? Explaining Agent Behaviour through Intended Outcomes

Herman Yau, Chris Russell, Simon Hadfield

Keywords Paper

0

0

0

0

3:15

18/07/2021

Interaction-Grounded Learning

Tengyang Xie, John Langford, Paul Mineiro, Ida Momennejad

Keywords Paper

Reinforcement Learning and Planning, Bandits

0

0

0

0

5:12

06/12/2021

On Learning Domain-Invariant Representations for Transfer Learning with Multiple Sources

Trung Phung, Trung Le, Tung-Long Vuong and
Toan Tran, Anh Tran, Hung Bui, Dinh Phung

Keywords Paper

theory, deep learning, machine learning, domain adaptation, transfer learning

0

0

0

0

10:09

03/05/2021

Regularized Inverse Reinforcement Learning

Wonseok Jeon, Chen-Yang Su, Paul Barde and
Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Keywords Paper

reinforcement learning, regularized markov decision processes, reward learning, inverse reinforcement learning

0

0

0

0

9:50

02/02/2021

Right for Better Reasons: Training Differentiable Models by Constraining their Influence Functions

Xiaoting Shao, Arseny Skryagin, Wolfgang Stammer and
Patrick Schramowski, Kristian Kersting

Keywords Paper

0

0

0

0

19:08

13/04/2021

Sample elicitation

Jiaheng Wei, Zuyue Fu, Yang Liu and
Xingyu Li, Zhuoran Yang, Zhaoran Wang

Keywords Paper

0

0

0

0

3:16

03/05/2021

Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions

Zhengxian Lin, Kin-Ho Lam, Alan Fern

Keywords Paper

Deep Reinforcement Learning, Explainable AI

0

0

0

0

14:19

14/06/2020

A Programmatic and Semantic Approach to Explaining and Debugging Neural Network Based Object Detectors

Edward Kim, Divya Gopinath, Corina Păsăreanu, Sanjit A. Seshia

Keywords Paper

population-level explanation, testing, perception, neural network, blackbox, scenario, object detection, machine learning, autonomous driving

0

0

0

0

4:58

02/02/2021

WCSAC: Worst-Case Soft Actor Critic for Safety-Constrained Reinforcement Learning

Qisong Yang, Thiago D. Simão, Simon H Tindemans, Matthijs T. J. Spaan

Keywords Paper

0

0

0

0

17:28

22/09/2020

Making neural networks interpretable with attribution: Application to implicit signals prediction

Darius Afchar, Romain Hennequin

Keywords Paper

Implicit Recommender System, Interpretable machine learning

0

0

0

0

2:28

19/08/2021

Identifying Norms from Observation Using MCMC Sampling

Stephen Cranefield, Ashish Dhiman

Keywords Paper

Agent-based and Multi-agent Systems, Normative systems, Agent Societies, Bayesian Learning

0

0

0

0

14:44

13/04/2021

Logical team q-learning: An approach towards factored policies in cooperative MARL

Lucas Cassano, Ali H. Sayed

Keywords Paper

0

0

0

0

3:15

19/08/2021

Finite-Trace and Generalized-Reactivity Specifications in Temporal Synthesis

Giuseppe De Giacomo, Antonio Di Stasio, Lucas M. Tabajara and
Moshe Vardi, Shufang Zhu

Keywords Paper

Knowledge Representation and Reasoning, Action, Change and Causality, Theoretical Foundations of Planning, Formal Verification, Validation and Synthesis

0

0

0

0

13:42

06/12/2020

Automatic Curriculum Learning through Value Disagreement

Yunzhi Zhang, Pieter Abbeel, Lerrel Pinto

Keywords Paper

0

0

0

0

3:17

02/02/2021

A Unified Taylor Framework for Revisiting Attribution Methods

Huiqi Deng, Na Zou, Mengnan Du and
Weifu Chen, Guocan Feng, Xia Hu

Keywords Paper

0

0

0

0

16:18

06/12/2021

Risk-Aware Transfer in Reinforcement Learning using Successor Features

Michael Gimelfarb, Andre Barreto, Scott Sanner, Chi-Guhn Lee

Keywords Paper

reinforcement learning and planning, representation learning, transfer learning

0

0

0

0

12:06

02/02/2021

Unifying Principles and Metrics for Safe and Assistive AI

Siddharth Srivastava

Keywords Paper

0

0

0

0

13:52

30/11/2020

Exploiting Transferable Knowledge for Fairness-aware Image Classification

sunhee hwang, Sungho Park, Pilhyeon Lee and
seogkyu jeon, Dohyung Kim, Hyeran Byun

Keywords Paper

0

0

0

0

5:56

04/07/2020

Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations

Oana-Maria Camburu, Brendan Shillingford, Pasquale Minervini and
Thomas Lukasiewicz, Phil Blunsom

Keywords Paper

Adversarial Explanations, artificial systems, generation explanations, sanity models

0

0

0

0

6:54

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy and
Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords Paper

Reinforcement Learning and Planning, Deep RL

0

0

0

0

5:16

06/12/2021

Safe Reinforcement Learning by Imagining the Near Future

Garrett Thomas, Yuping Luo, Tengyu Ma

Keywords Paper

reinforcement learning and planning

2

1

0

0

6:50