Reward-rational (implicit) choice: A unifying formalism for reward learning

06/12/2020

Hong Jun Jeon, Smitha Milli, Anca Dragan

Abstract: It is often difficult to hand-specify what the correct reward function is for a task, so researchers have instead aimed to learn reward functions from human behavior or feedback. The types of behavior interpreted as evidence of the reward function have expanded greatly in recent years. We've gone from demonstrations, to comparisons, to reading into the information leaked when the human is pushing the robot away or turning it off. And surely, there is more to come. How will a robot make sense of all these diverse types of behavior? Our key observation is that different types of behavior can be interpreted in a single unifying formalism: as a reward-rational choice that the human is making, often implicitly. We use this formalism to survey prior work through a unifying lens, and discuss its potential use as a recipe for interpreting new sources of information that are yet to be uncovered.

The talk and the corresponding paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

26/04/2020

Meta-learning curiosity algorithms

Ferran Alet, Martin F. Schneider, Tomas Lozano-Perez, Leslie Pack Kaelbling

Keywords: meta-learning, exploration, curiosity

5:04

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords: Reinforcement Learning and Planning, Deep RL

5:16

18/07/2021

PEBBLE: Feedback-Efficient Interactive Reinforcement Learning via Relabeling Experience and Unsupervised Pre-training

Kimin Lee, Laura Smith, Pieter Abbeel

Keywords: Reinforcement Learning and Planning, Deep RL

15:02

16/11/2020

The EMPATHIC Framework for Task Learning from Implicit Human Feedback

Yuchen Cui, Qiping Zhang, Brad Knox, Alessandro Allievi, Peter Stone, Scott Niekum

5:11

06/12/2020

Learning to Incentivize Other Learning Agents

Jiachen Yang, Ang Li, Mehrdad Farajtabar, Peter Sunehag, Edward Hughes, Hongyuan Zha

3:20

19/08/2021

Building Affordance Relations for Robotic Agents - A Review

Paola Ardón, Èric Pairet, Katrin S. Lohan, Subramanian Ramamoorthy, Ron P. A. Petrick

Keywords: Multidisciplinary topics and applications, General, General

11:26

03/05/2021

Learning What To Do by Simulating the Past

David Lindner, Rohin Shah, Pieter Abbeel, Anca Dragan

Keywords: reinforcement learning, imitation learning, reward learning

5:09

19/08/2021

A Unifying Bayesian Formulation of Measures of Interpretability in Human-AI Interaction

Sarath Sreedharan, Anagha Kulkarni, David Smith, Subbarao Kambhampati

Keywords: Humans and AI, General, General

15:12

06/12/2020

Discovering Reinforcement Learning Algorithms

Junhyuk Oh, Matteo Hessel, Wojciech Czarnecki, Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver

3:21

03/05/2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou, Albert Yu, Nicholas Rhinehart, Sergey Levine

Keywords: reinforcement learning, imitation learning

14:21

06/12/2021

Discovery of Options via Meta-Learned Subgoals

Vivek Veeriah, Tom Zahavy, Matteo Hessel, Zhongwen Xu, Junhyuk Oh, Iurii Kemaev, Hado van Hasselt, David Silver, Satinder Singh

Keywords: deep learning, reinforcement learning and planning

4:13

06/12/2021

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, Joseph Lim

Keywords: deep learning, reinforcement learning and planning

10:30

16/11/2020

f-IRL: Inverse Reinforcement Learning via State Marginal Matching

Tianwei Ni, Harshit Sikchi, Yufei Wang, Tejus Gupta, Lisa Lee, Ben Eysenbach

5:07

16/11/2020

Transformers for One-Shot Visual Imitation

Sudeep Dasari, Abhinav Gupta

5:06

12/07/2020

What Can Learned Intrinsic Rewards Capture?

Zeyu Zheng, Junhyuk Oh, Matteo Hessel, Zhongwen Xu, Manuel Kroiss, Hado van Hasselt, David Silver, Satinder Singh

Keywords: Reinforcement Learning - Deep RL

14:47

26/04/2020

Intrinsic Motivation for Encouraging Synergistic Behavior

Rohan Chitnis, Shubham Tulsiani, Saurabh Gupta, Abhinav Gupta

Keywords: reinforcement learning, intrinsic motivation, synergistic, robot manipulation

5:02

03/05/2021

Plan-Based Relaxed Reward Shaping for Goal-Directed Tasks

Ingmar Schubert, Oz Oguz, Marc Toussaint

Keywords: reinforcement learning, robotics, robotic manipulation, plan-based reward shaping, reward shaping

4:38

16/11/2020

Positive-Unlabeled Reward Learning

Danfei Xu, Misha Denil

5:04

12/07/2020

Active World Model Learning in Agent-rich Environments with Progress Curiosity

Kuno Kim, Megumi Sano, Julian De Freitas, Nick Haber, Daniel Yamins

Keywords: Applications - Neuroscience, Cognitive Science, Biology and Health

15:25

18/07/2021

Exploration in Approximate Hyper-State Space for Meta Reinforcement Learning

Luisa Zintgraf, Leo Feng, Cong Lu, Max Igl, Kristian Hartikainen, Katja Hofmann, Shimon Whiteson

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

5:52

26/04/2020

RIDE: Rewarding Impact-Driven Exploration for Procedurally-Generated Environments

Roberta Raileanu, Tim Rocktäschel

Keywords: reinforcement learning, exploration, curiosity

4:48

26/04/2020

Watch, Try, Learn: Meta-Learning from Demonstrations and Rewards

Allan Zhou, Eric Jang, Daniel Kappler, Alex Herzog, Mohi Khansari, Paul Wohlhart, Yunfei Bai, Mrinal Kalakrishnan, Sergey Levine, Chelsea Finn

Keywords: meta-learning, reinforcement learning, imitation learning

4:34

06/12/2020

Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

Sreejith Balakrishnan, Quoc Phong Nguyen, Bryan Kian Hsiang Low, Harold Soh

3:22

06/12/2021

Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees

Gregory Dexter, Kevin Bello, Jean Honorio

Keywords: theory, reinforcement learning and planning

14:49

03/05/2021

Domain-Robust Visual Imitation Learning with Mutual Information Constraints

Edoardo Cetin, Oya Celiktutan

Keywords: Domain Adaptation, Third-Person Imitation, Observational Imitation, Reinforcement Learning, Machine Learning, Mutual Information, Imitation Learning

4:51

19/08/2021

Inter-Task Similarity for Lifelong Reinforcement Learning in Heterogeneous Tasks

Sergio A. Serrano

Keywords: Machine Learning, Transfer, Adaptation, Multi-task Learning, Reinforcement Learning, Incremental Learning, Learning in Robotics

11:02

16/11/2020

MELD: Meta-Reinforcement Learning from Images via Latent State Models

Zihao Zhao, Anusha Nagabandi, Kate Rakelly, Chelsea Finn, Sergey Levine

5:06

08/12/2020

Improving Conversational Question Answering Systems after Deployment using Feedback-Weighted Learning

Jon Ander Campos, Kyunghyun Cho, Arantxa Otegi, Aitor Soroa, Eneko Agirre, Gorka Azkune

11:07

06/12/2020

Inverse Reinforcement Learning from a Gradient-based Learner

Giorgia Ramponi, Gianluca Drappo, Marcello Restelli

2:42

02/02/2021

Successor Feature Sets: Generalizing Successor Representations Across Policies

Kianté Brantley, Soroush Mehri, Geoff J. Gordon

17:43

02/02/2021

Synthesis of Search Heuristics for Temporal Planning via Reinforcement Learning

Andrea Micheli, Alessandro Valentini

19:00

02/02/2021

Dynamic Automaton-Guided Reward Shaping for Monte Carlo Tree Search

Alvaro Velasquez, Brett Bissey, Lior Barak, Andre Beckus, Ismail Alkhouri, Daniel Melcer, George Atia

18:52

19/08/2021

Identifying Norms from Observation Using MCMC Sampling

Stephen Cranefield, Ashish Dhiman

Keywords: Agent-based and Multi-agent Systems, Normative systems, Agent Societies, Bayesian Learning

14:44

06/12/2020

What Did You Think Would Happen? Explaining Agent Behaviour through Intended Outcomes

Herman Yau, Chris Russell, Simon Hadfield

3:15

18/07/2021

Demonstration-Conditioned Reinforcement Learning for Few-Shot Imitation

Christopher Dance, Perez Julien, Théo Cachet

Keywords: Reinforcement Learning and Planning, Planning and Control

5:13

26/10/2020

Imitation Learning over Heterogeneous Agents with Restraining Bolts

Giuseppe De Giacomo, Marco Favorito, Luca Iocchi, Fabio Patrizi

Keywords: Restraining Bolts, Non-Markovian Rewards, Transfer Learning

7:50

19/08/2021

Inferring Time-delayed Causal Relations in POMDPs from the Principle of Independence of Cause and Mechanism

Junchi Liang, Abdeslam Boularias

Keywords: Knowledge Representation and Reasoning, Action, Change and Causality, Cognitive Robotics

13:50

06/12/2021

Accelerating Robotic Reinforcement Learning via Parameterized Action Primitives

Murtaza Dalal, Deepak Pathak, Russ Salakhutdinov

Keywords: optimization, reinforcement learning and planning

10:01

18/07/2021

Targeted Data Acquisition for Evolving Negotiation Agents

Minae Kwon, Sidd Karamcheti, Mariano-Florentino Cuellar, Dorsa Sadigh

Keywords: Reinforcement Learning and Planning, Multi-Agent RL

5:15

18/07/2021

Goal-Conditioned Reinforcement Learning with Imagined Subgoals

Elliot Chane-Sane, Cordelia Schmid, Ivan Laptev

Keywords: Reinforcement Learning and Planning, Deep RL

4:57