Identifiability in inverse reinforcement learning

Abstract: Inverse reinforcement learning attempts to reconstruct the reward function in a Markov decision problem, using observations of agent actions. As already observed by Russell [1998], the problem is ill-posed: the reward function is not identifiable, even in the presence of perfect information about optimal behavior. We provide a resolution to this non-identifiability for problems with entropy regularization. For a given environment, we fully characterize the reward functions leading to a given policy and demonstrate that, given demonstrations of actions for the same reward under two distinct discount factors, or under sufficiently different environments, the unobserved reward can be recovered up to a constant. We also give general necessary and sufficient conditions for reconstruction of time-homogeneous rewards on finite horizons, and for action-independent rewards, generalizing recent results of Kim et al. [2021] and Fu et al. [2018].
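The non-identifiability described in the abstract can be illustrated on a toy problem. The following is a minimal sketch, not the authors' code: the two-state MDP, its numbers, and the names `soft_policy`, `max_gap`, and `h` are all assumptions made for illustration. It runs entropy-regularized (soft) value iteration and checks that adding a constant to the reward, or shifting it by a state-dependent term `h(s) - gamma * E[h(s')]`, leaves the optimal policy unchanged.

```python
import math

def soft_policy(P, r, gamma, lam=1.0, iters=500):
    """Soft value iteration on a finite MDP (illustrative helper).

    P[s][a][t] is the probability of moving from s to t under action a,
    r[s][a] is the reward, lam is the entropy weight.
    Returns policy[s][a], a softmax over the converged soft Q-values.
    """
    n_s, n_a = len(r), len(r[0])

    def q_values(V):
        return [[r[s][a] + gamma * sum(P[s][a][t] * V[t] for t in range(n_s))
                 for a in range(n_a)] for s in range(n_s)]

    def soft_max(qs):
        # lam * log(sum(exp(q / lam))), computed stably.
        m = max(qs)
        return m + lam * math.log(sum(math.exp((q - m) / lam) for q in qs))

    V = [0.0] * n_s
    for _ in range(iters):
        V = [soft_max(qs) for qs in q_values(V)]
    return [[math.exp((q - soft_max(qs)) / lam) for q in qs]
            for qs in q_values(V)]

def max_gap(p, q):
    """Largest absolute difference between two policies."""
    return max(abs(x - y) for rp, rq in zip(p, q) for x, y in zip(rp, rq))

# Hypothetical two-state, two-action environment (numbers are made up).
P = [[[0.9, 0.1], [0.2, 0.8]],
     [[0.5, 0.5], [0.1, 0.9]]]
r = [[1.0, 0.0], [0.0, 2.0]]
gamma = 0.9

# Reward shifted by a constant.
r_const = [[x + 5.0 for x in row] for row in r]

# Reward shifted by h(s) - gamma * E[h(s') | s, a] for an arbitrary h.
h = [3.0, -1.0]
r_shaped = [[r[s][a] + h[s] - gamma * sum(P[s][a][t] * h[t] for t in range(2))
             for a in range(2)] for s in range(2)]

pi = soft_policy(P, r, gamma)
pi_const = soft_policy(P, r_const, gamma)
pi_shaped = soft_policy(P, r_shaped, gamma)

# Both gaps should be zero up to floating-point error.
print(max_gap(pi, pi_const), max_gap(pi, pi_shaped))
```

Because the shaping term depends on `gamma` and on the transition probabilities, observing behavior under a second discount factor or a different environment constrains `h` further, which is the intuition behind the recovery result stated in the abstract.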

06/12/2021

Haoyang Cao, Samuel Cohen, Lukasz Szpruch

Similar Papers

Explicable Reward Design for Reinforcement Learning Agents

Rati Devidze, Goran Radanovic, Parameswaran Kamalaruban, Adish Singla

Keywords: optimization, reinforcement learning and planning, interpretability

There Is No Turning Back: A Self-Supervised Approach for Reversibility-Aware Reinforcement Learning

Nathan Grinsztajn, Johan Ferret, Olivier Pietquin, Philippe Preux, Matthieu Geist

Keywords: reinforcement learning and planning

Learning One Representation to Optimize All Rewards

Ahmed Touati, Yann Ollivier

Keywords: deep learning, reinforcement learning and planning, representation learning

Inverse Reinforcement Learning in a Continuous State Space with Formal Guarantees

Gregory Dexter, Kevin Bello, Jean Honorio

Keywords: theory, reinforcement learning and planning

Reinforcement Learning with Trajectory Feedback

Yonathan Efroni, Nadav Merlis, Shie Mannor

Provably Efficient Learning of Transferable Rewards

Alberto Maria Metelli, Giorgia Ramponi, Alessandro Concetti, Marcello Restelli

Keywords: Optimization, Convex Optimization, Reinforcement Learning and Planning, Combinatorial Optimization

Identifying the Reward Function by Anchor Actions

Sinong Geng, Houssam Nassif, Carlos Manzanares, Max Reppen, Ronnie Sircar

Keywords: Reinforcement Learning - General

Regularized Inverse Reinforcement Learning

Wonseok Jeon, Chen-Yang Su, Paul Barde, Thang Doan, Derek Nowrouzezahrai, Joelle Pineau

Keywords: reinforcement learning, regularized markov decision processes, reward learning, inverse reinforcement learning

Information Directed Reward Learning for Reinforcement Learning

David Lindner, Matteo Turchetta, Sebastian Tschiatschek, Kamil Ciosek, Andreas Krause

Keywords: reinforcement learning and planning, active learning

Efficient Exploration of Reward Functions in Inverse Reinforcement Learning via Bayesian Optimization

Sreejith Balakrishnan, Quoc Phong Nguyen, Bryan Kian Hsiang Low, Harold Soh

Joint Inference of Reward Machines and Policies for Reinforcement Learning

Zhe Xu, Ivan Gavran, Yousef Ahmad, Rupak Majumdar, Daniel Neider, Ufuk Topcu, Bo Wu

Keywords: Reward Machines, Automata Learning, Reinforcement Learning

Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping

Yujing Hu, Weixun Wang, Hangtian Jia, Yixiang Wang, Yingfeng Chen, Jianye Hao, Feng Wu, Changjie Fan

Reward Identification in Inverse Reinforcement Learning

Kuno Kim, Shivam Garg, Kiran Shiragur, Stefano Ermon

Keywords: Theory, RL, Decisions and Control Theory

Provably efficient safe exploration via primal-dual policy optimization

Dongsheng Ding, Xiaohan Wei, Zhuoran Yang, Zhaoran Wang, Mihailo Jovanovic

On Reward-Free Reinforcement Learning with Linear Function Approximation

Ruosong Wang, Simon Du, Lin Yang, Russ Salakhutdinov

Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers

Ben Eysenbach, Shreyas Chaudhari, Swapnil Asawa, Sergey Levine, Ruslan Salakhutdinov

Keywords: reinforcement learning, domain adaptation, transfer learning

Mean Field Equilibrium in Multi-Armed Bandit Game with Continuous Reward

Xiong Wang, Riheng Jia

Keywords: Machine Learning, Online Learning, Algorithmic Game Theory, Multi-agent Learning

On the Expressivity of Markov Reward

David Abel, Will Dabney, Anna Harutyunyan, Mark Ho, Michael Littman, Doina Precup, Satinder Singh

Keywords: reinforcement learning and planning

Maximum Likelihood Constraint Inference for Inverse Reinforcement Learning

Dexter R.R. Scobee, S. Shankar Sastry

Keywords: learning from demonstration, inverse reinforcement learning, constraint inference

Emergent Prosociality in Multi-Agent Games Through Gifting

Woodrow Z. Wang, Mark Beliaev, Erdem Bıyık, Daniel A. Lazar, Ramtin Pedarsani, Dorsa Sadigh

Keywords: Agent-based and Multi-agent Systems, Coordination and Cooperation, Multi-agent Learning, Noncooperative Games

Corruption-robust exploration in episodic reinforcement learning
