EDGE: Explaining Deep Reinforcement Learning Policies

06/12/2021

EDGE: Explaining Deep Reinforcement Learning Policies

Wenbo Guo, Xian Wu, Usmann Khan, Xinyu Xing

Keywords: reinforcement learning and planning, adversarial robustness and security, generative model, kernel methods, interpretability

Abstract Paper Similar Papers

Abstract: With the rapid development of deep reinforcement learning (DRL) techniques, there is an increasing need to understand and interpret DRL policies. While recent research has developed explanation methods to interpret how an agent determines its moves, they cannot capture the importance of actions/states to a game's final result. In this work, we propose a novel self-explainable model that augments a Gaussian process with a customized kernel function and an interpretable predictor. Together with the proposed model, we also develop a parameter learning procedure that leverages inducing points and variational inference to improve learning efficiency. Using our proposed model, we can predict an agent's final rewards from its game episodes and extract time step importance within episodes as strategy-level explanations for that agent. Through experiments on Atari and MuJoCo games, we verify the explanation fidelity of our method and demonstrate how to employ interpretation to understand agent behavior, discover policy vulnerabilities, remediate policy errors, and even defend against adversarial attacks.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Value-driven Hindsight Modelling

Arthur Guez, Fabio Viola, Theophane Weber and
Lars Buesing, Steven Kapturowski, Doina Precup, David Silver, Nicolas Heess

Keywords Paper

1

0

0

0

3:20

12/07/2020

Learning Calibratable Policies using Programmatic Style-Consistency

Eric Zhan, Albert Tseng, Yisong Yue and
Adith Swaminathan, Matthew Hausknecht

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

15:05

06/12/2020

An Imitation from Observation Approach to Transfer Learning with Dynamics Mismatch

Siddharth Desai, Ishan Durugkar, Haresh Karnan and
Garrett Warnell, Josiah Hanna, Peter Stone

Keywords Paper

0

0

0

0

3:22

06/12/2020

Discovering Reinforcement Learning Algorithms

Junhyuk Oh, Matteo Hessel, Wojciech Czarnecki and
Zhongwen Xu, Hado van Hasselt, Satinder Singh, David Silver

Keywords Paper

0

0

0

0

3:21

26/04/2020

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Piyush Gupta, Nikaash Puri, Sukriti Verma and
Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Keywords Paper

Deep Reinforcement Learning, Saliency maps, Chess, Atari games, Interpretable AI

0

0

0

0

4:59

12/07/2020

Learning From Strategic Agents: Accuracy, Improvement, and Causality

Yonadav Shavit, Benjamin Edelman, Brian Axelrod

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

12:37

02/02/2021

Hindsight and Sequential Rationality of Correlated Play

Dustin Morrill, Ryan D'Orazio, Reca Sarfati and
Marc Lanctot, James R Wright, Amy R Greenwald, Michael Bowling

Keywords Paper

0

0

0

0

18:34

06/12/2021

Stateful Strategic Regression

Keegan Harris, Hoda Heidari, Steven Wu

Keywords Paper

optimization

0

0

0

0

14:02

02/02/2021

The Value-Improvement Path: Towards Better Representations for Reinforcement Learning

Will Dabney, André Barreto, Mark Rowland and
Robert Dadashi, John Quan, Marc G. Bellemare, David Silver

Keywords Paper

0

0

0

0

20:06

02/02/2021

Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines

Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi and
Pushkar Shukla, Sadhana Kumaravel, Gerald Tesauro, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell

Keywords Paper

0

0

0

0

17:44

03/05/2021

Learning Generalizable Visual Representations via Interactive Gameplay

Luca Weihs, Ani Kembhavi, Kiana Ehsani and
Sarah M Pratt, Winson Han, Alvaro Herrasti, Eric Kolve, Dustin Schwenk, Roozbeh Mottaghi, Ali Farhadi

Keywords Paper

computer vision, deep reinforcement learning, representation learning

0

0

0

0

14:17

02/02/2021

DeepSynth: Automata Synthesis for Automatic Task Segmentation in Deep Reinforcement Learning

Mohammadhosein Hasanbeig, Natasha Yogananda Jeppu, Alessandro Abate and
Tom Melham, Daniel Kroening

Keywords Paper

0

0

0

0

15:45

06/12/2021

Learning to Synthesize Programs as Interpretable and Generalizable Policies

Dweep Trivedi, Jesse Zhang, Shao-Hua Sun, Joseph Lim

Keywords Paper

deep learning, reinforcement learning and planning

0

0

0

0

10:30

06/12/2020

A Game Theoretic Analysis of Additive Adversarial Attacks and Defenses

Ambar Pal, Rene Vidal

Keywords Paper

0

0

0

0

3:19

25/04/2020

How Points and Theme Affect Performance and Experience in a Gamified Cognitive Task

Katelyn Wiley, Sarah Vedress, Regan Mandryk

Keywords Paper

cognitive tasks, dot probe, games, gamification, assessment

0

0

0

0

7:25

02/02/2021

Estimating α-Rank by Maximizing Information Gain

Tabish Rashid, Cheng Zhang, Kamil Ciosek

Keywords Paper

0

0

0

0

14:52

26/04/2020

Discriminative Particle Filter Reinforcement Learning for Complex Partial observations

Xiao Ma, Peter Karkus, David Hsu and
Wee Sun Lee, Nan Ye

Keywords Paper

Reinforcement Learning, Partial Observability, Differentiable Particle Filtering

0

0

0

0

5:08

03/05/2021

On the role of planning in model-based deep reinforcement learning

Jessica Hamrick, Abram Friesen, Feryal Behbahani and
Arthur Guez, Fabio Viola, Sims Witherspoon, Thomas Anthony, Lars Buesing, Petar Veličković, Theo Weber

Keywords Paper

planning, MuZero, model-based RL

0

0

0

0

5:15

12/07/2020

A Distributional Framework For Data Valuation

Amirata Ghorbani, Michael Kim, James Zou

Keywords Paper

Learning Theory

0

0

0

0

14:15

18/07/2021

Learning in Nonzero-Sum Stochastic Games with Potentials

David Mguni, Yutong Wu, Yali Du and
Yaodong Yang, Ziyi Wang, M. Li, Ying Wen, Joel Jennings, Jun Wang

Keywords Paper

Theory, Game Theory and Computational Economics

0

0

0

0

5:36

19/10/2020

Deep behavior tracing with multi-level temporality preserved embedding

Runze Wu, Hao Deng, Jianrong Tao and
Changjie Fan, Qi Liu, Liang Chen

Keywords Paper

game scene preloading, periodicity, self-attention, behavior tracing, temporal irregularity

0

0

0

0

8:03

18/07/2021

A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation

Scott Fujimoto, David Meger, Doina Precup

Keywords Paper

Reinforcement Learning and Planning, Deep RL

1

0

0

0

3:50

26/04/2020

Finding and Visualizing Weaknesses of Deep Reinforcement Learning Agents

Christian Rupprecht, Cyril Ibrahim, Christopher J. Pal

Keywords Paper

Visualization, Reinforcement Learning, Safety

0

0

0

0

4:52

12/07/2020

A Game Theoretic Perspective on Model-Based Reinforcement Learning

Aravind Rajeswaran, Igor Mordatch, Vikash Kumar

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

13:17

03/05/2021

Learning perturbation sets for robust machine learning

Eric Wong, Zico Kolter

Keywords Paper

conditional variational autoencoder, adversarial examples, perturbation sets, robust machine learning

0

1

0

0

5:06

02/02/2021

Successor Feature Sets: Generalizing Successor Representations Across Policies

Kianté Brantley, Soroush Mehri, Geoff J. Gordon

Keywords Paper

0

0

0

0

17:43

12/07/2020

Problems with Shapley-value-based explanations as feature importance measures

Indra Kumar, Suresh Venkatasubramanian, Carlos Scheidegger, Sorelle Friedler

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

13:57

12/07/2020

From Chaos to Order: Symmetry and Conservation Laws in Game Dynamics

Sai Ganesh Nagarajan, David Balduzzi, Georgios Piliouras

Keywords Paper

Learning Theory

0

0

0

0

15:35

03/05/2021

Learning to Represent Action Values as a Hypergraph on the Action Vertices

Arash Tavakoli, Mehdi Fatemi, Petar Kormushev

Keywords Paper

reinforcement learning, learning action representations, multi-dimensional discrete action spaces, structural inductive bias, structural credit assignment

0

0

0

0

3:43

26/04/2020

Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning

Hengyuan Hu, Jakob N Foerster

Keywords Paper

multi-agent RL, theory of mind

0

0

0

0

5:20

02/02/2021

Bayesian Distributional Policy Gradients

Luchen Li, A. Aldo Faisal

Keywords Paper

1

0

0

0

18:06

06/12/2020

Learning to Play Sequential Games versus Unknown Opponents

Pier Giuseppe Sessa, Ilija Bogunovic, Maryam Kamgarpour, Andreas Krause

Keywords Paper

0

0

0

0

3:04

12/07/2020

Distributionally Robust Policy Evaluation and Learning in Offline Contextual Bandits

Nian Si, Fan Zhang, Zhengyuan Zhou, Jose Blanchet

Keywords Paper

Online Learning, Active Learning, and Bandits

0

0

0

0

12:18

19/08/2021

Non-decreasing Quantile Function Network with Efficient Exploration for Distributional Reinforcement Learning

Fan Zhou, Zhoufan Zhu, Qi Kuang, Liwen Zhang

Keywords Paper

Machine Learning, Deep Reinforcement Learning

0

0

0

0

11:11

03/05/2021

Iterative Empirical Game Solving via Single Policy Best Response

Max Smith, Thomas Anthony, Michael Wellman

Keywords Paper

Reinforcement Learning, Multiagent Learning, Empirical Game Theory

0

0

0

0

8:49

18/07/2021

Provably Efficient Algorithms for Multi-Objective Competitive RL

Tiancheng Yu, Yi Tian, Jingzhao Zhang, Suvrit Sra

Keywords Paper

Theory, RL, Decisions and Control Theory

0

0

0

0

17:04

06/12/2020

Calibration of Shared Equilibria in General Sum Partially Observable Markov Games

Nelson Vadori, Sumitra Ganesh, Prashant Reddy, Manuela Veloso

Keywords Paper

0

0

0

0

3:18

06/12/2020

What Did You Think Would Happen? Explaining Agent Behaviour through Intended Outcomes

Herman Yau, Chris Russell, Simon Hadfield

Keywords Paper

0

0

0

0

3:15

06/12/2021

Episodic Multi-agent Reinforcement Learning with Curiosity-driven Exploration

Lulu Zheng, Jiarui Chen, Jianhao Wang and
Jiamin He, Yujing Hu, Yingfeng Chen, Changjie Fan, Yang Gao, Chongjie Zhang

Keywords Paper

reinforcement learning and planning

0

0

0

0

12:25

03/05/2021

Data-Efficient Reinforcement Learning with Self-Predictive Representations

Max Schwarzer, Ankesh Anand, Rishab Goel and
R Devon Hjelm, Aaron Courville, Philip Bachman

Keywords Paper

Representation Learning, Self-Supervised Learning, Reinforcement Learning, Sample Efficiency

0

0

0

1

10:04