TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

02/02/2021

TripleTree: A Versatile Interpretable Representation of Black Box Agents and their Environments

Tom Bewley, Jonathan Lawry

Keywords:

Abstract Paper Similar Papers

Abstract: In explainable artificial intelligence, there is increasing interest in understanding the behaviour of autonomous agents to build trust and validate performance. Modern agent architectures, such as those trained by deep reinforcement learning, are currently so lacking in interpretable structure as to effectively be black boxes, but insights may still be gained from an external, behaviourist perspective. Inspired by conceptual spaces theory, we suggest that a versatile first step towards general understanding is to discretise the state space into convex regions, jointly capturing similarities over the agent's action, value function and temporal dynamics within a dataset of observations. We create such a representation using a novel variant of the CART decision tree algorithm, and demonstrate how it facilitates practical understanding of black box agents through prediction, visualisation and rule-based explanation.

The video of this talk cannot be embedded. You can watch it here:

https://slideslive.com/38948123

(Link will open in new window)

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at AAAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

22/09/2020

Making neural networks interpretable with attribution: Application to implicit signals prediction

Darius Afchar, Romain Hennequin

Keywords Paper

Implicit Recommender System, Interpretable machine learning

0

0

0

0

2:28

26/04/2020

Meta-Learning Acquisition Functions for Transfer Learning in Bayesian Optimization

Michael Volpp, Lukas P. Fröhlich, Kirsten Fischer and
Andreas Doerr, Stefan Falkner, Frank Hutter, Christian Daniel

Keywords Paper

Transfer Learning, Meta Learning, Bayesian Optimization, Reinforcement Learning

0

0

0

0

5:05

02/02/2021

MARTA: Leveraging Human Rationales for Explainable Text Classification

Ines Arous, Ljiljana Dolamic, Jie Yang and
Akansha Bhardwaj, Giuseppe Cuccu, Philippe Cudré-Mauroux

Keywords Paper

0

0

0

0

16:43

12/07/2020

Probing Emergent Semantics in Predictive Agents via Question Answering

Abhishek Das, Federico Carnevale, Hamza Merzic and
Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Greg Wayne, Feilx Hill

Keywords Paper

Applications - Neuroscience, Cognitive Science, Biology and Health

0

0

0

0

12:31

02/02/2021

Exploring Explainable Selection to Control Abstractive Summarization

Haonan Wang, Yang Gao, Yu Bai and
Mirella Lapata, Heyan Huang

Keywords Paper

0

0

0

0

18:50

19/08/2021

Reasoning-Based Learning of Interpretable ML Models

Alexey Ignatiev, Joao Marques-Silva, Nina Narodytska, Peter J. Stuckey

Keywords Paper

Constraints and SAT, General, General, General

0

0

0

0

14:43

05/01/2021

Assessing Image and Text Generation With Topological Analysis and Fuzzy Logic

Goncalo Mordido, Julian Niedermeier, Christoph Meinel

Keywords Paper

0

0

0

0

4:51

18/07/2021

AGENT: A Benchmark for Core Psychological Reasoning

Tianmin Shu, Abhishek Bhandwaldar, Chuang Gan and
Kevin Smith, Shari Liu, Dan Gutfreund, Elizabeth Spelke, Josh Tenenbaum, Tomer Ullman

Keywords Paper

Applications, Neuroscience and Cognitive Science

0

0

0

0

5:14

04/07/2020

Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Hanjie Chen, Guangtao Zheng, Yangfeng Ji

Keywords Paper

Text Classification, Generating explanations, natural processing, model prediction

0

0

0

0

11:47

12/07/2020

Online metric algorithms with untrusted predictions

Antonios Antoniadis, Christian Coester, Marek Elias and
Adam Polak, Bertrand Simon

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

15:15

02/02/2021

Text-based RL Agents with Commonsense Knowledge: New Challenges, Environments and Baselines

Keerthiram Murugesan, Mattia Atzeni, Pavan Kapanipathi and
Pushkar Shukla, Sadhana Kumaravel, Gerald Tesauro, Kartik Talamadupula, Mrinmaya Sachan, Murray Campbell

Keywords Paper

0

0

0

0

17:44

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

18/07/2021

Inverse Decision Modeling: Learning Interpretable Representations of Behavior

Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Keywords Paper

Reinforcement Learning and Planning, Others

0

0

0

0

16:11

02/02/2021

Asking the Right Questions: Learning Interpretable Action Models Through Query Answering

Pulkit Verma, Shashank Rao Marpally, Siddharth Srivastava

Keywords Paper

0

0

0

0

18:48

06/12/2020

Deep Reinforcement Learning with Stacked Hierarchical Attention for Text-based Games

Yunqiu Xu, Meng Fang, Ling Chen and
Yali Du, Joey Tianyi Zhou, Chengqi Zhang

Keywords Paper

0

0

0

0

3:08

26/04/2020

Emergent Tool Use From Multi-Agent Autocurricula

Bowen Baker, Ingmar Kanitscheider, Todor Markov and
Yi Wu, Glenn Powell, Bob McGrew, Igor Mordatch

Keywords Paper

0

0

0

0

5:10

02/02/2021

Neural Analogical Matching

Maxwell Crouse, Constantine Nakos, Ibrahim Abdelaziz, Ken Forbus

Keywords Paper

0

0

0

0

14:02

05/12/2020

DAPPER: Learning domain-adapted persona representation using pretrained BERT and external memory

Prashanth Vijayaraghavan, Eric Chu, Deb Roy

Keywords Paper

0

0

0

0

14:48

12/07/2020

Generating Programmatic Referring Expressions via Program Synthesis

Jiani Huang, Calvin Smith, Osbert Bastani and
Rishabh Singh, Aws Albarghouthi, Mayur Naik

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

13:54

05/01/2021

Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation

Tianqi Tang, Xin Yu, Xuanyi Dong, Yi Yang

Keywords Paper

0

0

0

0

4:39

16/11/2020

Learning Explainable Linguistic Expressions with Neural Inductive Logic Programming for Sentence Classification

Prithviraj Sen, Marina Danilevsky, Yunyao Li and
Siddhartha Brahma, Matthias Boehm, Laura Chiticariu, Rajasekar Krishnamurthy

Keywords Paper

interpretability models, sentence classification, le, human-machine models

0

0

0

0

9:42

02/02/2021

A Hybrid Probabilistic Approach for Table Understanding

Kexuan Sun, Harsha Rayudu, Jay Pujara

Keywords Paper

0

0

0

0

18:27

19/08/2021

Building Affordance Relations for Robotic Agents - A Review

Paola Ardón, Èric Pairet, Katrin S. Lohan and
Subramanian Ramamoorthy, Ron P. A. Petrick

Keywords Paper

Multidisciplinary topics and applications, General, General

0

0

0

0

11:26

03/05/2021

Prototypical Representation Learning for Relation Extraction

Ning Ding, Xiaobin Wang, Yao Fu and
Guangwei Xu, Rui Wang, Pengjun Xie, Ying Shen, Fei Huang, Hai-Tao Zheng, Rui Zhang

Keywords Paper

NLP, Representation Learning, Relation Extraction

0

0

0

0

5:14

03/05/2021

Deep symbolic regression: Recovering mathematical expressions from data via risk-seeking policy gradients

Brenden Petersen, Mikel Landajuela Larma, Terrell N Mundhenk and
Claudio Santiago, Soo Kim, Joanne Kim

Keywords Paper

reinforcement learning, automated machine learning, symbolic regression

0

0

0

0

15:02

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords Paper

reinforcement learning and planning, transformers

0

0

0

0

8:43

06/12/2020

Grasp Proposal Networks: An End-to-End Solution for Visual Learning of Robotic Grasps

Chaozheng Wu, Jian Chen, Qiaoyu Cao and
Jianchi Zhang, Yunxin Tai, Lin Sun, Kui Jia

Keywords Paper

0

0

0

0

3:19

06/12/2020

What Did You Think Would Happen? Explaining Agent Behaviour through Intended Outcomes

Herman Yau, Chris Russell, Simon Hadfield

Keywords Paper

0

0

0

0

3:15

06/12/2021

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

Lin Guan, Mudit Verma, Suna (Sihang) Guo and
Ruohan Zhang, Subbarao Kambhampati

Keywords Paper

reinforcement learning and planning, machine learning

0

0

0

0

13:41

26/04/2020

Explain Your Move: Understanding Agent Actions Using Focused Feature Saliency

Piyush Gupta, Nikaash Puri, Sukriti Verma and
Dhruv Kayastha, Shripad Deshmukh, Balaji Krishnamurthy, Sameer Singh

Keywords Paper

Deep Reinforcement Learning, Saliency maps, Chess, Atari games, Interpretable AI

0

0

0

0

4:59

19/08/2021

Probabilistic Sufficient Explanations

Eric Wang, Pasha Khosravi, Guy Van den Broeck

Keywords Paper

Machine Learning, Explainable/Interpretable Machine Learning, Explainability, Exact Probabilistic Inference

0

0

0

0

12:13

12/07/2020

Active World Model Learning in Agent-rich Environments with Progress Curiosity

Kuno Kim, Megumi Sano, Julian De Freitas and
Nick Haber, Daniel Yamins

Keywords Paper

Applications - Neuroscience, Cognitive Science, Biology and Health

0

0

0

0

15:25

06/12/2020

Leap-Of-Thought: Teaching Pre-Trained Models to Systematically Reason Over Implicit Knowledge

Alon Talmor, Oyvind Tafjord, Peter Clark and
Yoav Goldberg, Jonathan Berant

Keywords Paper

0

0

0

0

3:28

23/08/2020

Adversarial infidelity learning for model interpretation

Jian Liang, Bing Bai, Yuren Cao and
Kun Bai, Fei Wang

Keywords Paper

infidelity, model interpretation, adversarial learning, black-box explanations

0

0

0

0

5:34

19/08/2021

Identifying Norms from Observation Using MCMC Sampling

Stephen Cranefield, Ashish Dhiman

Keywords Paper

Agent-based and Multi-agent Systems, Normative systems, Agent Societies, Bayesian Learning

0

0

0

0

14:44

02/02/2021

A Unified Taylor Framework for Revisiting Attribution Methods

Huiqi Deng, Na Zou, Mengnan Du and
Weifu Chen, Guocan Feng, Xia Hu

Keywords Paper

0

0

0

0

16:18

06/12/2021

Generalizable Imitation Learning from Observation via Inferring Goal Proximity

Youngwoon Lee, Andrew Szot, Shao-Hua Sun, Joseph Lim

Keywords Paper

reinforcement learning and planning

0

0

0

1

9:09

19/08/2021

What’s the Context? Implicit and Explicit Assumptions in Model-Based Goal Recognition

Peta Masters, Mor Vered

Keywords Paper

Agent-based and multi-agent based systems, General, General, General

0

0

0

0

12:05

06/12/2020

Finding the Homology of Decision Boundaries with Active Learning

Weizhi Li, Gautam Dasarathy, Karthi Natesan Ramamurthy, Visar Berisha

Keywords Paper

Algorithms -> AutoML; Applications -> Fairness, Accountability, and Transparency; Optimization -> Stochastic Optimization, Algorithms -> Classification

0

0

0

0

3:27

02/02/2021

Self-Attention Attribution: Interpreting Information Interactions Inside Transformer

Yaru Hao, Li Dong, Furu Wei, Ke Xu

Keywords Paper

0

0

0

0

16:26