The effectiveness of feature attribution methods and its correlation with automatic evaluation scores

Abstract: Explaining the decisions of an Artificial Intelligence (AI) model is increasingly critical in many real-world, high-stakes applications. Hundreds of papers have proposed new feature attribution methods, or have discussed or harnessed these tools in their work. However, despite humans being the target end-users, most attribution methods have only been evaluated on proxy automatic-evaluation metrics (Zhang et al. 2018; Zhou et al. 2016; Petsiuk et al. 2018). In this paper, we conduct the first user study to measure the effectiveness of attribution maps in assisting humans in ImageNet classification and Stanford Dogs fine-grained classification, both when an image is natural and when it is adversarial (i.e., contains adversarial perturbations). Overall, feature attribution is surprisingly not more effective than showing humans nearest training-set examples. On the harder task of fine-grained dog categorization, presenting attribution maps to humans does not help, but instead hurts the performance of human-AI teams compared to AI alone. Importantly, we found that automatic attribution-map evaluation measures correlate poorly with actual human-AI team performance. Our findings encourage the community to rigorously test their methods on downstream human-in-the-loop applications and to rethink the existing evaluation metrics.

Giang Nguyen, Daeyoung Kim, Anh Nguyen

Similar Papers

Make Up Your Mind! Adversarial Generation of Inconsistent Natural Language Explanations

Oana-Maria Camburu, Brendan Shillingford, Pasquale Minervini, Thomas Lukasiewicz, Phil Blunsom

Keywords: Adversarial Explanations, artificial systems, generation explanations, sanity models

How Well do Feature Visualizations Support Causal Understanding of CNN Activations?

Roland S. Zimmermann, Judy Borowski, Robert Geirhos, Matthias Bethge, Thomas Wallis, Wieland Brendel

Keywords: interpretability

TracKlinic: Diagnosis of Challenge Factors in Visual Tracking

Heng Fan, Fan Yang, Peng Chu, Yuewei Lin, Lin Yuan, Haibin Ling

Synthetic Training for Accurate 3D Human Pose and Shape Estimation in the Wild

Akash Sengupta, Roberto Cipolla, Ignas Budvytis

Keywords: 3D human shape estimation, 3D pose estimation, 3D reconstruction, smpl, synthetic data, pose and shape optimisation, 3D human dataset

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Weili Nie, Zhiding Yu, Lei Mao, Ankit Patel, Yuke Zhu, Anima Anandkumar

The Utility of Explainable AI in Ad Hoc Human-Machine Teaming

Rohan Paleja, Muyleng Ghuy, Nadun Ranawaka Arachchige, Reed Jensen, Matthew Gombolay

Keywords: machine learning, interpretability

Learning From Synthetic Animals

Jiteng Mu, Weichao Qiu, Gregory D. Hager, Alan L. Yuille

Keywords: synthetic data, unsupervised domain adaptation, semi-supervised learning, animals, 2d pose estimation, semantic part segmentation, domain generalization, consistency, multi-task learning

Spot The Bot: A Robust and Efficient Framework for the Evaluation of Conversational Dialogue Systems

Jan Deriu, Don Tuggener, Pius von Däniken, Jon Ander Campos, Alvaro Rodrigo, Thiziri Belkacem, Aitor Soroa, Eneko Agirre, Mark Cieliebak

Keywords: evaluation methods, conversational systems, chat bots, spot bot

An Analysis of the Utility of Explicit Negative Examples to Improve the Syntactic Abilities of Neural Language Models

Hiroshi Noji, Hiroya Takamura

Keywords: resolving agreement, Augmentation, Augmentation sentences, Syntactic Models

REVERIE: Remote Embodied Visual Referring Expression in Real Indoor Environments

Yuankai Qi, Qi Wu, Peter Anderson, Xin Wang, William Yang Wang, Chunhua Shen, Anton van den Hengel

Keywords: remote object grounding, visual-and-language navigation, intelligent robot agent, indoor environment, referring expression grounding

The role of Disentanglement in Generalisation

Milton Montero, Casimir JH Ludwig, Rui Ponte Costa, Gaurav Malhotra, Jeffrey Bowers

Keywords: generalisation, compositional generalization, generative models, compositionality, variational autoencoders, disentanglement

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

Lin Guan, Mudit Verma, Suna (Sihang) Guo, Ruohan Zhang, Subbarao Kambhampati

Keywords: reinforcement learning and planning, machine learning

SSAL: Synergizing between Self-Training and Adversarial Learning for Domain Adaptive Object Detection

Muhammad Akhtar Munir, Muhammad Haris Khan, M. Sarfraz, Mohsen Ali

Keywords: deep learning, vision, domain adaptation

Incremental Learning for Animal Pose Estimation using RBF k-DPP

Gaurav Kumar Nayak, Het Shah, Anirban Chakraborty

Keywords: animal pose estimation, incremental learning, Determinantal Point Processes, k-DPP, RBF k-DPP, image warping, exemplar memory

Adversarial AutoAugment

Xinyu Zhang, Qiang Wang, Jian Zhang, Zhao Zhong

Keywords: Automatic Data Augmentation, Adversarial Learning, Reinforcement Learning

RGBD-Dog: Predicting Canine Pose from RGBD Sensors

Sinéad Kearney, Wenbin Li, Martin Parsons, Kwang In Kim, Darren Cosker

Keywords: pose, shape, rgbd, dog, animal, depth, kinect, dataset, network, model

MOS: A Low Latency and Lightweight Framework for Face Detection, Landmark Localization, and Head Pose Estimation

Yepeng Liu, Zaiwang Gu, Shenghua Gao, Dong Wang, Yusheng Zeng, Jun Cheng

Keywords: face detect, head pose estimation, multi-task, Low Latency

On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering

Xinyu Wang, Yuliang Liu, Chunhua Shen, Chun Chet Ng, Canjie Luo, Lianwen Jin, Chee Seng Chan, Anton van den Hengel, Liangwei Wang

Keywords: visual question answering, scene text, ocr

What Does My QA Model Know? Devising Controlled Probes using Expert

Kyle Richardson, Ashish Sabharwal

Keywords: knowledge challenges, benchmark tasks, diagnostic tasks, taxonomic reasoning

DecAug: Augmenting HOI Detection via Decomposition

Hao-Shu Fang, Yichen Xie, Dian Shao, Yong-Lu Li, Cewu Lu
