FIMAP: Feature Importance by Minimal Adversarial Perturbation

Abstract: Instance-based model-agnostic feature importance explanations (LIME, SHAP, L2X) are a popular form of algorithmic transparency. These methods generally return either a weighting or a subset of input features as an explanation for the classification of an instance. An alternative literature argues instead that counterfactual instances, which alter the black-box model's classification, provide a more actionable form of explanation. We present Feature Importance by Minimal Adversarial Perturbation (FIMAP), a neural-network-based approach that unifies feature importance and counterfactual explanations. We show that this approach combines the two paradigms, recovering the output of feature-weighting methods in continuous feature spaces, whilst indicating the direction in which the nearest counterfactuals can be found. Our method also provides an implicit confidence estimate in its own explanations, something existing methods lack. Additionally, FIMAP improves upon the speed of sampling-based methods, such as LIME, by an order of magnitude, allowing for explanation deployment in time-critical applications. We extend our approach to categorical features using a partitioned Gumbel layer and demonstrate its efficacy on standard datasets.
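
The link between a minimal adversarial perturbation and feature importance can be illustrated on a toy case. The sketch below is not FIMAP itself, which the abstract describes as a neural-network-based approach; it assumes a hand-written linear classifier, for which the smallest L2 perturbation that flips the prediction has a closed form. The names `minimal_flip_perturbation`, `predict`, and the `overshoot` parameter are hypothetical and chosen only for illustration.

```python
def predict(w, b, x):
    """Toy linear classifier (an assumption): class 1 if w.x + b > 0, else 0."""
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    return 1 if score > 0 else 0


def minimal_flip_perturbation(w, b, x, overshoot=1e-6):
    """Smallest L2 perturbation that moves x just across w.x + b = 0.

    Closed form for a linear model only: project x onto the decision
    boundary along w, then step slightly past it (overshoot) so that
    the predicted class actually changes.
    """
    norm_sq = sum(wi * wi for wi in w)
    if norm_sq == 0:
        raise ValueError("w must contain at least one non-zero weight")
    score = sum(wi * xi for wi, xi in zip(w, x)) + b
    scale = -(1.0 + overshoot) * score / norm_sq
    return [scale * wi for wi in w]


# Hypothetical model and instance, for illustration only.
w = [2.0, -1.0, 0.0]
b = -0.5
x = [1.0, 0.5, 3.0]

delta = minimal_flip_perturbation(w, b, x)

# Magnitude of each component: how much that feature must move.
importance = [abs(d) for d in delta]

# The perturbed instance is a counterfactual: it receives the other class.
counterfactual = [xi + di for xi, di in zip(x, delta)]

print("perturbation:  ", delta)
print("importance:    ", importance)
print("original class:", predict(w, b, x))
print("counterfactual:", predict(w, b, counterfactual))
```

In this toy setting the sign of each perturbation component gives the direction towards the nearest counterfactual and its magnitude acts as a feature weighting, which mirrors the two readings the abstract combines. A feature with zero weight receives zero perturbation and therefore zero importance.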

12/07/2020

Matt Chapman-Rounds, Umang Bhatt, Erik Pazos, Marc-Andre Schulz, Konstantinos Georgatzis

Similar Papers

When Explanations Lie: Why Many Modified BP Attributions Fail

Leon Sixt, Maximilian Granz, Tim Landgraf

Accountability, Transparency and Interpretability

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

Fan Bao, Guoqiang Wu, Chongxuan LI, Jun Zhu, Bo Zhang

optimization

Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration

Jaekyeom Kim, Minjung Kim, Dongyeon Woo, Gunhee Kim

Reinforcement learning, Information bottleneck

Forward and Backward Information Retention for Accurate Binary Neural Networks

Haotong Qin, Ruihao Gong, Xianglong Liu, Mingzhu Shen, Ziran Wei, Fengwei Yu, Jingkuan Song

model compression, binary neural networks, deep learning, quantization, computer vision

Towards Competitive N-gram Smoothing

Moein Falahatgar, Mesrob Ohannessian, Alon Orlitsky, Venkatadheeraj Pichapati

Domain Adaptation with Conditional Distribution Matching and Generalized Label Shift

Remi Tachet des Combes, Han Zhao, Yu-Xiang Wang, Geoffrey Gordon

Minimax Weight and Q-Function Learning for Off-Policy Evaluation

Masatoshi Uehara, Jiawei Huang, Nan Jiang

Reinforcement Learning - Theory

Learning to Faithfully Rationalize by Construction

Sarthak Jain, Sarah Wiegreffe, Yuval Pinter, Byron C. Wallace

NLP, neural classification, training, automatic evaluations

Decoupling Value and Policy for Generalization in Reinforcement Learning

Roberta Raileanu, Rob Fergus

Theory, Learning Theory, Theory, Large Deviations and Asymptotic Analysis, Reinforcement Learning and Planning, Deep RL

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang

deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

Counterfactual Explanations for Oblique Decision Trees: Exact, Efficient Algorithms

Miguel Á. Carreira-Perpiñán, Suryabhan Singh Hada

Conformal Bayesian Computation

Edwin Fong, Chris C Holmes

machine learning

Can Subnetwork Structure Be the Key to Out-of-Distribution Generalization?

Dinghuai Zhang, Kartik Ahuja, Yilun Xu, Yisen Wang, Aaron Courville

Deep Learning, Algorithms, Theory; Theory, Regularization

Implicit Class-Conditioned Domain Alignment for Unsupervised Domain Adaptation

Xiang Jiang, Qicheng Lao, Stan Matwin, Mohammad Havaei

Deep Learning - Algorithms

High Fidelity GAN Inversion via Prior Multi-Subspace Feature Composition

Guanyue Li, Qianfen Jiao, Sheng Qian, Si Wu, Hau-San Wong

Context Mover's Distance & Barycenters: Optimal Transport of Contexts for Building Representations

Sidak Pal Singh, Andreas Hug, Aymeric Dieuleveut, Martin Jaggi

Robust Representation Learning via Perceptual Similarity Metrics

Saeid A Taghanaki, Kristy Choi, Amir Hosein Khasahmadi, Anirudh Goyal

Deep Learning, Embedding and Representation learning

Algorithmic stability and generalization of an unsupervised feature selection algorithm

Xinxing Wu, Qiang Cheng

deep learning

Improving adversarial robustness via unlabeled out-of-domain data

Zhun Deng, Linjun Zhang, Amirata Ghorbani, James Zou

Proximal Causal Learning with Kernels: Two-Stage Estimation and Moment Restriction

Afsaneh Mastouri, Yuchen Zhu, Limor Gultchin, Anna Korba, Ricardo Silva, Matt J. Kusner, Arthur Gretton, Krikamol Muandet

Algorithms, Kernel Methods

GMAC: A Distributional Perspective on Actor-Critic Framework

Daniel Nam, Younghoon Kim, Chan Park
