When Explanations Lie: Why Many Modified BP Attributions Fail

12/07/2020

Leon Sixt, Maximilian Granz, Tim Landgraf

Keywords: Accountability, Transparency and Interpretability

Abstract: Attribution methods aim to explain a neural network's prediction by highlighting the most relevant image areas. A popular approach is to backpropagate (BP) a custom relevance score using modified rules, rather than the gradient. We analyze an extensive set of modified BP methods: Deep Taylor Decomposition, Layer-wise Relevance Propagation (LRP), Excitation BP, PatternAttribution, DeepLIFT, Deconv, RectGrad, and Guided BP. We find empirically that the explanations of all mentioned methods, except for DeepLIFT, are independent of the parameters of later layers. We provide theoretical insights for this surprising behavior and also analyze why DeepLIFT does not suffer from this limitation. Empirically, we measure how information of later layers is ignored by using our new metric, cosine similarity convergence (CSC). The paper provides a framework to assess the faithfulness of new and existing modified BP methods theoretically and empirically.
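
The abstract names cosine similarity convergence (CSC) without defining it. The following is a minimal sketch of one plausible reading, not the authors' implementation: two different starting relevance vectors are propagated backward through the same stack of layers, and their cosine similarity is recorded after each step. It assumes, purely for illustration, that each modified BP step acts like multiplication by a non-negative matrix; the function names, layer count, and width are hypothetical.

```python
import numpy as np


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors."""
    return float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))


def backpropagated_similarities(matrices, r1, r2):
    """Propagate two vectors backward through the same matrices and
    record their cosine similarity before and after every step."""
    sims = [cosine_similarity(r1, r2)]
    for m in reversed(matrices):
        r1 = m.T @ r1
        r2 = m.T @ r2
        # Rescaling avoids overflow and does not change the cosine.
        r1 = r1 / np.linalg.norm(r1)
        r2 = r2 / np.linalg.norm(r2)
        sims.append(cosine_similarity(r1, r2))
    return sims


rng = np.random.default_rng(0)
n_layers, width = 12, 64

# Assumption of this sketch: every backward step is a non-negative matrix.
matrices = [np.abs(rng.normal(size=(width, width))) for _ in range(n_layers)]

# Two orthogonal starting vectors, e.g. relevance placed on two different
# neurons of the last layer.
r1 = np.eye(width)[0]
r2 = np.eye(width)[1]

sims = backpropagated_similarities(matrices, r1, r2)
for depth, s in enumerate(sims):
    print(f"after {depth:2d} backward steps: cosine similarity = {s:.6f}")
```

Under these assumptions the similarity starts at zero and approaches one within a few steps, meaning both starting vectors yield nearly the same signal at the input. That is the sense in which an explanation can become independent of later layers.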

The talk and the corresponding paper were published at the ICML 2020 virtual conference.

Similar Papers

02/02/2021

FIMAP: Feature Importance by Minimal Adversarial Perturbation

Matt Chapman-Rounds, Umang Bhatt, Erik Pazos, Marc-Andre Schulz, Konstantinos Georgatzis

20:10

06/12/2021

Improving Deep Learning Interpretability by Saliency Guided Training

Aya Abdelsalam Ismail, Hector Corrada Bravo, Soheil Feizi

Keywords: deep learning, transformers, vision, language, interpretability

10:45

06/12/2021

Designing Counterfactual Generators using Deep Model Inversion

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan, Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords: optimization, representation learning, interpretability

13:28

06/12/2021

The Out-of-Distribution Problem in Explainability and Search Methods for Feature Importance Explanations

Peter Hase, Harry Xie, Mohit Bansal

Keywords: machine learning, interpretability

15:05

05/01/2021

Exploiting Spatial Relation for Reducing Distortion in Style Transfer

Jia-Ren Chang, Yong-Sheng Chen

4:54

06/12/2021

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

Fan Bao, Guoqiang Wu, Chongxuan LI, Jun Zhu, Bo Zhang

Keywords: optimization

8:58

06/12/2021

Re-ranking for image retrieval and transductive few-shot classification

Xi SHEN, Yang Xiao, Shell Hu, Othman Sbai, Mathieu Aubry

Keywords: machine learning, graph learning, meta learning, few shot learning

5:46

02/02/2021

Visualization of Supervised and Self-Supervised Neural Networks via Attribution Guided Factorization

Shir Gur, Ameen Ali, Lior Wolf

14:14

03/05/2021

Adversarial score matching and improved sampling for image generation

Alexia Jolicoeur-Martineau, Rémi Piché-Taillefer, Ioannis Mitliagkas, Remi Combes

Keywords: score matching, adversarial, generative model, GAN, Langevin dynamics

4:56

06/12/2021

When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning?

Lijie Fan, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, Chuang Gan

Keywords: machine learning, robustness, adversarial robustness and security, self-supervised learning, vision, contrastive learning, clustering

7:33

05/01/2021

Generative Patch Priors for Practical Compressive Image Recovery

Rushil Anirudh, Suhas Lohit, Pavan Turaga

5:03

03/05/2021

Multi-Class Uncertainty Calibration via Mutual Information Maximization-based Binning

Kanil Patel, William H Beluch, Bin Yang, Michael Pfeiffer, Dan Zhang

Keywords: deep neural networks, histogram binning, post-hoc calibration, uncertainty calibration, mutual information

5:13

12/07/2020

On the Generalization Effects of Linear Transformations in Data Augmentation

Sen Wu, Hongyang Zhang, Gregory Valiant, Christopher Re

Keywords: Transfer, Multitask and Meta-learning

15:18

14/06/2020

Forward and Backward Information Retention for Accurate Binary Neural Networks

Haotong Qin, Ruihao Gong, Xianglong Liu, Mingzhu Shen, Ziran Wei, Fengwei Yu, Jingkuan Song

Keywords: model compression, binary neural networks, deep learning, quantization, computer vision

1:00

06/12/2020

Bayesian Deep Learning and a Probabilistic Perspective of Generalization

Andrew Wilson, Pavel Izmailov

3:27

06/12/2021

Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning

Yifan Zhang, Bryan Hooi, Dapeng Hu, Jian Liang, Jiashi Feng

Keywords: optimization, machine learning, self-supervised learning, vision, contrastive learning, representation learning, transfer learning

14:34

14/06/2020

Selective Transfer With Reinforced Transfer Network for Partial Domain Adaptation

Zhihong Chen, Chao Chen, Zhaowei Cheng, Boyuan Jiang, Ke Fang, Xinyu Jin

Keywords: partial domain adaptation, selective transfer, pixel-level information, reconstruct error, reinforcement learning

1:01

14/06/2020

Structure-Preserving Super Resolution With Gradient Guidance

Cheng Ma, Yongming Rao, Yean Cheng, Ce Chen, Jiwen Lu, Jie Zhou

Keywords: super resolution, image restoration, image enhancement, structure preserving, generative model, generative adversarial network, gan, deep-learning

1:01

03/05/2021

Improving Relational Regularized Autoencoders with Spherical Sliced Fused Gromov Wasserstein

Khai Nguyen, Son Nguyen, Nhat Ho, Tung Pham, Hung Bui

Keywords: sliced fused Gromov Wasserstein, Relational regularized autoencoder, deep generative model, spherical distributions

4:40

06/12/2021

A Unified View of cGANs with and without Classifiers

Si-An Chen, Chun-Liang Li, Hsuan-Tien Lin

Keywords: machine learning, generative model

11:40

06/12/2021

Post-Training Quantization for Vision Transformer

Zhenhua Liu, Yunhe Wang, Kai Han, Wei Zhang, Siwei Ma, Wen Gao

Keywords: deep learning, transformers, vision

5:52

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords: data process, re collection, sota models, tacred

5:55

06/12/2020

Goal-directed Generation of Discrete Structures with Conditional Generative Models

Amina Mollaysa, Brooks Paige, Alexandros Kalousis

3:10

06/12/2020

On the Convergence of Smooth Regularized Approximate Value Iteration Schemes

Elena Smirnova, Elvis Dohmatob

3:22

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

19:54

03/05/2021

Rethinking the Role of Gradient-based Attribution Methods for Model Interpretability

Suraj Srinivas, François Fleuret

Keywords: Interpretability, saliency maps, score-matching

15:08

06/12/2021

Reducing the Covariate Shift by Mirror Samples in Cross Domain Alignment

Yin Zhao, minquan wang, Longjun Cai

Keywords: domain adaptation

9:28

06/12/2021

Representer Point Selection via Local Jacobian Expansion for Post-hoc Classifier Explanation of Deep Neural Networks and Ensemble Models

Yi Sui, Ga Wu, Scott Sanner

Keywords: deep learning, optimization, machine learning, vision

10:29

02/02/2021

Interpreting Deep Neural Networks with Relative Sectional Propagation by Analyzing Comparative Gradients and Hostile Activations

Woo-Jeoung Nam, Jaesik Choi, Seong-Whan Lee

14:51

02/02/2021

Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation

Junhong Shen, Lin F. Yang

19:12

18/07/2021

LAMDA: Label Matching Deep Domain Adaptation

Trung Le, Tuan Nguyen, Nhat Ho, Hung Bui, Dinh Phung

Keywords: Theory, Deep learning Theory

5:14

06/12/2021

Test-Time Classifier Adjustment Module for Model-Agnostic Domain Generalization

Yusuke Iwasawa, Yutaka Matsuo

Keywords: deep learning, optimization, transformers, domain adaptation

13:50

14/06/2020

When to Use Convolutional Neural Networks for Inverse Problems

Nathaniel Chodosh, Simon Lucey

Keywords: optimization, sparse coding, inverse problems, trajectory reconstruction, artifact removal

1:02

14/06/2020

A Characteristic Function Approach to Deep Implicit Generative Modeling

Abdul Fatir Ansari, Jonathan Scarlett, Harold Soh

Keywords: generative adversarial networks, generative models, probability metrics, characteristic functions, unsupervised learning

4:56

06/12/2021

Loss function based second-order Jensen inequality and its application to particle variational inference

Futoshi Futami, Tomoharu Iwata, naonori ueda, Issei Sato, Masashi Sugiyama

Keywords: optimization, generative model

14:09

26/08/2020

Neural Topic Model with Attention for Supervised Learning

Xinyi Wang, YI YANG

12:39

14/09/2020

Learning Gradient Boosted Multi-label Classification Rules

Michael Rapp, Eneldo Loza Mencía, Johannes Fürnkranz, Vu-Linh Nguyen, Eyke Hüllermeier

Keywords: multi-label classification, gradient boosting, rule learning

15:45

03/05/2021

Representation Learning via Invariant Causal Mechanisms

Jovana Mitrovic, Brian McWilliams, Jacob C Walker, Lars Buesing, Charles Blundell

Keywords: Self-supervised Learning, Representation Learning, Causality, Contrastive Methods

7:03

26/08/2020

Towards Competitive N-gram Smoothing

Moein Falahatgar, Mesrob Ohannessian, Alon Orlitsky, Venkatadheeraj Pichapati

17:51

06/12/2020

Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

Guoliang Kang, Yunchao Wei, Yi Yang, Yueting Zhuang, Alexander Hauptmann

3:16