Fairwashing explanations with off-manifold detergent

12/07/2020

Fairwashing explanations with off-manifold detergent

Christopher Anders, Ann-Kathrin Dombrowski, Klaus-robert Mueller, Pan Kessel, Plamen Pasliev

Keywords: Accountability, Transparency and Interpretability

Abstract Paper Similar Papers

Abstract: Explanation methods promise to make black-box classifiers more transparent. As a result, it is hoped that they can act as proof for a sensible, fair and trustworthy decision-making process of the algorithm and thereby increase its acceptance by the end-users. In this paper, we show both theoretically and experimentally that these hopes are presently unfounded. Specifically, we show that, for any classifier $g$, one can always construct another classifier $\tilde{g}$ which has the same behavior on the data (same train, validation, and test error) but has arbitrarily manipulated explanation maps. We derive this statement theoretically using differential geometry and demonstrate it experimentally for various explanation methods, architectures, and datasets. Motivated by our theoretical insights, we then propose a modification of existing explanation methods which makes them significantly more robust.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Classification with Valid and Adaptive Coverage

Yaniv Romano, Matteo Sesia, Emmanuel Candes

Keywords Paper

0

0

0

0

3:14

19/08/2021

Probabilistic Sufficient Explanations

Eric Wang, Pasha Khosravi, Guy Van den Broeck

Keywords Paper

Machine Learning, Explainable/Interpretable Machine Learning, Explainability, Exact Probabilistic Inference

0

0

0

0

12:13

18/07/2021

Towards the Unification and Robustness of Perturbation and Gradient Based Explanations

Sushant Agarwal, Shahin Jabbari, Chirag Agarwal and
Sohini Upadhyay, Steven Wu, Himabindu Lakkaraju

Keywords Paper

Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

0

0

0

0

5:18

02/02/2021

Automated Symbolic Law Discovery: A Computer Vision Approach

Hengrui Xing, Ansaf Salleb-Aouissi, Nakul Verma

Keywords Paper

0

0

0

0

15:30

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

12/07/2020

A Generic First-Order Algorithmic Framework for Bi-Level Programming Beyond Lower-Level Singleton

Risheng Liu, Pan Mu, Xiaoming Yuan and
Shangzhi Zeng, Jin Zhang

Keywords Paper

Optimization - Non-convex

0

0

0

0

13:51

22/09/2020

Making neural networks interpretable with attribution: Application to implicit signals prediction

Darius Afchar, Romain Hennequin

Keywords Paper

Implicit Recommender System, Interpretable machine learning

0

0

0

0

2:28

06/12/2021

USCO-Solver: Solving Undetermined Stochastic Combinatorial Optimization Problems

Guangmo Tong

Keywords Paper

optimization

0

0

0

0

15:00

02/02/2021

Fairness, Semi-Supervised Learning, and More: A General Framework for Clustering with Stochastic Pairwise Constraints

Brian Brubach, Darshan Chakrabarti, John P. Dickerson and
Aravind Srinivasan, Leonidas Tsepenekas

Keywords Paper

0

0

0

0

17:56

26/08/2020

Model-Agnostic Counterfactual Explanations for Consequential Decisions

Amir-Hossein Karimi, Gilles Barthe, Borja Balle, Isabel Valera

Keywords Paper

0

0

0

0

16:25

06/12/2020

Finding the Homology of Decision Boundaries with Active Learning

Weizhi Li, Gautam Dasarathy, Karthi Natesan Ramamurthy, Visar Berisha

Keywords Paper

Algorithms -> AutoML; Applications -> Fairness, Accountability, and Transparency; Optimization -> Stochastic Optimization, Algorithms -> Classification

0

0

0

0

3:27

13/04/2021

DebiNet: Debiasing linear models with nonlinear overparameterized neural networks

Shiyun Xu, Zhiqi Bu

Keywords Paper

0

0

0

0

2:56

13/04/2021

SONIA: A symmetric blockwise truncated optimization algorithm

Majid Jahani, MohammadReza Nazari, Rachael Tappenden and
Albert Berahas, Martin Takac

Keywords Paper

0

0

0

0

2:55

06/12/2021

Faster Matchings via Learned Duals

Michael Dinitz, Sungjin Im, Thomas Lavastida and
Benjamin Moseley, Sergei Vassilvitskii

Keywords Paper

theory, optimization

0

0

0

0

20:11

18/07/2021

Scalable Normalizing Flows for Permutation Invariant Densities

Marin Biloš, Stephan Günnemann

Keywords Paper

Deep Learning, Generative Models

0

0

0

0

5:10

14/06/2020

SQE: a Self Quality Evaluation Metric for Parameters Optimization in Multi-Object Tracking

Yanru Huang, Feiyu Zhu, Zheni Zeng and
Xi Qiu, Yuan Shen, Jianan Wu

Keywords Paper

multi-object tracking, self quality evaluation, gaussian mixture model, parameters self-optimization

0

0

0

0

1:00

06/12/2021

On Calibration and Out-of-Domain Generalization

Yoav Wald, Amir Feder, Daniel Greenfeld, Uri Shalit

Keywords Paper

machine learning, domain adaptation, causality

0

0

0

0

11:00

06/12/2020

Revisiting the Sample Complexity of Sparse Spectrum Approximation of Gaussian Processes

Minh Hoang, Nghia Hoang, Hai Pham, David Woodruff

Keywords Paper

, Deep Learning

0

0

0

0

3:25

30/11/2020

Reconstructing Human Body Mesh from Point Clouds by Adversarial GP Network

Boyao Zhou, Jean-Sebastien Franco, Federica Bogo and
Bugra Tekin, Edmond Boyer

Keywords Paper

0

0

0

0

7:09

06/12/2020

Adversarial Robustness of Supervised Sparse Coding

Jeremias Sulam, Ramchandran Muthukumar, Raman Arora

Keywords Paper

0

0

0

0

3:08

02/02/2021

PenDer: Incorporating Shape Constraints via Penalized Derivatives

Akhil Gupta, Lavanya Marla, Ruoyu Sun and
Naman Shukla, Arinbjörn Kolbeinsson

Keywords Paper

0

0

0

0

15:13

13/04/2021

Towards a theoretical understanding of the robustness of variational autoencoders

Alexander Camuto, Matthew Willetts, Stephen Roberts and
Chris Holmes, Tom Rainforth

Keywords Paper

0

0

0

0

3:00

06/12/2021

Quantifying and Improving Transferability in Domain Generalization

Guojun Zhang, Han Zhao, Yaoliang Yu, Pascal Poupart

Keywords Paper

domain adaptation, transfer learning

0

0

0

0

10:27

06/12/2020

Approximate Heavily-Constrained Learning with Lagrange Multiplier Models

Harikrishna Narasimhan, Andy Cotter, Yichen Zhou and
Serena Wang, Wenshuo Guo

Keywords Paper

0

0

0

0

3:21

30/11/2020

Modeling Cross-Modal interaction in a Multi-detector, Multi-modal Tracking Framework

Yiqi Zhong, Suya You, Ulrich Neumann

Keywords Paper

0

0

0

0

8:03

06/12/2020

Memory-Efficient Learning of Stable Linear Dynamical Systems for Prediction and Control

Giorgos Mamakoukas, Orest Xherija, Todd Murphey

Keywords Paper

Optimization -> Non-Convex Optimization, Optimization -> Stochastic Optimization

0

0

0

0

3:13

12/07/2020

Robust Black Box Explanations Under Distribution Shift

Himabindu Lakkaraju, Nino Arsov, Osbert Bastani

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

14:02

03/05/2021

Genetic Soft Updates for Policy Evolution in Deep Reinforcement Learning

Enrico Marchesini, Davide Corsi, Alessandro Farinelli

Keywords Paper

Evolutionary Algorithms, Deep Reinforcement Learning, Machine Learning for Robotics, Formal Verification

0

0

0

0

5:01

12/07/2020

Why Are Learned Indexes So Effective?

Paolo Ferragina, Fabrizio Lillo, Giorgio Vinciguerra

Keywords Paper

Applications - Other

0

0

0

0

13:22

06/12/2021

Hyperparameter Optimization Is Deceiving Us, and How to Stop It

A. Feder Cooper, Yucheng Lu, Jessica Forde, Christopher De Sa

Keywords Paper

optimization, machine learning

0

0

0

0

11:55

16/11/2020

An Imitation Game for Learning Semantic Parsers from User Interaction

Ziyu Yao, Yiqi Tang, Wen-tau Yih and
Huan Sun, Yu Su

Keywords Paper

bootstrapping, fine-tuning parsers, theoretical analysis, text-to-sql problem

0

0

0

0

11:49

12/07/2020

Learning Reasoning Strategies in End-to-End Differentiable Proving

Pasquale Minervini, Tim Rocktäschel, Sebastian Riedel and
Edward Grefenstette, Pontus Stenetorp

Keywords Paper

Sequential, Network, and Time-Series Modeling

0

0

0

0

16:38

14/06/2020

Learning to Optimize on SPD Manifolds

Zhi Gao, Yuwei Wu, Yunde Jia, Mehrtash Harandi

Keywords Paper

riemannian optimization, symmetric positive definite (spd) manifolds, optimization-based meta-learning, automatical spd optimizer design, learning to optimize, gradiend-based spd optimization, optimization problems with spd constraints

0

0

0

0

0:50

18/07/2021

Implicit rate-constrained optimization of non-decomposable objectives

Abhishek Kumar, Harikrishna Narasimhan, Andrew Cotter

Keywords Paper

Algorithms, Supervised Learning

0

0

0

0

3:48

18/07/2021

Systematic Analysis of Cluster Similarity Indices: How to Validate Validation Measures

Martijn Gösgens, Aleksei Tikhonov, Liudmila Prokhorenkova

Keywords Paper

Algorithms, Clustering

0

0

0

0

5:12

06/12/2021

Physics-Integrated Variational Autoencoders for Robust and Interpretable Generative Modeling

Naoya Takeishi, Alexandros Kalousis

Keywords Paper

deep learning, machine learning, generative model, interpretability

0

0

0

0

8:13

06/12/2021

Stability and Generalization of Bilevel Programming in Hyperparameter Optimization

Fan Bao, Guoqiang Wu, Chongxuan LI and
Jun Zhu, Bo Zhang

Keywords Paper

optimization

0

0

0

0

8:58

12/07/2020

Online metric algorithms with untrusted predictions

Antonios Antoniadis, Christian Coester, Marek Elias and
Adam Polak, Bertrand Simon

Keywords Paper

Trustworthy Machine Learning

0

0

0

0

15:15

12/07/2020

Prediction-Guided Multi-Objective Reinforcement Learning for Continuous Robot Control

Jie Xu, Yunsheng Tian, Pingchuan Ma and
Daniela Rus, Shinjiro Sueda, Wojciech Matusik

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:15

03/05/2021

Evaluation of Similarity-based Explanations

Kazuaki Hanawa, Sho Yokoi, Satoshi Hara, Kentaro Inui

Keywords Paper

Interpretability, Explainability

0

0

0

0

5:11