Learning Deep Attribution Priors Based On Prior Knowledge

06/12/2020

Ethan Weinberger, Joe Janizek, Su-In Lee

Abstract: Feature attribution methods, which explain an individual prediction made by a model as a sum of attributions for each input feature, are an essential tool for understanding the behavior of complex deep learning models. However, ensuring that models produce meaningful explanations, rather than ones that rely on noise, is not straightforward. Exacerbating this problem, attribution methods do not provide insight into why features are assigned their attribution values, leading to explanations that are difficult to interpret. In real-world problems we often have additional information about each feature that is predictive of that feature's importance to the task at hand. Here, we propose the deep attribution prior (DAPr) framework to exploit such information to overcome the limitations of attribution methods. Our framework jointly learns a relationship between prior information and feature importance and biases models toward explanations that rely on features predicted to be important. We find that our framework both results in networks that generalize better to out-of-sample data and admits new methods for interpreting model behavior.
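
The two ideas in the abstract, attributions that sum to a prediction and a penalty that pulls attributions toward importances predicted from prior information, can be illustrated with a small sketch. This is not the authors' implementation: a linear model stands in for the deep network, the prior model is a hypothetical linear map over per-feature meta-features, and the absolute-difference penalty is only one plausible choice, since the abstract does not state the exact form. All function and variable names are assumptions made for illustration.

```python
# Illustrative sketch only. A linear model stands in for a deep network,
# and every name below is hypothetical rather than taken from the paper.

def linear_predict(weights, bias, x):
    """Prediction of a linear model."""
    return bias + sum(w * xi for w, xi in zip(weights, x))

def linear_attributions(weights, x, baseline):
    """Per-feature attributions relative to a baseline input.

    For a linear model these sum exactly to
    prediction(x) - prediction(baseline).
    """
    return [w * (xi - bi) for w, xi, bi in zip(weights, x, baseline)]

def prior_importance(prior_weights, meta_features):
    """Hypothetical prior model: maps each feature's meta-features
    to a non-negative predicted importance."""
    return [abs(sum(p * m for p, m in zip(prior_weights, row)))
            for row in meta_features]

def attribution_prior_penalty(attributions, predicted_importance):
    """Assumed penalty: mean gap between attribution magnitude and the
    importance predicted from prior information."""
    gaps = [abs(abs(a) - p) for a, p in zip(attributions, predicted_importance)]
    return sum(gaps) / len(gaps)

weights, bias = [2.0, -1.0, 0.5], 0.25
x, baseline = [1.0, 3.0, 4.0], [0.0, 0.0, 0.0]

attrs = linear_attributions(weights, x, baseline)
prediction = linear_predict(weights, bias, x)
baseline_prediction = linear_predict(weights, bias, baseline)

# One row of meta-features per input feature (made-up values).
meta_features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
importance = prior_importance([2.0, 1.0], meta_features)
penalty = attribution_prior_penalty(attrs, importance)

# In joint training, a term like this would be added to the task loss,
# with the model weights and the prior weights updated together.
task_loss, lam = 0.5, 0.1  # placeholder values
total_loss = task_loss + lam * penalty
```

In this toy setup the second feature receives a larger attribution than the prior model predicts, so it contributes most to the penalty; minimizing the combined loss would push the model to rely less on it unless the task loss disagrees.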

The talk and the corresponding paper were published at the NeurIPS 2020 virtual conference.

Similar Papers

05/01/2021

Adversarial Reinforcement Learning for Unsupervised Domain Adaptation

Youshan Zhang, Hui Ye, Brian D. Davison

4:52

23/08/2020

Adversarial infidelity learning for model interpretation

Jian Liang, Bing Bai, Yuren Cao, Kun Bai, Fei Wang

Keywords: infidelity, model interpretation, adversarial learning, black-box explanations

5:34

18/07/2021

Marginal Contribution Feature Importance - an Axiomatic Approach for Explaining Data

Amnon Catav, Boyang Fu, Yazeed Zoabi, Ahuva Weiss Meilik, Noam Shomron, Jason Ernst, Sriram Sankararaman, Ran Gilad-Bachrach

Keywords: Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

4:08

02/02/2021

Right for Better Reasons: Training Differentiable Models by Constraining their Influence Functions

Xiaoting Shao, Arseny Skryagin, Wolfgang Stammer, Patrick Schramowski, Kristian Kersting

19:08

02/02/2021

Bi-Classifier Determinacy Maximization for Unsupervised Domain Adaptation

Shuang Li, Fangrui Lv, Binhui Xie, Chi Harold Liu, Jian Liang, Chen Qin

14:07

12/07/2020

Interpretations are Useful: Penalizing Explanations to Align Neural Networks with Prior Knowledge

Laura Rieger, Chandan Singh, William Murdoch, Bin Yu

Keywords: Accountability, Transparency and Interpretability

15:15

14/06/2020

Hierarchically Robust Representation Learning

Qi Qian, Juhua Hu, Hao Li

Keywords: representation learning, hierarchical robustness

1:01

18/07/2021

MURAL: Meta-Learning Uncertainty-Aware Rewards for Outcome-Driven Reinforcement Learning

Kevin Li, Abhishek Gupta, Ashwin D Reddy, Vitchyr Pong, Aurick Zhou, Justin Yu, Sergey Levine

Keywords: Reinforcement Learning and Planning, Deep RL

5:16

06/12/2021

Towards Robust Bisimulation Metric Learning

Mete Kemertas, Tristan Aumentado-Armstrong

Keywords: reinforcement learning and planning, robustness, representation learning

12:24

16/11/2020

Bridging the Gap between Prior and Posterior Knowledge Selection for Knowledge-Grounded Dialogue Generation

Xiuyi Chen, Fandong Meng, Peng Li, Feilong Chen, Shuang Xu, Bo Xu, Jie Zhou

Keywords: knowledge selection, knowledge-grounded dialogue, dialogue generation, inference

10:57

30/11/2020

Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation

Siqi Yang, Lin Wu, Arnold Wiliem, Brian C. Lovell

6:32

18/07/2021

LAMDA: Label Matching Deep Domain Adaptation

Trung Le, Tuan Nguyen, Nhat Ho, Hung Bui, Dinh Phung

Keywords: Theory, Deep learning Theory

5:14

06/12/2020

Trade-offs and Guarantees of Adversarial Representation Learning for Information Obfuscation

Han Zhao, Jianfeng Chi, Yuan Tian, Geoffrey Gordon

3:17

06/12/2021

Local policy search with Bayesian optimization

Sarah Müller, Alexander von Rohr, Sebastian Trimpe

Keywords: theory, optimization, reinforcement learning and planning, active learning

11:42

06/12/2021

Explanation-based Data Augmentation for Image Classification

Sandareka Wickramanayake, Wynne Hsu, Mong Li Lee

Keywords: deep learning, machine learning, vision, interpretability

14:23

04/07/2020

Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Hanjie Chen, Guangtao Zheng, Yangfeng Ji

Keywords: Text Classification, Generating explanations, natural processing, model prediction

11:47

04/07/2020

Explaining Black Box Predictions and Unveiling Data Artifacts through Influence Functions

Xiaochuang Han, Byron C. Wallace, Yulia Tsvetkov

Keywords: NLP, model prediction, model decisions, natural inference

11:57

06/12/2020

Part-dependent Label Noise: Towards Instance-dependent Label Noise

Xiaobo Xia, Tongliang Liu, Bo Han, Nannan Wang, Mingming Gong, Haifeng Liu, Gang Niu, Dacheng Tao, Masashi Sugiyama

3:00

16/11/2020

Domain Knowledge Empowered Structured Neural Net for End-to-End Event Temporal Relation Extraction

Rujun Han, Yichao Zhou, Nanyun Peng

Keywords: extracting relations, information extraction, natural understanding, maximum inference

12:03

03/05/2021

Evaluations and Methods for Explanation through Robustness Analysis

Cheng-Yu Hsieh, Chih-Kuan Yeh, Xuanqing Liu, Pradeep K Ravikumar, Seungyeon Kim, Sanjiv Kumar, Cho-Jui Hsieh

Keywords: Interpretability, Adversarial Robustness, Explanations

5:11

06/12/2021

Variational Multi-Task Learning with Gumbel-Softmax Priors

Jiayi Shen, Xiantong Zhen, Marcel Worring, Ling Shao

Keywords: machine learning, generative model

13:09

03/05/2021

Trusted Multi-View Classification

Zongbo Han, Changqing Zhang, Huazhu FU, Joey T Zhou

Keywords: Uncertainty Machine Learning, Multi-View Learning, Multi-Modal Learning

4:33

18/07/2021

Towards Rigorous Interpretations: a Formalisation of Feature Attribution

Darius Afchar, Vincent Guigue, Romain Hennequin

Keywords: Social Aspects of Machine Learning, Fairness, Accountability, and Transparency

5:20

06/12/2021

Can contrastive learning avoid shortcut solutions?

Joshua Robinson, Li Sun, Ke Yu, Kayhan Batmanghelich, Stefanie Jegelka, Suvrit Sra

Keywords: self-supervised learning, contrastive learning

12:45

06/12/2021

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye, Xisen Jin, Xiang Ren

Keywords: machine learning, fairness, language

13:17

14/06/2020

Selective Transfer With Reinforced Transfer Network for Partial Domain Adaptation

Zhihong Chen, Chao Chen, Zhaowei Cheng, Boyuan Jiang, Ke Fang, Xinyu Jin

Keywords: partial domain adaptation, selective transfer, pixel-level information, reconstruct error, reinforcement learning

1:01

06/12/2020

Domain Generalization via Entropy Regularization

Shanshan Zhao, Mingming Gong, Tongliang Liu, Huan Fu, Dacheng Tao

3:16

06/12/2021

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang, Yueting Zhuang, zhen he, Zhuoye Ding, Yun Xiao, Bo Long

Keywords: generative model

14:54

06/12/2021

Understanding Interlocking Dynamics of Cooperative Rationalization

Mo Yu, Yang Zhang, Shiyu Chang, Tommi Jaakkola

Keywords: deep learning, language, interpretability

13:41

14/09/2020

Learning Gradient Boosted Multi-label Classification Rules

Michael Rapp, Eneldo Loza Mencía, Johannes Fürnkranz, Vu-Linh Nguyen, Eyke Hüllermeier

Keywords: multi-label classification, gradient boosting, rule learning

15:45

19/08/2021

Two Birds with One Stone: Series Saliency for Accurate and Interpretable Multivariate Time Series Forecasting

Qingyi Pan, Wenbo Hu, Ning Chen

Keywords: Machine Learning, Explainable/Interpretable Machine Learning, Time-series; Data Streams

15:01

08/12/2020

Classifier Probes May Just Learn from Linear Context Features

Jenny Kunz, Marco Kuhlmann

14:33

18/07/2021

RNNRepair: Automatic RNN Repair via Model-based Analysis

Xiaofei Xie, Wenbo Guo, Lei Ma, Wei Le, Jian Wang, Lingjun Zhou, Yang Liu, Xinyu Xing

Keywords: Social Aspects of Machine Learning, Privacy, Anonymity, and Security

5:21

04/07/2020

Learning to Faithfully Rationalize by Construction

Sarthak Jain, Sarah Wiegreffe, Yuval Pinter, Byron C. Wallace

Keywords: NLP, neural classification, training, automatic evaluations

11:55

22/09/2020

Making neural networks interpretable with attribution: Application to implicit signals prediction

Darius Afchar, Romain Hennequin

Keywords: Implicit Recommender System, Interpretable machine learning

2:28

13/04/2021

Sample elicitation

Jiaheng Wei, Zuyue Fu, Yang Liu, Xingyu Li, Zhuoran Yang, Zhaoran Wang

3:16

18/07/2021

Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

Shariq Iqbal, Christian Schroeder, Bei Peng, Wendelin Boehmer, Shimon Whiteson, Fei Sha

Keywords: Optimization, Convex Optimization, Reinforcement Learning and Planning, Multi-Agent RL, Algorithms, Large Scale Learning; Probabilistic Methods, Distributed Inference

20:08

06/12/2021

SurvITE: Learning Heterogeneous Treatment Effects from Time-to-Event Data

Alicia Curth, Changhee Lee, Mihaela van der Schaar

Keywords: deep learning, machine learning, domain adaptation, causality

13:43

26/04/2020

Towards Hierarchical Importance Attribution: Explaining Compositional Semantics for Neural Sequence Models

Xisen Jin, Zhongyu Wei, Junyi Du, Xiangyang Xue, Xiang Ren

Keywords: natural language processing, interpretability

4:58

18/07/2021

Finding Relevant Information via a Discrete Fourier Expansion

Mohsen Heidari, Jithin Sreedharan, Gil Shamir, Wojciech Szpankowski

Keywords: Theory, Statistical Learning Theory

5:25