Staying True to Your Word: (How) Can Attention Become Explanation?

01/07/2020

Staying True to Your Word: (How) Can Attention Become Explanation?

Martin Tutek, Jan Snajder

Keywords:

Abstract Paper Similar Papers

Abstract: The attention mechanism has quickly become ubiquitous in NLP. In addition to improving performance of models, attention has been widely used as a glimpse into the inner workings of NLP models. The latter aspect has in the recent years become a common topic of discussion, most notably in recent work of Jain and Wallace; Wiegreffe and Pinter. With the shortcomings of using attention weights as a tool of transparency revealed, the attention mechanism has been stuck in a limbo without concrete proof when and whether it can be used as an explanation. In this paper, we provide an explanation as to why attention has seen rightful critique when used with recurrent networks in sequence classification tasks. We propose a remedy to these issues in the form of a word level objective and our findings give credibility for attention to provide faithful interpretations of recurrent models.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ACL Workshops virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Story Ending Generation with Multi-Level Graph Convolutional Networks over Dependency Trees

Qingbao Huang, Linzhang Mo, Pijian Li and
Yi Cai, Qingguang Liu, Jielong Wei, Qing Li, Ho-fung Leung

Keywords Paper

0

0

0

0

17:34

18/07/2021

Transfer-Based Semantic Anomaly Detection

Lucas Deecke, Lukas Ruff, Rob Vandermeulen, Hakan Bilen

Keywords Paper

Algorithms, Unsupervised Learning

0

0

0

0

4:52

06/12/2021

Self-Interpretable Model with Transformation Equivariant Interpretation

Yipei Wang, Xiaoqian Wang

Keywords Paper

deep learning, machine learning, vision

0

0

0

0

8:26

02/02/2021

Entity Guided Question Generation with Contextual Structure and Sequence Information Capturing

Qingbao Huang, Mingyi Fu, Linzhang Mo and
Yi Cai, Jingyun Xu, Pijian Li, Qing Li, Ho-fung Leung

Keywords Paper

0

0

0

0

19:41

02/02/2021

Adversarial Pose Regression Network for Pose-Invariant Face Recognitions

Pengyu Li, Biao Wang, Lei Zhang

Keywords Paper

0

0

0

0

15:17

14/06/2020

Cascaded Human-Object Interaction Recognition

Tianfei Zhou, Wenguan Wang, Siyuan Qi and
Haibin Ling, Jianbing Shen

Keywords Paper

human-object interaction recognition, cascade reasoning, fine-grained relation segmentation

0

0

0

0

1:01

30/11/2020

Attended-Auxiliary Supervision Representation for Face Anti-spoofing

Son Minh Nguyen, Linh Duy Tran, Masayuki Arai

Keywords Paper

0

0

0

0

6:50

06/12/2020

Reconsidering Generative Objectives For Counterfactual Reasoning

Danni Lu, Chenyang Tao, Junya Chen and
Fan Li, Feng Guo, Lawrence Carin

Keywords Paper

0

0

0

0

3:22

03/05/2021

Property Controllable Variational Autoencoder via Invertible Mutual Dependence

Xiaojie Guo, Yuanqi Du, Liang Zhao

Keywords Paper

deep generative models, disentangled representation learning, interpretable latent representation

0

0

0

0

4:45

06/12/2020

Incorporating Interpretable Output Constraints in Bayesian Neural Networks

Wanqian Yang, Lars Lorch, Moritz Graule and
Himabindu Lakkaraju, Finale Doshi-Velez

Keywords Paper

0

0

0

0

3:02

14/06/2020

Exploring Categorical Regularization for Domain Adaptive Object Detection

Chang-Dong Xu, Xing-Ran Zhao, Xin Jin, Xiu-Shen Wei

Keywords Paper

domain adaptive object detection, image-level categorical regularization, categorical consistency regularization, domain adaptive faster r-cnn

0

0

0

0

1:00

16/11/2020

APE: Argument Pair Extraction from Peer Review and Rebuttal via Multi-task Learning

Liying Cheng, Lidong Bing, Qian Yu and
Wei Lu, Luo Si

Keywords Paper

peer review, argument task, sequence task, text task

0

0

0

0

11:05

04/07/2020

Reverse Engineering Configurations of Neural Text Generation Models

Yi Tay, Dara Bahri, Che Zheng and
Clifford Brunk, Donald Metzler, Andrew Tomkins

Keywords Paper

Reverse Models, neural modeling, Neural Models, generative models

0

0

0

0

6:16

22/11/2021

MAGECally invert images for realistic editing

Asya Grechka, jean Francois Goudou, Matthieu Cord

Keywords Paper

gan inversion, gan, stylegan2, gan editing, image editing, gan projection, stylegan, semantic editing, latent space manipulation, latent editing

0

0

0

0

3:01

12/07/2020

Learning and Simulation in Generative Structured World Models

Zhixuan Lin, Yi-Fu Wu, Skand Peri and
Bofeng Fu, Jindong Jiang, Sungjin Ahn

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

11:56

18/07/2021

Systematic Analysis of Cluster Similarity Indices: How to Validate Validation Measures

Martijn Gösgens, Aleksei Tikhonov, Liudmila Prokhorenkova

Keywords Paper

Algorithms, Clustering

0

0

0

0

5:12

18/07/2021

A Bit More Bayesian: Domain-Invariant Learning with Uncertainty

Zehao Xiao, Jiayi Shen, Xiantong Zhen and
Ling Shao, Cees Snoek

Keywords Paper

Algorithms, Model Selection and Structure Learning, Applications, Computational Biology and Bioinformatics; Applications, Health; Deep Learning, Adversarial Networks; Theory, Deep Learning, Bayesian Deep Learning

0

0

0

0

5:46

04/07/2020

SEEK: Segmented Embedding of Knowledge Graphs

Wentao Xu, Shun Zheng, Liang He and
Bin Shao, Jian Yin, Tie-Yan Liu

Keywords Paper

Segmented Graphs, knowledge embedding, artificial intelligence, recommendation

0

0

0

0

12:01

18/07/2021

Inverse Decision Modeling: Learning Interpretable Representations of Behavior

Daniel Jarrett, Alihan Hüyük, Mihaela van der Schaar

Keywords Paper

Reinforcement Learning and Planning, Others

0

0

0

0

16:11

04/07/2020

ClarQ: A large-scale and diverse dataset for Clarification Question Generation

Vaibhav Kumar, Alan W Black

Keywords Paper

Clarification Generation, Question answering, classifying questions, downstream question-answering

0

0

0

0

6:17

12/07/2020

Learning Structured Latent Factors from Dependent Data:A Generative Model Framework from Information-Theoretic Perspective

Ruixiang ZHANG, Katsuhiko Ishiguro, Masanori Koyama

Keywords Paper

Learning Theory

0

0

0

0

14:46

12/07/2020

Interpretations are Useful: Penalizing Explanations to Align Neural Networks with Prior Knowledge

Laura Rieger, Chandan Singh, William Murdoch, Bin Yu

Keywords Paper

Accountability, Transparency and Interpretability

0

0

0

0

15:15

19/08/2021

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Yuxuan Han, Jiaolong Yang, Ying Fu

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Explainable/Interpretable Machine Learning

0

0

0

0

12:51

03/05/2021

Bayesian Context Aggregation for Neural Processes

Michael Volpp, Fabian Flürenbrock, Lukas Grossberger and
Christian Daniel, Gerhard Neumann

Keywords Paper

Neural Processes, Multi-task Learning, Deep Sets, Meta Learning, Latent Variable Models, Aggregation Methods

0

0

0

0

5:04

16/11/2020

Understanding the Mechanics of SPIGOT: Surrogate Gradients for Latent Structure Learning

Tsvetomila Mihaylova, Vlad Niculae, André F. T. Martins

Keywords Paper

pipeline systems, ste, latent models, end-to-end training

0

0

0

0

11:50

19/04/2021

Expanding, retrieving and infilling: Diversifying cross-domain question generation with flexible templates

Xiaojing Yu, Anxiao Jiang

Keywords Paper

0

0

0

0

11:40

16/11/2020

Distilling Structured Knowledge for Text-Based Relational Reasoning

Jin Dong, Marc-Antoine Rondeau, William L. Hamilton

Keywords Paper

relational reasoning, reasoning task, cross-modal transfer, text-based systems

0

0

0

0

5:15

03/05/2021

Evaluations and Methods for Explanation through Robustness Analysis

Cheng-Yu Hsieh, Chih-Kuan Yeh, Xuanqing Liu and
Pradeep K Ravikumar, Seungyeon Kim, Sanjiv Kumar, Cho-Jui Hsieh

Keywords Paper

Interpretability, Adversarial Robustness, Explanations

0

0

0

0

5:11

02/02/2021

Generate Your Counterfactuals: Towards Controlled Counterfactual Generation for Text

Nishtha Madaan, Inkit Padhi, Naveen Panwar, Diptikalyan Saha

Keywords Paper

0

0

0

0

20:15

06/12/2020

Joint Contrastive Learning with Infinite Possibilities

Qi Cai, Yu Wang, Yingwei Pan and
Ting Yao, Tao Mei

Keywords Paper

0

0

0

0

3:06

16/11/2020

A Diagnostic Study of Explainability Techniques for Text Classification

Pepa Atanasova, Jakob Grue Simonsen, Christina Lioma, Isabelle Augenstein

Keywords Paper

downstream tasks, machine learning, explainability techniques, diverse techniques

0

0

0

0

11:24

02/02/2021

Visualization of Supervised and Self-Supervised Neural Networks via Attribution Guided Factorization

Shir Gur, Ameen Ali, Lior Wolf

Keywords Paper

0

0

0

0

14:14

02/02/2021

Dynamic Neuro-Symbolic Knowledge Graph Construction for Zero-shot Commonsense Question Answering

Antoine Bosselut, Ronan Le Bras, Yejin Choi

Keywords Paper

0

0

0

0

20:36

02/02/2021

Learning to Copy Coherent Knowledge for Response Generation

Jiaqi Bai, Ze Yang, Xinnian Liang and
Wei Wang, Zhoujun Li

Keywords Paper

0

0

0

0

14:15

14/06/2020

Attention-Based Context Aware Reasoning for Situation Recognition

Thilini Cooray, Ngai-Man Cheung, Wei Lu

Keywords Paper

situation recognition, visual semantic role labelling, scene understanding, vision and language, action recognition

0

0

0

0

1:00

16/11/2020

Neural Topic Modeling with Cycle-Consistent Adversarial Training

Xuemeng Hu, Rui Wang, Deyu Zhou, Yuxuan Xiong

Keywords Paper

neural modeling, deep models, adversarial-neural model, adversarially network

0

0

0

1

9:57

03/05/2021

Representation Balancing Offline Model-based Reinforcement Learning

Byung-Jun Lee, Jongmin Lee, Kee-Eung Kim

Keywords Paper

Off-policy policy evaluation, Batch Reinforcement Learning, Offline Reinforcement Learning, Model-based Reinforcement Learning, Reinforcement Learning

0

0

0

0

5:45

26/04/2020

Counterfactuals uncover the modular structure of deep generative models

Michel Besserve, Arash Mehrjou, Rémy Sun, Bernhard Schölkopf

Keywords Paper

generative models, causality, counterfactuals, representation learning, disentanglement, generalization, unsupervised learning

0

0

0

0

5:42

02/02/2021

Bayes-TrEx: a Bayesian Sampling Approach to Model Transparency by Example

Serena Booth, Yilun Zhou, Ankit Shah, Julie Shah

Keywords Paper

0

0

0

0

15:00

03/08/2020

Walking on Two Legs: Learning Image Segmentation with Noisy Labels

Guohua Cheng, Hongli Ji, Yan Tian

Keywords Paper

0

0

0

0

10:02