On Identifiability in Transformers

26/04/2020

On Identifiability in Transformers

Gino Brunner, Yang Liu, Damian Pascual, Oliver Richter, Massimiliano Ciaramita, Roger Wattenhofer

Keywords: Self-attention, interpretability, identifiability, BERT, Transformer, NLP, explanation, gradient attribution

Abstract Paper Similar Papers

Abstract: In this paper we delve deep in the Transformer architecture by investigating two of its core components: self-attention and contextual embeddings. In particular, we study the identifiability of attention weights and token embeddings, and the aggregation of context into hidden tokens. We show that, for sequences longer than the attention head dimension, attention weights are not identifiable. We propose effective attention as a complementary tool for improving explanatory interpretations based on attention. Furthermore, we show that input tokens retain to a large degree their identity across the model. We also find evidence suggesting that identity information is mainly encoded in the angle of the embeddings and gradually decreases with depth. Finally, we demonstrate strong mixing of input information in the generation of contextual embeddings by means of a novel quantification method based on gradient attribution. Overall, we show that self-attention distributions are not directly interpretable and present tools to better understand and further investigate Transformer models.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICLR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Implicit Semantic Response Alignment for Partial Domain Adaptation

Wenxiao Xiao, Zhengming Ding, Hongfu Liu

Keywords Paper

domain adaptation, transfer learning

0

0

0

0

11:43

19/04/2021

Telling BERT’s full story: From local attention to global aggregation

Damian Pascual, Gino Brunner, Roger Wattenhofer

Keywords Paper

0

0

0

0

10:38

30/11/2020

Horizontal Flipping Assisted Disentangled Feature Learning for Semi-Supervised Person Re-Identification

Gehan Hao, Yang Yang, Xue Zhou and
Guanan Wang, Zhen Lei

Keywords Paper

0

0

0

0

5:09

06/12/2020

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

Alekh Agarwal, Sham Kakade, Akshay Krishnamurthy, Wen Sun

Keywords Paper

0

0

0

0

3:34

02/02/2021

Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning

Chaoqun Wang, Xuejin Chen, Shaobo Min and
Xiaoyan Sun, Houqiang Li

Keywords Paper

0

0

0

0

14:56

22/11/2021

Feature Fusion Vision Transformer for Fine-Grained Visual Categorization

Jun Wang, Xiaohan Yu, Yongsheng Gao

Keywords Paper

Fine-grained visual categorization, Vision transformer, Self-attention, Feature Fusion

0

0

0

0

3:02

07/09/2020

Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding

Binh Nguyen, Binh Nguyen, Gustavo Carneiro and
Erman Tjiputra, Quang Tran, Thanh-Toan Do

Keywords Paper

unsupervised deep metric learning, unsupervised feature learning, unsupervised metric loss, negative mining, deep clustering, pseudo labels, reconstruction, centroid representations, retrieval, multi-task

0

0

0

0

6:18

26/04/2020

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords Paper

Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

0

0

0

0

5:16

06/12/2021

HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning

Shiming Chen, Guosen Xie, Yang Liu and
Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao

Keywords Paper

generative model, domain adaptation

0

0

0

0

9:19

14/06/2020

A Shared Multi-Attention Framework for Multi-Label Zero-Shot Learning

Dat Huynh, Ehsan Elhamifar

Keywords Paper

multi-label learning, zero-shot learning, few-shot learning, attention

0

0

0

0

4:56

22/11/2021

Measuring the Biases and Effectiveness of Content-Style Disentanglement

Xiao Liu, Spyridon Thermos, Gabriele Valvano and
Agisilaos Chartsias, Alison Q O'Neil, Sotirios Tsaftaris

Keywords Paper

Disentangled Representations Learning, Content and Style Disentanglement, Metrics, Biases, Semantic Segmentation, Image to Image Translation, Pose Estimation

0

0

0

0

2:57

02/02/2021

Consistent-Separable Feature Representation for Semantic Segmentation

Xingjian He, Jing Liu, Jun Fu and
Xinxin Zhu, Jinqiao Wang, Hanqing Lu

Keywords Paper

0

0

0

0

13:37

18/07/2021

Decoupling Value and Policy for Generalization in Reinforcement Learning

Roberta Raileanu, Rob Fergus

Keywords Paper

Theory, Learning Theory, Theory, Large Deviations and Asymptotic Analysis, Reinforcement Learning and Planning, Deep RL

0

0

0

0

16:35

19/08/2021

Dependent Multi-Task Learning with Causal Intervention for Image Captioning

Wenqing Chen, Jidong Tian, Caoyun Fan and
Hao He, Yaohui Jin

Keywords Paper

Machine Learning, Transfer, Adaptation, Multi-task Learning, Natural Language Generation, Language and Vision

0

0

0

0

12:02

03/05/2021

Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis

Zhipeng Bao, Yu-Xiong Wang, Martial Hebert

Keywords Paper

adversarial training, computer vision, object recognition, few-shot learning, generative models

0

0

0

0

5:11

25/07/2020

Disentangled graph collaborative filtering

Xiang Wang, Hongye Jin, An Zhang and
Xiangnan He, Tong Xu, Tat-Seng Chua

Keywords Paper

explainable recommendation, disentangled representation learning, collaborative filtering, graph neural networks

0

0

0

0

15:17

26/04/2020

Masked Based Unsupervised Content Transfer

Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano

Keywords Paper

0

0

0

0

4:38

03/05/2021

Drop-Bottleneck: Learning Discrete Compressed Representation for Noise-Robust Exploration

Jaekyeom Kim, Minjung Kim, Dongyeon Woo, Gunhee Kim

Keywords Paper

Reinforcement learning, Information bottleneck

0

0

0

1

5:13

02/02/2021

Self-Attention Attribution: Interpreting Information Interactions Inside Transformer

Yaru Hao, Li Dong, Furu Wei, Ke Xu

Keywords Paper

0

0

0

0

16:26

02/02/2021

Deep Metric Learning with Self-Supervised Ranking

Zheren Fu, Yan Li, Zhendong Mao and
Quan Wang, Yongdong Zhang

Keywords Paper

0

0

0

0

12:36

14/06/2020

Self-Supervised Domain-Aware Generative Network for Generalized Zero-Shot Learning

Jiamin Wu, Tianzhu Zhang, Zheng-Jun Zha and
Jiebo Luo, Yongdong Zhang, Feng Wu

Keywords Paper

zero-shot learning, self-supervised learning, generative adversarial network, generative model, multi-label learning

0

0

0

0

1:01

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan and
Bin Wang, Yinghui Xu, Rong Jin

Keywords Paper

0

0

0

0

15:15

16/11/2020

Adaptive Attentional Network for Few-Shot Knowledge Graph Completion

Jiawei Sheng, Shu Guo, Zhenyu Chen and
Juwei Yue, Lihong Wang, Tingwen Liu, Hongbo Xu

Keywords Paper

few-shot completion, knowledge acquisition, link prediction, adaptive network

0

0

0

0

11:38

06/12/2021

Designing Counterfactual Generators using Deep Model Inversion

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan and
Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords Paper

optimization, representation learning, interpretability

0

0

0

0

13:28

16/11/2020

Accurate Word Alignment Induction from Neural Machine Translation

Yun Chen, Yang Liu, Guanhua Chen and
Xin Jiang, Qun Liu

Keywords Paper

transformer, attention mechanism, word methods, shift-att

0

0

0

0

11:47

26/08/2020

Deterministic Decoding for Discrete Data in Variational Autoencoders

Daniil Polykovskiy, Dmitry Vetrov

Keywords Paper

0

0

0

0

9:00

06/12/2021

SOFT: Softmax-free Transformer with Linear Complexity

Jiachen Lu, Jinghan Yao, Junge Zhang and
Xiatian Zhu, Hang Xu, Weiguo Gao, Chunjing XU, Tao Xiang, Li Zhang

Keywords Paper

robustness, transformers, language

0

0

0

0

8:04

14/06/2020

Domain-Aware Visual Bias Eliminating for Generalized Zero-Shot Learning

Shaobo Min, Hantao Yao, Hongtao Xie and
Chaoqun Wang, Zheng-Jun Zha, Yongdong Zhang

Keywords Paper

generalized zero-shot learning, domain detection, recognition, segmentation, margin loss, bilinear pooling, nas, transfer learning, domain adaption, computer vision.

0

0

0

0

0:58

03/05/2021

Learning explanations that are hard to vary

Giambattista Parascandolo, Alexander Neitz, Antonio Orvieto and
Luigi Gresele, Bernhard Schoelkopf

Keywords Paper

invariances, gradient alignment, consistency

0

0

0

0

5:16

04/07/2020

Quantifying Attention Flow in Transformers

Samira Abnar, Willem Zuidema

Keywords Paper

Quantifying Transformers, quantifying information, Attention Transformers, Transformer model

0

0

0

0

6:24

02/02/2021

Learning Intact Features by Erasing-Inpainting for Few-shot Classification

Junjie Li, Zilei Wang, Xiaoming Hu

Keywords Paper

0

0

0

0

15:15

03/05/2021

Prototypical Contrastive Learning of Unsupervised Representations

Junnan Li, Pan Zhou, Caiming Xiong, Steven Hoi

Keywords Paper

self-supervised learning, unsupervised learning, representation learning, contrastive learning

0

0

0

0

4:51

14/09/2020

Soft Labels Transfer with Discriminative Representations Learning for Unsupervised Domain Adaptation

Manliang Cao, Xiangdong Zhou, Lan Lin

Keywords Paper

unsupervised domain adaptation, distribution alignment, discriminative feature, soft labels transfer

0

0

0

0

13:34

14/06/2020

Gait Recognition via Semi-supervised Disentangled Representation Learning to Identity and Covariate Features

Xiang Li, Yasushi Makihara, Chi Xu and
Yasushi Yagi, Mingwu Ren

Keywords Paper

gait recognition, semi-supervised disentangled representation learningcovariate

0

0

0

0

1:01

22/11/2021

Domain Attention Consistency for Multi-Source Domain Adaptation

Zhongying Deng, Kaiyang Zhou, Yongxin Yang, Tao Xiang

Keywords Paper

Transferable Attribute Learning, Domain Attention Consistency, Multi-Source Domain Adaptation

0

0

0

0

9:24

06/12/2021

Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection

Jingjing Li, Wei Ji, Qi Bi and
Cheng Yan, Miao Zhang, Yongri Piao, Huchuan Lu, Li cheng

Keywords Paper

vision

0

0

0

0

9:03

19/08/2021

Learning Visual Words for Weakly-Supervised Semantic Segmentation

Lixiang Ru, Bo Du, Chen Wu

Keywords Paper

Computer Vision, Recognition, Deep Learning

0

0

0

0

13:35

02/02/2021

Unsupervised Domain Adaptation for Semantic Segmentation by Content Transfer

Suhyeon Lee, Junhyuk Hyun, Hongje Seong, Euntai Kim

Keywords Paper

0

0

0

0

15:27

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords Paper

Natural Inference, data augmentation, Robustifying Models, deep models

0

0

0

0

11:34

06/12/2020

Unsupervised Semantic Aggregation and Deformable Template Matching for Semi-Supervised Learning

Tao Han, Junyu Gao, Yuan Yuan, Qi Wang

Keywords Paper

0

0

0

0

3:22