TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP

03/08/2020

TX-Ray: Quantifying and Explaining Model-Knowledge Transfer in (Un-)Supervised NLP

Nils Rethmeier, Vageesh Kumar Saxena, Isabelle Augenstein

Keywords:

Abstract Paper Similar Papers

Abstract: While state-of-the-art NLP explainability (XAI) methods focus on explaining per-sample decisions in supervised end or probing tasks, this is insufficient to explain and quantify model knowledge transfer during (un-)supervised training. Thus, for TX-Ray, we modify the established computer vision explainability principle of ‘visualizing preferred inputs of neurons’ to make it usable for both NLP and for transfer analysis. This allows one to analyze, track and quantify how self- or supervised NLP models first build knowledge abstractions in pretraining (1), andthen transfer abstractions to a new domain (2), or adapt them during supervised finetuning (3) – see Fig. 1. TX-Ray expresses neurons as feature preference distributions to quantify fine-grained knowledge transfer or adaptation and guide human analysis. We find that, similar to Lottery Ticket based pruning, TX-Ray based pruning can improve test set generalization and that it can reveal how early stages of self-supervision automatically learn linguistic abstractions like parts-of-speech.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at UAI 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

Quantifying Learnability and Describability of Visual Concepts Emerging in Representation Learning

Iro Laina, Ruth Fong, Andrea Vedaldi

Keywords Paper

Algorithms -> Image Segmentation; Algorithms -> Semi-Supervised Learning; Applications -> Computer Vision; Applications -> Imag, Algorithms -> Adversarial Learning

0

0

0

0

3:25

04/07/2020

CompGuessWhat?!: A Multi-task Evaluation Framework for Grounded Language Learning

Alessandro Suglia, Ioannis Konstas, Andrea Vanzo and
Emanuele Bastianelli, Desmond Elliott, Stella Frank, Oliver Lemon

Keywords Paper

Grounded Learning, Goal-oriented evaluation, Object evaluation, Zero-shot evaluation

0

0

0

0

11:09

06/12/2021

Impression learning: Online representation learning with synaptic plasticity

Colin Bredenberg, Benjamin Lyo, Eero P Simoncelli, Cristina Savin

Keywords Paper

neuroscience, representation learning

0

0

0

0

14:11

12/07/2020

On Variational Learning of Controllable Representations for Text without Supervision

Peng Xu, Jackie Chi Kit Cheung, Yanshuai Cao

Keywords Paper

Representation Learning

0

0

0

0

14:51

04/07/2020

On the Linguistic Representational Power of Neural Machine Translation Models

Yonatan Belinkov, Nadir Durrani, Fahim Dalvi and
Hassan Sajjad, James Glass

Keywords Paper

Linguistic Models, natural processing, artificial intelligence, translating languages

0

0

0

0

19:17

02/02/2021

Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition

Siteng Huang, Min Zhang, Yachen Kang, Donglin Wang

Keywords Paper

0

0

0

0

17:04

06/12/2021

Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation

Lin Guan, Mudit Verma, Suna (Sihang) Guo and
Ruohan Zhang, Subbarao Kambhampati

Keywords Paper

reinforcement learning and planning, machine learning

0

0

0

0

13:41

06/12/2020

Compositional Explanations of Neurons

Jesse Mu, Jacob Andreas

Keywords Paper

0

0

0

0

3:16

18/07/2021

Explore Visual Concept Formation for Image Classification

Shengzhou Xiong, Yihua Tan, Guoyou Wang

Keywords Paper

Deep Learning

0

0

0

0

5:10

01/07/2020

CopyBERT: A Unified Approach to Question Generation with Self-Attention

Stalin Varanasi, Saadullah Amin, Guenter Neumann

Keywords Paper

0

0

0

0

12:35

03/05/2021

Learning and Evaluating Representations for Deep One-Class Classification

Kihyuk Sohn, Chun-Liang Li, Jinsung Yoon and
Minho Jin, Tomas Pfister

Keywords Paper

self-supervised learning, deep one-class classification

0

0

0

1

5:13

06/12/2020

Learning Semantic-aware Normalization for Generative Adversarial Networks

Heliang Zheng, Jianlong Fu, zengyh Zeng and
Jiebo Luo, Zheng-Jun Zha

Keywords Paper

0

0

0

0

3:11

22/11/2021

Rich Semantics Improve Few-Shot Learning

Mohamed Afham Mohamed Aflal, Salman Khan, Muhammad Haris Khan and
Muzammal Naseer, Fahad Shahbaz Khan

Keywords Paper

few shot learning, multimodal learning, transformers in vision

0

0

0

0

2:47

26/04/2020

Decoupling Representation and Classifier for Long-Tailed Recognition

Bingyi Kang, Saining Xie, Marcus Rohrbach and
Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis

Keywords Paper

long-tailed recognition, classification

0

0

0

1

5:00

03/05/2021

The role of Disentanglement in Generalisation

Milton Montero, Casimir JH Ludwig, Rui Ponte Costa and
Gaurav Malhotra, Jeffrey Bowers

Keywords Paper

generalisation, compositional generalization, generative models, compositionality, variational autoencoders, disentanglement

0

0

0

0

4:16

19/08/2021

Information Bottleneck Approach to Spatial Attention Learning

Qiuxia Lai, Yu Li, Ailing Zeng and
Minhao Liu, Hanqiu Sun, Qiang Xu

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Classification, Deep Learning

0

0

0

0

14:42

16/11/2020

Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation

Minki Kang, Moonsu Han, Sung Ju Hwang

Keywords Paper

self-supervised pre-training, question answering, task, reinforcement learning

0

0

0

0

12:00

14/09/2020

Diversity-Based Generalization for Unsupervised Text Classification under Domain Shift

Jitin Krishnan, Hemant Purohit, Huzefa Rangwala

Keywords Paper

text classification, unsupervised domain adaptation, natural language processing, neural networks

0

0

0

0

16:13

03/05/2021

Grounded Language Learning Fast and Slow

Felix Hill, Olivier Tieleman, Tamara von Glehn and
Nathaniel Wong, Hamza Merzic, Stephen Clark

Keywords Paper

memory, meta-learning, word-learning, grounding, fast-mapping, language, cognition

0

0

0

0

11:44

04/07/2020

Information-Theoretic Probing for Linguistic Structure

Tiago Pimentel, Josef Valvoda, Rowan Hall Maudslay and
Ran Zmigrod, Adina Williams, Ryan Cotterell

Keywords Paper

Information-Theoretic Probing, NLP tasks, linguistic task, probing

0

0

0

0

10:30

03/05/2021

Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning

Dong Bok Lee, Dongchan Min, Seanie Lee, Sung Ju Hwang

Keywords Paper

Unsupervised Learning, Variational Autoencoders, Unsupervised Meta-learning, Meta-Learning

0

0

0

0

13:31

19/08/2021

Novelty Detection via Contrastive Learning with Negative Data Augmentation

Chengwei Chen, Yuan Xie, Shaohui Lin and
Ruizhi Qiao, Jian Zhou, Xin Tan, Yi Zhang, Lizhuang Ma

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Anomaly/Outlier Detection

0

0

0

0

14:37

06/12/2021

Supervising the Transfer of Reasoning Patterns in VQA

Corentin Kervadec, Christian Wolf, Grigory Antipov and
Moez Baccouche, Madiha Nadri

Keywords Paper

theory, deep learning, vision

0

0

0

0

12:54

03/05/2021

Adversarially-Trained Deep Nets Transfer Better: Illustration on Image Classification

Francisco Utrera, Evan Kravitz, N. Benjamin Erichson and
Rajiv Khanna, Michael W Mahoney

Keywords Paper

adversarial training, limited data, influence functions, transfer learning

0

0

0

0

5:12

02/02/2021

Exploring Explainable Selection to Control Abstractive Summarization

Haonan Wang, Yang Gao, Yu Bai and
Mirella Lapata, Heyan Huang

Keywords Paper

0

0

0

0

18:50

06/12/2021

Low-dimensional Structure in the Space of Language Representations is Reflected in Brain Responses

Richard Antonello, Javier S Turek, Vy Vo, Alexander Huth

Keywords Paper

vision, language, transfer learning

0

0

0

0

10:29

26/04/2020

Progressive learning and disentanglement of hierarchical representations

Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang

Keywords Paper

generative model, disentanglement, progressive learning, VAE

0

0

0

0

5:06

06/12/2021

Refining Language Models with Compositional Explanations

Huihan Yao, Ying Chen, Qinyuan Ye and
Xisen Jin, Xiang Ren

Keywords Paper

machine learning, fairness, language

0

0

0

0

13:17

06/12/2021

Visualizing the Emergence of Intermediate Visual Patterns in DNNs

Mingjie Li, Shaobo Wang, Quanshi Zhang

Keywords Paper

deep learning, adversarial robustness and security

0

0

0

0

7:13

14/06/2020

NestedVAE: Isolating Common Factors via Weak Supervision

Matthew J. Vowels, Necati Cihan Camgöz, Richard Bowden

Keywords Paper

fairness, bias, representation learning, invariance, vae, variational, weakly supervised, information bottleneck

0

0

0

0

1:00

04/07/2020

Cross-Lingual Unsupervised Sentiment Classification with Multi-View Transfer Learning

Hongliang Fei, Ping Li

Keywords Paper

Cross-Lingual Classification, sentiment classification, unsupervised system, classification

0

0

0

0

12:23

16/11/2020

CSP:Code-Switching Pre-training for Neural Machine Translation

Zhen Yang, Bojie Hu, Ambyera Han and
Shen Huang, Qi Ju

Keywords Paper

neural nmt, lexicon induction, unsupervised nmt, pre-training method

0

0

0

0

10:10

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

16/11/2020

Learning to Represent Image and Text with Denotation Graph

Bowen Zhang, Hexiang Hu, Vihan Jain and
Eugene Ie, Fei Sha

Keywords Paper

cross-modal retrieval, referring expression, compositional recognition, pre-training

0

0

0

0

10:59

02/02/2021

What's the Best Place for an AI Conference, Vancouver or _______: Why Completing Comparative Questions is Difficult

‪Avishai Zagoury‬, Einat Minkov, Idan Szpektor, William W. Cohen

Keywords Paper

0

0

0

0

15:15

05/01/2021

Integrating Human Gaze Into Attention for Egocentric Activity Recognition

Kyle Min, Jason J. Corso

Keywords Paper

0

0

0

0

4:56

12/07/2020

Probing Emergent Semantics in Predictive Agents via Question Answering

Abhishek Das, Federico Carnevale, Hamza Merzic and
Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Greg Wayne, Feilx Hill

Keywords Paper

Applications - Neuroscience, Cognitive Science, Biology and Health

0

0

0

0

12:31

02/02/2021

A Continual Learning Framework for Uncertainty-Aware Interactive Image Segmentation

Ervine Zheng, Qi Yu, Rui Li and
Pengcheng Shi, Anne Haake

Keywords Paper

0

0

0

0

14:21

30/11/2020

Sequential View Synthesis with Transformer

Phong Nguyen-Ha, Lam Huynh, Esa Rahtu, Janne Heikkila

Keywords Paper

0

0

0

0

9:38

03/05/2021

Self-supervised Learning from a Multi-view Perspective

Yao-Hung Hubert Tsai, Yue Wu, Ruslan Salakhutdinov, LP Morency

Keywords Paper

Self-supervised Learning, Unsupervised Learning, Multi-view Representation Learning

0

0

0

0

5:36