03/05/2021

The geometry of integration in text classification RNNs

Kyle Aitken, Vinay Ramasesh, Ankush Garg, Yuan Cao, David Sussillo, Niru Maheswaranathan

Keywords: interpretability, dynamical systems, reverse engineering, document classification, recurrent neural networks

Abstract: Despite the widespread application of recurrent neural networks (RNNs), a unified understanding of how RNNs solve particular tasks remains elusive. In particular, it is unclear what dynamical patterns arise in trained RNNs, and how those patterns depend on the training dataset or task. This work addresses these questions in the context of text classification, building on earlier work studying the dynamics of binary sentiment-classification networks (Maheswaranathan et al., 2019). We study text-classification tasks beyond the binary case, exploring the dynamics of RNNs trained on both natural and synthetic datasets. These dynamics, which we find to be both interpretable and low-dimensional, share a common mechanism across architectures and datasets: specifically, these text-classification networks use low-dimensional attractor manifolds to accumulate evidence for each class as they process the text. The dimensionality and geometry of the attractor manifold are determined by the structure of the training dataset, with the dimensionality reflecting the number of scalar quantities the network remembers in order to classify. In categorical classification, for example, we show that this dimensionality is one less than the number of classes. Correlations in the dataset, such as those induced by ordering, can further reduce the dimensionality of the attractor manifold; we show how to predict this reduction using simple word-count statistics computed on the training dataset. To the degree that integration of evidence towards a decision is a common computational primitive, this work continues to lay the foundation for using dynamical systems techniques to study the inner workings of RNNs.
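
The abstract's claim that an N-class classifier's evidence lives on a roughly (N - 1)-dimensional manifold can be illustrated numerically. The snippet below is a minimal sketch, not the paper's code: it simulates evidence accumulation for synthetic documents (the names n_classes, hidden_dim, and the random embedding are illustrative assumptions) and applies PCA to the pooled hidden states to estimate their effective dimensionality.

```python
# Illustrative sketch (not the paper's code): checks that accumulated
# evidence for n_classes classes spans an (n_classes - 1)-dimensional
# subspace, mirroring the dimensionality claim in the abstract.
import numpy as np

rng = np.random.default_rng(0)

n_classes = 5          # number of output classes (illustrative)
hidden_dim = 128       # nominal RNN hidden-state size (illustrative)
n_steps = 200          # tokens per synthetic document
n_docs = 50            # number of synthetic documents

# Per-token evidence increments; subtracting the mean across classes
# removes the direction that carries no class information, so the
# running sums span at most n_classes - 1 dimensions.
evidence = rng.normal(size=(n_docs, n_steps, n_classes))
evidence -= evidence.mean(axis=-1, keepdims=True)
accumulated = np.cumsum(evidence, axis=1)          # (docs, steps, classes)

# Embed the low-dimensional evidence into the hidden space with a fixed
# random linear map, standing in for the network's recurrent geometry.
embedding = rng.normal(size=(n_classes, hidden_dim))
hidden_states = accumulated @ embedding            # (docs, steps, hidden)

# PCA via SVD on the pooled, centered hidden states.
flat = hidden_states.reshape(-1, hidden_dim)
flat = flat - flat.mean(axis=0, keepdims=True)
singular_values = np.linalg.svd(flat, compute_uv=False)
variance_ratio = singular_values**2 / np.sum(singular_values**2)

# Number of components needed to explain 95% of the variance; under the
# construction above this should come out to n_classes - 1.
n_components = int(np.searchsorted(np.cumsum(variance_ratio), 0.95)) + 1
print(f"Effective dimensionality: {n_components} (expected {n_classes - 1})")
```

PCA on pooled hidden states is a standard first probe of this kind of low-dimensional structure; it stands in here for the more detailed fixed-point and manifold analyses used in the dynamical-systems literature.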

[Embedded video: the talk and paper were published at the ICLR 2021 virtual conference.]
