The interplay between randomness and structure during learning in RNNs

06/12/2020

The interplay between randomness and structure during learning in RNNs

Friedrich Schuessler, Francesca Mastrogiuseppe, Alexis Dubreuil, Srdjan Ostojic, Omri Barak

Keywords: Algorithms -> Clustering, Applications -> Network Analysis

Abstract Paper Similar Papers

Abstract: Training recurrent neural networks (RNNs) on low-dimensional tasks has been widely used to model functional biological networks. However, the solutions found by learning and the effect of initial connectivity are not well understood. Here, we examine RNNs trained using gradient descent on different tasks inspired by the neuroscience literature. We find that the changes in recurrent connectivity can be described by low-rank matrices. This observation holds even in the presence of random initial connectivity, although this initial connectivity has full rank and significantly accelerates training. To understand the origin of these observations, we turn to an analytically tractable setting: training a linear RNN on a simpler task. We show how the low-dimensional task structure leads to low-rank changes to connectivity, and how random initial connectivity facilitates learning. Altogether, our study opens a new perspective to understand learning in RNNs in light of low-rank connectivity changes and the synergistic role of random initialization.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

12/07/2020

Finding trainable sparse networks through Neural Tangent Transfer

Tianlin Liu, Friedemann Zenke

Keywords Paper

Deep Learning - Algorithms

0

0

0

0

13:43

06/12/2020

What Do Neural Networks Learn When Trained With Random Labels?

Hartmut Maennel, Ibrahim Alabdulmohsin, Ilya Tolstikhin and
Robert Baldock, Olivier Bousquet, Sylvain Gelly, Daniel Keysers

Keywords Paper

0

0

0

0

3:22

06/12/2020

Towards Understanding Hierarchical Learning: Benefits of Neural Representations

Minshuo Chen, Yu Bai, Jason Lee and
Tuo Zhao, Huan Wang, Caiming Xiong, Richard Socher

Keywords Paper

0

0

0

0

3:07

06/12/2021

Learning rule influences recurrent network representations but not attractor structure in decision-making tasks

Brandon McMahan, Michael Kleinman, Jonathan Kao

Keywords Paper

deep learning, neuroscience, interpretability

0

0

0

0

5:55

06/12/2021

Training Feedback Spiking Neural Networks by Implicit Differentiation on the Equilibrium State

Mingqing Xiao, Qingyan Meng, Zongpeng Zhang and
Yisen Wang, Zhouchen Lin

Keywords Paper

deep learning

0

0

0

0

12:22

06/12/2020

Identifying Learning Rules From Neural Network Observables

Aran Nayebi, Sanjana Srivastava, Surya Ganguli, Daniel Yamins

Keywords Paper

0

0

0

0

3:12

03/05/2021

How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks

Keyulu Xu, Mozhi Zhang, Jingling Li and
Simon Du, Ken-Ichi Kawarabayashi, Stefanie Jegelka

Keywords Paper

graph neural networks, out-of-distribution, deep learning, extrapolation, deep learning theory

0

0

0

1

17:06

06/12/2021

When Are Solutions Connected in Deep Networks?

Quynh Nguyen, Pierre Bréchet, Marco Mondelli

Keywords Paper

theory, deep learning, optimization

0

0

0

0

14:44

05/01/2021

MUSCLE: Strengthening Semi-Supervised Learning via Concurrent Unsupervised Learning Using Mutual Information Maximization

Hanchen Xie, Mohamed E. Hussein, Aram Galstyan, Wael Abd-Almageed

Keywords Paper

0

0

0

0

4:49

06/12/2021

A self consistent theory of Gaussian Processes captures feature learning effects in finite CNNs

Gadi Naveh, Zohar Ringel

Keywords Paper

theory, deep learning, optimization, kernel methods

0

0

0

0

9:13

06/12/2020

Over-parameterized Adversarial Training: An Analysis Overcoming the Curse of Dimensionality

Yi Zhang, Orestis Plevrakis, Simon Du and
Xingguo Li, Zhao Song, Sanjeev Arora

Keywords Paper

0

0

0

0

2:56

06/12/2021

Differentiable Spike: Rethinking Gradient-Descent for Training Spiking Neural Networks

Yuhang Li, Yufei Guo, Shanghang Zhang and
Shikuang Deng, Yongqing Hai, Shi Gu

Keywords Paper

deep learning, optimization, machine learning

0

0

0

0

6:19

26/04/2020

Effect of Activation Functions on the Training of Overparametrized Neural Nets

Abhishek Panigrahi, Abhishek Shetty, Navin Goyal

Keywords Paper

activation functions, deep learning theory, neural networks

0

0

0

0

5:13

04/07/2020

Research on Task Discovery for Transfer Learning in Deep Neural Networks

Arda Akdemir

Keywords Paper

Task Discovery, Transfer Learning, task selection, NLP tasks

0

0

0

0

13:41

03/05/2021

DDPNOpt: Differential Dynamic Programming Neural Optimizer

Guan-Horng Liu, Tianrong Chen, Evangelos Theodorou

Keywords Paper

differential dynamica programming, trajectory optimization, deep learning training, optimal control

0

0

0

0

10:02

06/12/2021

MixACM: Mixup-Based Robustness Transfer via Distillation of Activated Channel Maps

Awais Muhammad, Fengwei Zhou, Chuanlong Xie and
Jiawei Li, Sung-Ho Bae, Zhenguo Li

Keywords Paper

deep learning, optimization, robustness, adversarial robustness and security

0

0

0

0

12:51

02/02/2021

Nearest Neighbor Classifier Embedded Network for Active Learning

Fang Wan, Tianning Yuan, Mengying Fu and
Xiangyang Ji, Qingming Huang, Qixiang Ye

Keywords Paper

0

0

0

0

15:47

06/12/2020

Minimax Lower Bounds for Transfer Learning with Linear and One-hidden Layer Neural Networks

Mohammadreza Mousavi Kalan, Zalan Fabian, Salman Avestimehr, Mahdi Soltanolkotabi

Keywords Paper

0

0

0

0

3:16

06/12/2020

Learning Parities with Neural Networks

Amit Daniely, Eran Malach

Keywords Paper

0

0

0

0

3:21

06/12/2020

A meta-learning approach to (re)discover plasticity rules that carve a desired function into a neural network

Basile Confavreux, Friedemann Zenke, Everton Agnes and
Timothy Lillicrap, Tim Vogels

Keywords Paper

0

0

0

0

3:25

18/07/2021

Training Adversarially Robust Sparse Networks via Bayesian Connectivity Sampling

Ozan Özdenizci, Robert Legenstein

Keywords Paper

Algorithms, Adversarial Examples

0

0

0

1

6:27

06/12/2020

The Surprising Simplicity of the Early-Time Learning Dynamics of Neural Networks

Wei Hu, Lechao Xiao, Ben Adlam, Jeffrey Pennington

Keywords Paper

0

0

0

0

3:20

22/11/2021

FFNB: Forgetting-Free Neural Blocks for Deep Continual Learning

Hichem Sahbi, Haoming Zhan

Keywords Paper

Continual and incremental learning, lifelong learning, catastrophic interference, catastrophic forgetting, dynamic neural networks, visual recognition

0

0

0

0

3:05

18/07/2021

On the Explicit Role of Initialization on the Convergence and Implicit Bias of Overparametrized Linear Networks

Hancheng Min, Salma Tarmoun, Rene Vidal, Enrique Mallada

Keywords Paper

Theory

0

0

0

0

5:16

26/04/2020

Training Recurrent Neural Networks Online by Learning Explicit State Variables

Somjit Nath, Vincent Liu, Alan Chan and
Xin Li, Adam White, Martha White

Keywords Paper

Recurrent Neural Network, Partial Observability, Online Prediction, Incremental Learning

0

0

0

0

5:06

26/08/2020

Neural Decomposition: Functional ANOVA with Variational Autoencoders

Kaspar Märtens, Christopher Yau

Keywords Paper

0

0

0

0

14:25

05/01/2021

Group Softmax Loss With Discriminative Feature Grouping

Takumi Kobayashi

Keywords Paper

0

0

0

0

4:49

06/12/2021

Local plasticity rules can learn deep representations using self-supervised contrastive predictions

Bernd Illing, Jean Ventura, Guillaume Bellec, Wulfram Gerstner

Keywords Paper

deep learning, neuroscience, self-supervised learning

0

0

0

0

13:09

03/05/2021

Dataset Meta-Learning from Kernel Ridge-Regression

Timothy Nguyen, Zhourong Chen, Jaehoon Lee

Keywords Paper

dataset corruption, infinite-width networks, neural kernels, kernel-ridge regression, dataset compression, dataset distillation, meta-learning

0

0

0

0

4:59

06/12/2021

Reverse engineering recurrent neural networks with Jacobian switching linear dynamical systems

Jimmy Smith, Scott Linderman, David Sussillo

Keywords Paper

deep learning, optimization, machine learning, neuroscience, interpretability

0

0

0

0

5:12

02/02/2021

Learning with Retrospection

Xiang Deng, Zhongfei Zhang

Keywords Paper

0

0

0

0

14:30

03/05/2021

Learning N:M Fine-grained Structured Sparse Neural Networks From Scratch

Aojun Zhou, Yukun Ma, Junnan Zhu and
Jianbo Liu, Zhijie Zhang, Kun Yuan, Wenxiu Sun, Hongsheng Li

Keywords Paper

sparsity, efficient training and inference.

0

0

0

0

5:09

02/02/2021

Training Spiking Neural Networks with Accumulated Spiking Flow

Hao Wu, Yueyi Zhang, Wenming Weng and
Yongting Zhang, Zhiwei Xiong, Zheng-Jun Zha, Xiaoyan Sun, Feng Wu

Keywords Paper

0

0

0

0

16:45

03/05/2021

Theoretical Analysis of Self-Training with Deep Networks on Unlabeled Data

Colin Wei, Kendrick Shen, Yining Chen, Tengyu Ma

Keywords Paper

deep learning theory, semi-supervised learning theory, unsupervised learning theory, domain adaptation theory

1

1

0

0

14:46

06/12/2021

Predify: Augmenting deep neural networks with brain-inspired predictive coding dynamics

Bhavin Choksi, Milad Mozafari, Callum Biggs O'May and
B. ADOR, Andrea Alamia, Rufin VanRullen

Keywords Paper

deep learning, machine learning, robustness, adversarial robustness and security, neuroscience, vision

0

0

0

0

11:21

06/12/2020

An analytic theory of shallow networks dynamics for hinge loss classification

Franco Pellegrini, Giulio Biroli

Keywords Paper

, Deep Learning -> Optimization for Deep Networks

0

0

0

0

3:11

18/07/2021

Selfish Sparse RNN Training

Shiwei Liu, Decebal Constantin Mocanu, Yulong Pei, Mykola Pechenizkiy

Keywords Paper

Deep Learning, Optimization for Deep Networks

0

0

0

1

4:58

12/07/2020

Deep Reinforcement Learning with Smooth Policy

Qianli Shen, Yan Li, Haoming Jiang and
Zhaoran Wang, Tuo Zhao

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

9:51

06/12/2020

Organizing recurrent network dynamics by task-computation to enable continual learning

Lea Duncker, Laura N Driscoll, Krishna V Shenoy and
Maneesh Sahani, David Sussillo

Keywords Paper

0

0

0

0

3:07

06/12/2021

On the Provable Generalization of Recurrent Neural Networks

Lifu Wang, Bo Shen, Bo Hu, Xing Cao

Keywords Paper

theory, deep learning

0

0

0

0

5:01