06/12/2021

On Plasticity, Invariance, and Mutually Frozen Weights in Sequential Task Learning

Julian Zilly, Alessandro Achille, Andrea Censi, Emilio Frazzoli

Keywords: deep learning, machine learning, continual learning, transfer learning

Abstract: Plastic neural networks have the ability to adapt to new tasks. However, in a continual learning setting, the configuration of parameters learned in previous tasks can severely reduce the adaptability to future tasks. In particular, we show that, when using weight decay, weights in successive layers of a deep network may become "mutually frozen". This has a double effect: on the one hand, it makes the network updates more invariant to nuisance factors, providing a useful bias for future tasks. On the other hand, it can prevent the network from learning new tasks that require significantly different features. In this context, we find that the local input sensitivity of a deep model is correlated with its ability to adapt, thus leading to an intriguing trade-off between adaptability and invariance when training a deep model more than once. We then show that a simple intervention that "resets" the mutually frozen connections can improve transfer learning on a variety of visual classification tasks. The efficacy of "resetting" itself depends on the size of the target dataset and the difference between the pre-training and target domains, allowing us to achieve state-of-the-art results on some datasets.
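To make the "resetting" intervention concrete, below is a minimal sketch in PyTorch. It assumes one plausible reading of the abstract: connections whose weights have collapsed toward zero under weight decay are treated as "mutually frozen" and are reinitialized before fine-tuning on the target task. The function name, the magnitude threshold, and the choice of reinitialization are illustrative assumptions, not the authors' exact procedure.

import torch.nn as nn

def reset_frozen_weights(model: nn.Module, threshold: float = 1e-3) -> int:
    """Reinitialize weight tensors whose entries have collapsed toward zero.

    Hypothetical interpretation of "resetting" mutually frozen connections:
    any Linear/Conv2d weight whose mean absolute value falls below `threshold`
    is reinitialized before fine-tuning on the target task.
    Returns the number of layers that were reset.
    """
    num_reset = 0
    for module in model.modules():
        if isinstance(module, (nn.Linear, nn.Conv2d)):
            if module.weight.abs().mean().item() < threshold:
                nn.init.kaiming_normal_(module.weight)
                if module.bias is not None:
                    nn.init.zeros_(module.bias)
                num_reset += 1
    return num_reset

# Example usage (hypothetical): reset collapsed layers of a pre-trained
# backbone, then fine-tune it on the target dataset as usual.
# backbone = torchvision.models.resnet18(pretrained=True)
# print(f"Reset {reset_frozen_weights(backbone)} layers before fine-tuning.")

Per the abstract, whether such a reset helps would depend on the size of the target dataset and how far the target domain is from the pre-training domain.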

Talk and paper published at the NeurIPS 2021 virtual conference.
