What is being transferred in transfer learning?

06/12/2020

What is being transferred in transfer learning?

Behnam Neyshabur, Hanie Sedghi, Chiyuan Zhang

Keywords:

Abstract Paper Similar Papers

Abstract: One desired capability for machines is the ability to transfer their understanding of one domain to another domain where data is (usually) scarce. Despite ample adaptation of transfer learning in many deep learning applications, we yet do not understand what enables a successful transfer and which part of the network is responsible for that. In this paper, we provide new tools and analysis to address these fundamental questions. Through a series of analysis on transferring to block-shuffled images, we separate the effect of feature reuse from learning high-level statistics of data and show that some benefit of transfer learning comes from the latter. We present that when training from pre-trained weights, the model stays in the same basin in the loss landscape and different instances of such model are similar in feature space and close in parameter space.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2020

A Combinatorial Perspective on Transfer Learning

Jianan Wang, Eren Sezener, David Budden and
Marcus Hutter, Joel Veness

Keywords Paper

0

0

0

0

3:22

19/08/2021

Regularising Knowledge Transfer by Meta Functional Learning

Pan Li, Yanwei Fu, Shaogang Gong

Keywords Paper

Machine Learning, Classification, Transfer, Adaptation, Multi-task Learning, Weakly Supervised Learning

0

0

0

0

13:41

14/06/2020

Multi-Mutual Consistency Induced Transfer Subspace Learning for Human Motion Segmentation

Tao Zhou, Huazhu Fu, Chen Gong and
Jianbing Shen, Ling Shao, Fatih Porikli

Keywords Paper

human motion segmentation, transfer subspace learning, multi-level features, multi-mutual consistency learning.

0

0

0

0

1:00

04/07/2020

Research on Task Discovery for Transfer Learning in Deep Neural Networks

Arda Akdemir

Keywords Paper

Task Discovery, Transfer Learning, task selection, NLP tasks

0

0

0

0

13:41

26/04/2020

Continual Learning with Adaptive Weights (CLAW)

Tameem Adel, Han Zhao, Richard E. Turner

Keywords Paper

Continual learning

0

0

0

0

4:58

16/11/2020

Self-Paced Learning for Neural Machine Translation

Yu Wan, Baosong Yang, Derek F. Wong and
Yikai Zhou, Lidia S. Chao, Haibo Zhang, Boxing Chen

Keywords Paper

neural, curriculum learning, translation tasks, nmt

0

0

0

0

6:03

06/12/2020

Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

Massimiliano Patacchiola, Jack Turner, Elliot Crowley and
Michael O'Boyle, Amos Storkey

Keywords Paper

Deep Learning; Deep Learning -> CNN Architectures; Theory -> Spaces of Functions and Kernels, Theory

0

0

0

0

3:11

06/12/2020

Organizing recurrent network dynamics by task-computation to enable continual learning

Lea Duncker, Laura N Driscoll, Krishna V Shenoy and
Maneesh Sahani, David Sussillo

Keywords Paper

0

0

0

0

3:07

03/05/2021

Conditional Generative Modeling via Learning the Latent Space

Sameera Ramasinghe, Kanchana Ranasinghe, Salman Khan and
Nick Barnes, Stephen Gould

Keywords Paper

Generative Modeling, Conditional Generation, Multimodal Spaces

0

0

0

0

4:57

03/05/2021

Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral

Lucio Dery, Yann Dauphin, David Grangier

Keywords Paper

multitask learning, deeplearning, pre-training, gradient decomposition

0

0

0

0

5:22

12/07/2020

Unsupervised Transfer Learning for Spatiotemporal Predictive Networks

Zhiyu Yao, Yunbo Wang, Mingsheng Long, Jianmin Wang

Keywords Paper

Sequential, Network, and Time-Series Modeling

0

0

0

0

15:19

06/12/2020

Probabilistic Active Meta-Learning

Jean Kaddour, Steindor Saemundsson, Marc Deisenroth

Keywords Paper

0

0

0

0

3:17

12/07/2020

Multigrid Neural Memory

Tri Huynh, Michael Maire, Matthew Walter

Keywords Paper

Deep Learning - General

0

0

0

0

13:47

06/12/2020

Learning Invariants through Soft Unification

Nuri Cingillioglu, Alessandra Russo

Keywords Paper

Algorithms -> Density Estimation; Algorithms -> Unsupervised Learning; Deep Learning, Deep Learning -> Generative Models

0

0

0

0

3:23

26/04/2020

A Neural Dirichlet Process Mixture Model for Task-Free Continual Learning

Soochan Lee, Junsoo Ha, Dongsu Zhang, Gunhee Kim

Keywords Paper

continual learning, task-free, task-agnostic

0

0

0

0

5:08

06/12/2020

Network-to-Network Translation with Conditional Invertible Neural Networks

Robin Rombach, Patrick Esser, Bjorn Ommer

Keywords Paper

0

0

0

0

3:25

18/07/2021

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

0

0

0

0

4:15

06/12/2020

Ensemble Distillation for Robust Model Fusion in Federated Learning

Tao Lin, Lingjing Kong, Sebastian Stich, Martin Jaggi

Keywords Paper

0

0

0

0

2:59

03/05/2021

Dataset Meta-Learning from Kernel Ridge-Regression

Timothy Nguyen, Zhourong Chen, Jaehoon Lee

Keywords Paper

dataset corruption, infinite-width networks, neural kernels, kernel-ridge regression, dataset compression, dataset distillation, meta-learning

0

0

0

0

4:59

03/05/2021

Generalized Multimodal ELBO

Thomas Sutter, Imant Daunhawer, Julia E Vogt

Keywords Paper

self-supervised, generative learning, ELBO, VAE, Multimodal

0

0

0

0

5:15

06/12/2020

Task-Agnostic Online Reinforcement Learning with an Infinite Mixture of Gaussian Processes

Mengdi Xu, Wenhao Ding, Jiacheng Zhu and
ZUXIN LIU, Baiming Chen, Ding Zhao

Keywords Paper

0

0

0

0

3:21

03/05/2021

Meta-Learning with Neural Tangent Kernels

Yufan Zhou, Zhenyi Wang, Jiayi Xian and
Changyou Chen, Jinhui Xu

Keywords Paper

neural tangent kernel, meta-learning

0

0

0

0

3:54

03/05/2021

Few-Shot Bayesian Optimization with Deep Kernel Surrogates

Martin Wistuba, Josif Grabocka

Keywords Paper

automl, bayesian optimization, metalearning, few-shot learning

0

0

0

0

5:18

06/12/2021

Revisit Multimodal Meta-Learning through the Lens of Multi-Task Learning

Milad Abdollahzadeh, Touba Malekzadeh, Ngai-Man (Man) Cheung

Keywords Paper

meta learning, few shot learning

0

0

0

0

13:08

18/07/2021

Bayesian Structural Adaptation for Continual Learning

Abhishek Kumar, Sunabha Chatterjee, Piyush Rai

Keywords Paper

Probabilistic Methods, Bayesian Methods

0

0

0

0

7:39

08/12/2020

Dynamic Curriculum Learning for Low-Resource Neural Machine Translation

Chen Xu, Bojie Hu, Yufan Jiang and
Kai Feng, Zeyang Wang, Shen Huang, Qi Ju, Tong Xiao, Jingbo Zhu

Keywords Paper

0

0

0

0

13:28

03/05/2021

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh, Huihan Liu, Gaoyue Zhou and
Albert Yu, Nicholas Rhinehart, Sergey Levine

Keywords Paper

reinforcement learning, imitation learning

0

0

0

0

14:21

19/04/2021

Keep learning: Self-supervised meta-learning for learning from inference

Akhil Kedia, Sai Chetan Chinthakindi

Keywords Paper

0

0

0

0

11:27

06/12/2020

Counterexample-Guided Learning of Monotonic Neural Networks

Aishwarya Sivaraman, Golnoosh Farnadi, Todd Millstein, Guy Van den Broeck

Keywords Paper

0

0

0

0

3:22

16/11/2020

Transformer Based Multi-Source Domain Adaptation

Dustin Wright, Isabelle Augenstein

Keywords Paper

unsupervised adaptation, cnns, rnns, domain classifiers

0

0

0

0

11:30

12/07/2020

Hierarchically Decoupled Morphological Transfer

Donald Hejna, Lerrel Pinto, Pieter Abbeel

Keywords Paper

Reinforcement Learning - Deep RL

0

0

0

0

15:14

02/02/2021

LRSC: Learning Representations for Subspace Clustering

Changsheng Li, Chen Yang, Bo Liu and
Ye Yuan, Guoren Wang

Keywords Paper

0

0

0

0

15:09

26/04/2020

Domain Adaptive Multibranch Networks

Róger Bermúdez-Chacón, Mathieu Salzmann, Pascal Fua

Keywords Paper

Domain Adaptation, Computer Vision

0

0

0

0

5:26

06/12/2020

Multi-Task Reinforcement Learning with Soft Modularization

Ruihan Yang, Huazhe Xu, YI WU, Xiaolong Wang

Keywords Paper

0

0

0

0

3:18

16/11/2020

Accelerating Reinforcement Learning with Learned Skill Priors

Karl Pertsch, Youngwoon Lee, Joseph Lim

Keywords Paper

0

0

0

0

5:12

03/05/2021

$i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

Kibok Lee, Yian Zhu, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin, Honglak Lee

Keywords Paper

self-supervised learning, unsupervised representation learning, data augmentation, MixUp, contrastive representation learning

0

0

0

0

5:04

18/07/2021

Provable Meta-Learning of Linear Representations

Nilesh Tripuraneni, Chi Jin, Michael Jordan

Keywords Paper

Theory, Statistical Learning Theory

0

0

0

0

5:09

06/12/2020

Certified Monotonic Neural Networks

Xingchao Liu, Aaron Han, Na Zhang, Qiang Liu

Keywords Paper

0

0

0

0

3:22

26/04/2020

Federated Adversarial Domain Adaptation

Xingchao Peng, Zijun Huang, Yizhe Zhu, Kate Saenko

Keywords Paper

Federated Learning, Domain Adaptation, Transfer Learning, Feature Disentanglement

0

0

0

2

4:57

03/05/2021

Contextual Transformation Networks for Online Continual Learning

Quang Pham, Chenghao Liu, Doyen Sahoo, Steven HOI

Keywords Paper

Continual Learning

0

0

0

0

4:48