Scalable Transfer Learning with Expert Models

Abstract: Transfer of pre-trained representations can improve sample efficiency and reduce computational requirements for new tasks. However, representations used for transfer are usually generic, and are not tailored to a particular distribution of downstream tasks. We explore the use of expert representations for transfer with a simple, yet effective, strategy. We train a diverse set of experts by exploiting existing label structures, and use cheap-to-compute performance proxies to select the relevant expert for each target task. This strategy scales the process of transferring to new tasks, since it does not revisit the pre-training data during transfer. Accordingly, it requires little extra compute per target task, and results in a speed-up of 2-3 orders of magnitude compared to competing approaches. Further, we provide an adapter-based architecture able to compress many experts into a single model. We evaluate our approach on two different data sources and demonstrate that it outperforms baselines on over 20 diverse vision tasks in both cases.

06/12/2021

neural architecture search, supernet, one-shot nas, single path, greedy algorithm, exploration and exploitation, searching efficiency

1:01

06/12/2020

Scalable Transfer Learning with Expert Models

Joan Puigcerver Puigcerver i Perez, Carlos Riquelme, Basil Mustafa, Cedric Renggli, André Susano Pinto, Sylvain Gelly, Daniel Keysers, Neil Houlsby

Comments

Similar Papers

Adversarial Robustness without Adversarial Training: A Teacher-Guided Curriculum Learning Approach

Anindya Sarkar, Anirban Sarkar, Sowrya Gali, Vineeth N Balasubramanian

Keywords Abstract Paper

robustness, adversarial robustness and security

Few-Cost Salient Object Detection with Adversarial-Paced Learning

Dingwen Zhang, HaiBin Tian, Jungong Han

Keywords Abstract Paper

Active Imitation Learning with Noisy Guidance

Kianté Brantley, Hal Daumé III, Amr Sharaf

Keywords Abstract Paper

Active Learning, structured tasks, sequence tasks, Imitation algorithms

Transformer Based Multi-Source Domain Adaptation

Dustin Wright, Isabelle Augenstein

Keywords Abstract Paper

unsupervised adaptation, cnns, rnns, domain classifiers

Scalable Neural Data Server: A Data Recommender for Transfer Learning

Tianshi Cao, Sasha (Alexandre) Doubov, David Acuna, Sanja Fidler

Keywords Abstract Paper

machine learning, vision, transfer learning

Once-for-All Adversarial Training: In-Situ Tradeoff between Robustness and Accuracy for Free

Haotao Wang, Tianlong Chen, Shupeng Gui and TingKuei Hu, Ji Liu, Zhangyang Wang

Keywords Abstract Paper

Co-Tuning for Transfer Learning

Kaichao You, Zhi Kou, Mingsheng Long, Jianmin Wang

Keywords Abstract Paper

GreedyNAS: Towards Fast One-Shot NAS With Greedy Supernet

Shan You, Tao Huang, Mingmin Yang and Fei Wang, Chen Qian, Changshui Zhang

Keywords Abstract Paper

neural architecture search, supernet, one-shot nas, single path, greedy algorithm, exploration and exploitation, searching efficiency

Modular Meta-Learning with Shrinkage

Yutian Chen, Abe Friesen, Feryal Behbahani and Arnaud Doucet, David Budden, Matthew Hoffman, Nando de Freitas

Keywords Abstract Paper

Opening the Blackbox: Accelerating Neural Differential Equations by Regularizing Internal Solver Heuristics

Avik Pal, Yingbo Ma, Viral Shah, Christopher Rackauckas

Keywords Abstract Paper

Deep Learning

Dense Unsupervised Learning for Video Segmentation

Nikita Araslanov, Simone Schaub-Meyer, Stefan Roth

Keywords Abstract Paper

self-supervised learning, representation learning

Few-Shot NLG with Pre-Trained Language Model

Zhiyu Chen, Harini Eavani, Wenhu Chen and Yinyin Liu, William Yang Wang

Keywords Abstract Paper

natural generation, NLG, real-world applications, content selection

Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online

Yangchen Pan, Kirby Banman, Martha White

Keywords Abstract Paper

natural sparsity, Reinforcement learning, fuzzy tiling activation function, sparse representation

Top-KAST: Top-K Always Sparse Training

Sid Jayakumar, Razvan Pascanu, Jack Rae and Simon Osindero, Erich Elsen

Keywords Abstract Paper

Zero-Resource Cross-Domain Named Entity Recognition

Zihan Liu, Genta Indra Winata, Pascale Fung

Keywords Abstract Paper

One-Shot Deep Model for End-to-End Multi-Person Activity Recognition

Shuhei Tarashima

Keywords Abstract Paper

Group Activity Recognition, Action Recognition, Multi-Object Tracking, Multi-task Learning

Ensemble Distillation for Robust Model Fusion in Federated Learning

Tao Lin, Lingjing Kong, Sebastian Stich, Martin Jaggi

Keywords Abstract Paper

Training Data Subset Selection for Regression with Controlled Generalization Error

Durga S, Rishabh Iyer, Ganesh Ramakrishnan, Abir De

Keywords Abstract Paper

, Algorithms, Online Learning, Algorithms, Supervised Learning

MetaPerturb: Transferable Regularizer for Heterogeneous Tasks and Architectures

Jeong Un Ryu, JWoong Shin, Hae Beom Lee, Sung Ju Hwang

Keywords Abstract Paper

CompOFA – Compound Once-For-All Networks for Faster Multi-Platform Deployment

Manas Sahni, Shreya Varshini, Alind Khare, Alexey Tumanov

Keywords Abstract Paper

AutoML, Latency-aware Neural Architecture Search, Efficient Deep Learning

Shape your Space: A Gaussian Mixture Regularization Approach to Deterministic Autoencoders

Amrutha Saseendran, Kathrin Skubch, Stefan Falkner, Margret Keuper

Keywords Abstract Paper

generative model

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Haotao Wang, Tianlong Chen, Shupeng Gui and
TingKuei Hu, Ji Liu, Zhangyang Wang

Keywords Paper

Keywords Paper

Shan You, Tao Huang, Mingmin Yang and
Fei Wang, Chen Qian, Changshui Zhang

Keywords Paper

Yutian Chen, Abe Friesen, Feryal Behbahani and
Arnaud Doucet, David Budden, Matthew Hoffman, Nando de Freitas

Keywords Paper

Keywords Paper

Keywords Paper

Zhiyu Chen, Harini Eavani, Wenhu Chen and
Yinyin Liu, William Yang Wang

Keywords Paper

Keywords Paper

Sid Jayakumar, Razvan Pascanu, Jack Rae and
Simon Osindero, Erich Elsen

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Zibo Lin, Deng Cai, Yan Wang and
Xiaojiang Liu, Haitao Zheng, Shuming Shi

Keywords Paper

Keywords Paper

Keywords Paper

Roberta Raileanu, Maxwell Goldstein, Denis Yarats and
Ilya Kostrikov, Rob Fergus

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Kafeng Wang, Xitong Gao, Yiren Zhao and
Xingjian Li, Dejing Dou, Cheng-Zhong Xu

Keywords Paper

Runtian Zhai, Chen Dan, Di He and
Huan Zhang, Boqing Gong, Pradeep Ravikumar, Cho-Jui Hsieh, Liwei Wang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Mitchell Wortsman, Vivek Ramanujan, Rosanne Liu and
Ani Kembhavi, Mohammad Rastegari, Jason Yosinski, Ali Farhadi

Keywords Paper

Danyang Wu, Jin Xu, Xia Dong and
Meng Liao, Rong Wang, Feiping Nie, Xuelong Li

Keywords Paper