03/05/2021

Few-Shot Learning via Learning the Representation, Provably

Simon Du, Wei Hu, Sham M. Kakade, Jason Lee, Qi Lei

Keywords: statistical learning theory, representation learning

Abstract: This paper studies few-shot learning via representation learning, where one uses $T$ source tasks with $n_1$ data per task to learn a representation in order to reduce the sample complexity of a target task for which there are only $n_2$ ($\ll n_1$) data. Specifically, we focus on the setting where there exists a good common representation between source and target, and our goal is to understand how much of a sample size reduction is possible. First, we study the setting where this common representation is low-dimensional and provide a risk bound of $\tilde{O}(\frac{dk}{n_1T} + \frac{k}{n_2})$ on the target task for the linear representation class; here $d$ is the ambient input dimension and $k$ ($\ll d$) is the dimension of the representation. This result bypasses the $\Omega(\frac{1}{T})$ barrier under the i.i.d. task assumption, and captures the desired property that all $n_1T$ samples from the source tasks can be \emph{pooled} together for representation learning. We further extend this result to a general representation function class and obtain a similar guarantee. Next, we consider the setting where the common representation may be high-dimensional but is capacity-constrained (say, in norm); here, we again demonstrate the advantage of representation learning in both high-dimensional linear regression and neural networks, and show that representation learning can fully utilize all $n_1T$ samples from the source tasks.

Talk and paper published at ICLR 2021 (virtual conference).
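To make the two-stage procedure in the abstract concrete (learn a shared low-dimensional linear representation from the pooled source tasks, then fit only a $k$-dimensional head on the $n_2$ target samples), here is a minimal NumPy simulation sketch. The dimensions, the per-task least-squares-plus-SVD subspace estimator, and helper names such as `sample_task` are illustrative assumptions for this sketch, not the estimator analyzed in the paper.

```python
# Minimal NumPy simulation of the setting in the abstract: T source tasks that
# share a k-dimensional linear representation, and a target task observed
# through only n2 << n1 samples.  Illustrative sketch, not the paper's method.
import numpy as np

rng = np.random.default_rng(0)
d, k, T, n1, n2 = 50, 5, 40, 100, 20

# Ground-truth shared representation B (d x k, orthonormal columns) and heads.
B, _ = np.linalg.qr(rng.normal(size=(d, k)))
W_src = rng.normal(size=(k, T))          # one k-dimensional head per source task
w_tgt = rng.normal(size=k)               # target-task head


def sample_task(w_k, n):
    """Draw n samples from a linear task y = <B w_k, x> + noise (hypothetical model)."""
    X = rng.normal(size=(n, d))
    y = X @ (B @ w_k) + 0.1 * rng.normal(size=n)
    return X, y


# Stage 1: representation learning from the pooled source tasks.
# Per-task least squares, then an SVD of the stacked coefficient vectors;
# the top-k left singular vectors estimate the shared subspace.
W_hat = np.column_stack(
    [np.linalg.lstsq(*sample_task(W_src[:, t], n1), rcond=None)[0] for t in range(T)]
)
U, _, _ = np.linalg.svd(W_hat, full_matrices=False)
B_hat = U[:, :k]                         # estimated d x k representation

# Stage 2: few-shot learning on the target task using only n2 samples,
# fitting k parameters on the learned features B_hat^T x instead of d parameters.
X_tgt, y_tgt = sample_task(w_tgt, n2)
head = np.linalg.lstsq(X_tgt @ B_hat, y_tgt, rcond=None)[0]

# Sanity check: compare against directly fitting all d coordinates from n2 samples.
X_test, y_test = sample_task(w_tgt, 5000)
risk_rep = np.mean((X_test @ B_hat @ head - y_test) ** 2)
w_naive = np.linalg.lstsq(X_tgt, y_tgt, rcond=None)[0]
risk_naive = np.mean((X_test @ w_naive - y_test) ** 2)
print(f"test risk with learned representation: {risk_rep:.3f}")
print(f"test risk with direct d-dimensional regression: {risk_naive:.3f}")
```

With n2 smaller than d, the direct regression is underdetermined and generalizes poorly, while the representation-based predictor only has to estimate k parameters on the target task, mirroring the $\frac{k}{n_2}$ term in the stated risk bound.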
