Statistically and Computationally Efficient Linear Meta-representation Learning

Abstract: In typical few-shot learning, each task is not equipped with enough data to be learned in isolation. To cope with such data scarcity, meta-representation learning methods train across many related tasks to find a shared (lower-dimensional) representation of the data in which all tasks can be solved accurately. It is hypothesized that any newly arriving task can then be rapidly trained on this low-dimensional representation using only a few samples. Despite the practical successes of this approach, its statistical and computational properties are less understood. Moreover, the algorithms prescribed in existing theoretical studies either bear little resemblance to those used in practice or are computationally intractable. To understand and explain the success of popular meta-representation learning approaches such as ANIL, MetaOptNet, R2D2, and OML, we study an alternating gradient-descent minimization (AltMinGD) method, and its variant alternating minimization (AltMin), which underlies the aforementioned methods. For a simple but canonical setting of shared linear representations, we show that AltMinGD achieves nearly optimal estimation error, requiring only $\Omega(\mathrm{polylog}\,d)$ samples per task. This agrees with the observed efficacy of this algorithm in practical few-shot learning scenarios.
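To make the alternating scheme described in the abstract concrete, here is a minimal NumPy sketch. It assumes a noise-free shared linear model in which task i produces labels y_ij = x_ij^T U w_i, with an orthonormal representation U of shape (d, r) common to all tasks and a task-specific head w_i. Each round solves a small least-squares problem per task for w_i with U fixed, then takes one gradient step on U and re-orthonormalizes it. The function names, the random initialization, the step size, the QR re-orthonormalization, and the synthetic Gaussian data are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np

def make_tasks(t, m, d, r, seed=0):
    """Synthetic data (assumed setup): t tasks, m samples each, dimension d, rank r."""
    rng = np.random.default_rng(seed)
    U_star, _ = np.linalg.qr(rng.standard_normal((d, r)))  # true shared representation
    W_star = rng.standard_normal((t, r))                   # true per-task heads
    X = rng.standard_normal((t, m, d))
    y = np.einsum('tmd,dr,tr->tm', X, U_star, W_star)      # noise-free labels
    return X, y, U_star

def subspace_error(U, U_star):
    """Spectral norm of (I - U U^T) U_star: sine of the largest principal angle."""
    return np.linalg.norm(U_star - U @ (U.T @ U_star), ord=2)

def altmin_gd(X, y, r, n_iters=200, step=0.3, seed=0):
    """Alternate exact per-task least squares (heads) with one gradient step on U."""
    t, m, d = X.shape
    rng = np.random.default_rng(seed)
    U, _ = np.linalg.qr(rng.standard_normal((d, r)))       # random init (an assumption)
    W = np.zeros((t, r))
    for _ in range(n_iters):
        # Step 1: with U fixed, solve the r-dimensional normal equations per task.
        Z = X @ U                                          # (t, m, r) projected features
        G = np.einsum('tmr,tms->trs', Z, Z)                # (t, r, r) Gram matrices
        b = np.einsum('tmr,tm->tr', Z, y)                  # (t, r)
        W = np.linalg.solve(G, b[..., None])[..., 0]       # (t, r) task heads
        # Step 2: with the heads fixed, take one gradient step on the squared loss in U.
        resid = np.einsum('tmr,tr->tm', Z, W) - y          # (t, m) residuals
        grad = np.einsum('tm,tmd,tr->dr', resid, X, W) / (t * m)
        # Step 3: re-orthonormalize so U keeps orthonormal columns.
        U, _ = np.linalg.qr(U - step * grad)
    return U, W  # W holds the heads fitted in the final round, before the last U update

# Few samples per task (m < d), many tasks.
X, y, U_star = make_tasks(t=400, m=10, d=20, r=2)
U_hat, W_hat = altmin_gd(X, y, r=2, seed=1)
print("subspace error:", subspace_error(U_hat, U_star))
```

Replacing the single gradient step with an exact least-squares solve for U would give the AltMin-style variant. In this sketch each task has fewer samples than the ambient dimension, so no task could be fit in isolation, yet the pooled gradient still uses every task's data to update the shared U.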

13/04/2021

Kiran Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

Similar Papers

Exponential convergence rates of classification errors on learning with SGD and random features

Shingo Yashima, Atsushi Nitanda, Taiji Suzuki

On the generalization properties of adversarial training

Yue Xing, Qifan Song, Guang Cheng

Deciphering and Optimizing Multi-Task Learning: a Random Matrix Approach

Malik Tiomoko, Hafiz Tiomoko Ali, Romain Couillet

Keywords: Transfer Learning, Random Matrix Theory, Multi Task Learning

Semi-supervised learning with meta-gradient

Taihong Xiao, Xin-Yu Zhang, Haolin Jia, Ming-Ming Cheng, Ming-Hsuan Yang

Meta-learning with negative learning rates

Alberto Bernacchia

Keywords: Meta-learning

On the Acceleration of Deep Learning Model Parallelism With Staleness

An Xu, Zhouyuan Huo, Heng Huang

Keywords: layer-wise staleness, asynchronous model parallelism, convolutional neural networks

Meta-Learning without Memorization

Mingzhang Yin, George Tucker, Mingyuan Zhou, Sergey Levine, Chelsea Finn

Keywords: meta-learning, memorization, regularization, overfitting, mutually-exclusive

PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided Exploration

Yuda Song, Wen Sun

Keywords: Reinforcement Learning and Planning

Few-Shot Zero-Shot Learning: Knowledge Transfer with Less Supervision

Nanyi Fei, Jiechao Guan, Zhiwu Lu, Yizhao Gao

SAT-based Decision Tree Learning for Large Data Sets

Andre Schidler, Stefan Szeider

An Analysis of Constant Step Size SGD in the Non-convex Regime: Asymptotic Normality and Bias

Lu Yu, Krishnakumar Balasubramanian, Stanislav Volgushev, Murat Erdogdu

Keywords: optimization, machine learning

Adversarially Robust Low Dimensional Representations

Pranjal Awasthi, Vaggos Chatziafratis, Xue Chen, Aravindan Vijayaraghavan

SGD: The Role of Implicit Regularization, Batch-size and Multiple-epochs

Ayush Sekhari, Karthik Sridharan, Satyen Kale

Keywords: theory, deep learning, optimization

MetaNorm: Learning to Normalize Few-Shot Batches Across Domains

Yingjun Du, Xiantong Zhen, Ling Shao, Cees G Snoek

Keywords: batch normalization, Meta-learning, few-shot domain generalization

Robust Meta-learning for Mixed Linear Regression with Small Batches

Weihao Kong, Raghav Somani, Sham Kakade, Sewoong Oh

The Implications of Local Correlation on Learning Some Deep Functions

Eran Malach, Shai Shalev-Shwartz

Uniform Sampling over Episode Difficulty

Sébastien Arnold, Guneet Dhillon, Avinash Ravichandran, Stefano Soatto

Keywords: meta learning, few shot learning

Exploring the limits of few-shot link prediction in knowledge graphs

Dora Jambor, Komal Teru, Joelle Pineau, William L. Hamilton

PACOH: Bayes-Optimal Meta-Learning with PAC-Guarantees

Jonas Rothfuss, Vincent Fortuin, Martin Josifoski, Andreas Krause

Keywords: Algorithms, Multitask, Transfer, and Meta Learning

Learning Parities with Neural Networks

Amit Daniely, Eran Malach

On the Convergence of FedAvg on Non-IID Data

Xiang Li, Kaixuan Huang, Wenhao Yang, Shusen Wang, Zhihua Zhang

Keywords: Federated Learning, stochastic optimization, Federated Averaging

On sensitivity of meta-learning to support data

Mayank Agarwal, Mikhail Yurochkin, Yuekai Sun
