TaskNorm: Rethinking Batch Normalization for Meta-Learning

12/07/2020

TaskNorm: Rethinking Batch Normalization for Meta-Learning

John Bronskill, Jonathan Gordon, James Requeima, Sebastian Nowozin, Richard Turner

Keywords: Transfer, Multitask and Meta-learning

Abstract Paper Similar Papers

Abstract: Modern meta-learning approaches for image classification rely on increasingly deep networks to achieve state-of-the-art performance, making batch normalization an essential component of meta-learning pipelines. However, the hierarchical nature of the meta-learning setting presents several challenges that can render conventional batch normalization ineffective, giving rise to the need to rethink normalization in this setting. We evaluate a range of approaches to batch normalization for meta-learning scenarios, and develop a novel approach that we call TaskNorm. Experiments on fourteen datasets demonstrate that the choice of batch normalization has a dramatic effect on both classification accuracy and training time for both gradient based- and gradient-free meta-learning approaches. Importantly, TaskNorm is found to consistently improve performance. Finally, we provide a set of best practices for normalization that will allow fair comparison of meta-learning algorithms.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ICML 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Analyzing and Improving the Image Quality of StyleGAN

Tero Karras, Samuli Laine, Miika Aittala and
Janne Hellsten, Jaakko Lehtinen, Timo Aila

Keywords Paper

generative modeling, image synthesis, representation learning

0

0

0

0

1:01

06/12/2021

Beyond BatchNorm: Towards a Unified Understanding of Normalization in Deep Learning

Ekdeep S Lubana, Robert Dick, Hidenori Tanaka

Keywords Paper

deep learning

0

0

0

0

8:28

02/02/2021

DeepCollaboration: Collaborative Generative and Discriminative Models for Class Incremental Learning

Bo Cui, Guyue Hu, Shan Yu

Keywords Paper

0

0

0

0

15:13

06/12/2020

GradAug: A New Regularization Method for Deep Neural Networks

Taojiannan Yang, Sijie Zhu, Chen Chen

Keywords Paper

0

0

0

0

3:18

06/12/2020

Do Adversarially Robust ImageNet Models Transfer Better?

Hadi Salman, Andrew Ilyas, Logan Engstrom and
Ashish Kapoor, Aleksander Madry

Keywords Paper

0

0

0

0

4:16

03/05/2021

$i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

Kibok Lee, Yian Zhu, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin, Honglak Lee

Keywords Paper

self-supervised learning, unsupervised representation learning, data augmentation, MixUp, contrastive representation learning

0

0

0

0

5:04

14/06/2020

Exemplar Normalization for Learning Deep Representation

Ruimao Zhang, Zhanglin Peng, Lingyun Wu and
Zhen Li, Ping Luo

Keywords Paper

normalization, learning to normalize, sample-adaptive, deep learning, image classification, semantic segmentation

0

0

0

0

1:00

19/10/2020

Dimension relation modeling for click-through rate prediction

Zihao Zhao, Zhiwei Fang, Yong Li and
Changping Peng, Yongjun Bao, Weipeng Yan

Keywords Paper

recommendation, deep learning, neural networks

0

0

0

0

6:18

05/01/2021

Learning Fast Converging, Effective Conditional Generative Adversarial Networks With a Mirrored Auxiliary Classifier

Zi Wang

Keywords Paper

0

0

0

0

4:59

26/04/2020

Progressive learning and disentanglement of hierarchical representations

Zhiyuan Li, Jaideep Vitthal Murkute, Prashnna Kumar Gyawali, Linwei Wang

Keywords Paper

generative model, disentanglement, progressive learning, VAE

0

0

0

0

5:06

22/11/2021

Mini-batch Similarity Graphs for Robust Image Classification

Arnab Kumar Mondal, Vineet Jain, Kaleem Siddiqi

Keywords Paper

Minibatch Graph, Graph Neural Network, Robustness, Image Classification, Adversarial Attacks, GAN

0

0

0

0

3:06

02/02/2021

Joint-Label Learning by Dual Augmentation for Time Series Classification

Qianli Ma, Zhenjing Zheng, Jiawei Zheng and
Sen Li, Wanqing Zhuang, Garrison W. Cottrell

Keywords Paper

0

0

0

0

15:59

18/07/2021

Improving Generalization in Meta-learning via Task Augmentation

Huaxiu Yao, Long-Kai Huang, Linjun Zhang and
Ying WEI, Li Tian, James Zou, Junzhou Huang, Zhenhui (Jessie) Li

Keywords Paper

Algorithms, Multitask, Transfer, and Meta Learning

0

0

0

0

8:27

03/05/2021

Learning Task Decomposition with Ordered Memory Policy Network

Yuchen Lu, Yikang Shen, Siyuan Zhou and
Aaron Courville, Joshua B Tenenbaum, Chuang Gan

Keywords Paper

Task Segmentation, Network Inductive Bias, Hierarchical Imitation Learning

0

0

0

0

4:57

02/02/2021

Generate Your Counterfactuals: Towards Controlled Counterfactual Generation for Text

Nishtha Madaan, Inkit Padhi, Naveen Panwar, Diptikalyan Saha

Keywords Paper

0

0

0

0

20:15

14/06/2020

Focus on Defocus: Bridging the Synthetic to Real Domain Gap for Depth Estimation

Maxim Maximov, Kevin Galim, Laura Leal-Taixé

Keywords Paper

depth estimation, generalisation, depth from focus, blur estimation, depth

0

0

0

0

1:01

02/02/2021

IsoBN: Fine-Tuning BERT with Isotropic Batch Normalization

Wenxuan Zhou, Bill Yuchen Lin, Xiang Ren

Keywords Paper

0

0

0

0

16:25

22/11/2021

Subpixel Heatmap Regression for Facial Landmark Localization

Adrian Bulat, Enrique Sanchez, Georgios Tzimiropoulos

Keywords Paper

face alignment, landmarks estimation, face tracking

0

0

0

0

2:23

22/11/2021

Single-Modal Entropy based Active Learning for Visual Question Answering

Dong-Jin Kim, Jae Won Cho, Jinsoo Choi and
Yunjae Jung, In So Kweon

Keywords Paper

Visual Question Answering, Vision and Language, Active Learning

0

0

0

0

2:42

14/06/2020

MaskGAN: Towards Diverse and Interactive Facial Image Manipulation

Cheng-Han Lee, Ziwei Liu, Lingyun Wu, Ping Luo

Keywords Paper

facial image manipulation, face segmentation, image synthesis, generative adversarial network

0

0

0

0

1:00

03/05/2021

Improving Transformation Invariance in Contrastive Representation Learning

Adam Foster, Rattana Pukdee, Tom Rainforth

Keywords Paper

transformation invariance, contrastive learning, representation learning

0

0

0

0

5:23

22/11/2021

MAGECally invert images for realistic editing

Asya Grechka, jean Francois Goudou, Matthieu Cord

Keywords Paper

gan inversion, gan, stylegan2, gan editing, image editing, gan projection, stylegan, semantic editing, latent space manipulation, latent editing

0

0

0

0

3:01

05/01/2021

Continual Representation Learning for Biometric Identification

Bo Zhao, Shixiang Tang, Dapeng Chen and
Hakan Bilen, Rui Zhao

Keywords Paper

0

0

0

0

4:36

06/12/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare and
Shafiq Joty, Caiming Xiong, Steven Chu Hong Hoi

Keywords Paper

transformers, vision, representation learning

0

0

0

0

9:40

03/05/2021

Generalized Variational Continual Learning

Noel Loo, Siddharth Swaroop, Rich E Turner

Keywords Paper

0

0

0

0

5:30

06/12/2021

Flexible Option Learning

Martin Klissarov, Doina Precup

Keywords Paper

reinforcement learning and planning

1

0

0

0

15:47

18/07/2021

ActNN: Reducing Training Memory Footprint via 2-Bit Activation Compressed Training

Jianfei Chen, Lianmin Zheng, Zhewei Yao and
Dequan Wang, Ion Stoica, Michael Mahoney, Joseph E Gonzalez

Keywords Paper

Algorithms, Large Scale Learning

0

0

0

0

18:54

30/11/2020

Restoring Spatially-Heterogeneous Distortions using Mixture of Experts Network

Sijin Kim, Namhyuk Ahn, Kyung-Ah Sohn

Keywords Paper

0

0

0

0

8:01

02/02/2021

Harmonized Dense Knowledge Distillation Training for Multi-Exit Architectures

Xinglu Wang, Yingming Li

Keywords Paper

0

0

0

0

15:12

06/12/2020

LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond

Wenbo Li, Kun Zhou, lu Qi and
Nianjuan Jiang, Jiangbo Lu, Jiaya Jia

Keywords Paper

0

0

0

0

3:09

14/06/2020

Online Deep Clustering for Unsupervised Representation Learning

Xiaohang Zhan, Jiahao Xie, Ziwei Liu and
Yew-Soon Ong, Chen Change Loy

Keywords Paper

unsupervised representation learning, self-supervised learning, clustering, unsupervised learning, unlabeled data, recognition, low-shot, classification, imagenet, feature

0

0

0

0

1:00

26/04/2020

Don't Use Large Mini-batches, Use Local SGD

Tao Lin, Sebastian U. Stich, Kumar Kshitij Patel, Martin Jaggi

Keywords Paper

0

0

0

0

4:36

05/01/2021

Unsupervised Domain Adaptation in Semantic Segmentation via Orthogonal and Clustered Embeddings

Marco Toldo, Umberto Michieli, Pietro Zanuttigh

Keywords Paper

0

0

0

0

4:59

06/12/2021

Unleashing the Power of Contrastive Self-Supervised Visual Models via Contrast-Regularized Fine-Tuning

Yifan Zhang, Bryan Hooi, Dapeng Hu and
Jian Liang, Jiashi Feng

Keywords Paper

optimization, machine learning, self-supervised learning, vision, contrastive learning, representation learning, transfer learning

0

0

0

0

14:34

14/06/2020

Orderless Recurrent Models for Multi-Label Classification

Vacit Oguz Yazici, Abel Gonzalez-Garcia, Arnau Ramisa and
Bartłomiej Twardowski, Joost van de Weijer

Keywords Paper

multi-label classification, unordered set prediction, rnn, lstm, orderless loss function, hungarian algorithm, attention, ms-coco, alignment, binary cross entropy

0

0

0

0

0:56

14/06/2020

Deep Homography Estimation for Dynamic Scenes

Hoang Le, Feng Liu, Shu Zhang, Aseem Agarwala

Keywords Paper

homography estimation, dynamic scenes, motion estimation, multi-task learning, deep learning

0

0

0

0

1:01

06/12/2020

Is normalization indispensable for training deep neural network?

Jie Shao, Kai Hu, Changhu Wang and
Xiangyang Xue, Bhiksha Raj

Keywords Paper

0

0

0

0

4:01

19/04/2021

Bootstrapping relation extractors using syntactic search by examples

Matan Eyal, Asaf Amrami, Hillel Taub-Tabib, Yoav Goldberg

Keywords Paper

0

0

0

0

9:55

03/05/2021

Auxiliary Task Update Decomposition: The Good, the Bad and the Neutral

Lucio Dery, Yann Dauphin, David Grangier

Keywords Paper

multitask learning, deeplearning, pre-training, gradient decomposition

0

0

0

0

5:22

02/02/2021

Towards Reusable Network Components by Learning Compatible Representations

Michael Gygli, Jasper Uijlings, Vittorio Ferrari

Keywords Paper

0

0

0

0

19:58