A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark

22/11/2021

A Closer Look at Few-Shot Video Classification: A New Baseline and Benchmark

Zhenxi Zhu, Limin Wang, Sheng Guo, Gangshan Wu

Keywords: few-shot learning, classifier-based baseline, new benchmark, action recognition

Abstract Paper Code Similar Papers

Abstract: The existing few-shot video classification methods often employ a meta-learning paradigm by designing customized temporal alignment module for similarity calculation. While significant progress has been made, these methods fail to focus on learning effective representations, and heavily rely on the ImageNet pre-training, which might be unreasonable for the few-shot recognition setting due to semantics overlap. In this paper, we aim to present an in-depth study on few-shot video classification by making three contributions. First, we perform a consistent comparative study on the existing metric-based methods to figure out their limitations in representation learning. Accordingly, we propose a simple classifier-based baseline without any temporal alignment that surprisingly outperforms the state-of-the-art meta-learning based methods. Second, we discover that there is a high correlation between the novel action class and the ImageNet object class, which is problematic in the few-shot recognition setting. Our results show that the performance of training from scratch drops significantly, which implies that the existing benchmarks cannot provide enough base data. Finally, we present a new benchmark with more base data to facilitate future few-shot video classification without pre-training. The code will be made available at https://github.com/MCG-NJU/FSL-Video.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at BMVC 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

06/12/2021

Unsupervised Part Discovery from Contrastive Reconstruction

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Keywords Paper

machine learning, self-supervised learning, clustering, representation learning

0

0

0

0

6:46

30/11/2020

Few-Shot Zero-Shot Learning: Knowledge Transfer with Less Supervision

Nanyi Fei, Jiechao Guan, Zhiwu Lu, Yizhao Gao

Keywords Paper

0

0

0

0

7:37

14/06/2020

Learning Meta Face Recognition in Unseen Domains

Jianzhu Guo, Xiangyu Zhu, Chenxu Zhao and
Dong Cao, Zhen Lei, Stan Z. Li

Keywords Paper

face recognition, meta learning, domain generalization, metric learning

0

0

0

0

5:01

02/02/2021

Non-Autoregressive Coarse-to-Fine Video Captioning

Bang Yang, Yuexian Zou, Fenglin Liu, Can Zhang

Keywords Paper

0

0

0

0

18:21

22/11/2021

Temporal Alignment via Event Boundary for Few-shot Action Recongnition

Shuyuan Li, Huabin Liu, Mengjuan Fei and
Xiaoyuan Yu, Weiyao Lin

Keywords Paper

few-shot action recognition, temporal alignment, event boundary

0

0

0

0

2:32

19/08/2021

Learning Implicit Temporal Alignment for Few-shot Video Classification

Songyang Zhang, Jiale Zhou, Xuming He

Keywords Paper

Computer Vision, Action Recognition, Deep Learning

0

0

0

0

6:20

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan and
Bin Wang, Yinghui Xu, Rong Jin

Keywords Paper

0

0

0

0

15:15

06/12/2021

Joint Semantic Mining for Weakly Supervised RGB-D Salient Object Detection

Jingjing Li, Wei Ji, Qi Bi and
Cheng Yan, Miao Zhang, Yongri Piao, Huchuan Lu, Li cheng

Keywords Paper

vision

0

0

0

0

9:03

22/11/2021

Text-Based Person Search with Limited Data

Xiao Han, Sen He, Li Zhang, Tao Xiang

Keywords Paper

person re-identification, cross-modal image retrieval, fine-grained image retrieval, text-based person search

0

0

0

0

3:04

06/12/2020

A Closer Look at Accuracy vs. Robustness

Yao-Yuan Yang, Cyrus Rashtchian, Hongyang Zhang and
Russ Salakhutdinov, Kamalika Chaudhuri

Keywords Paper

0

0

0

0

3:24

02/02/2021

RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning

Peihao Chen, Deng Huang, Dongliang He and
Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan

Keywords Paper

0

0

0

0

14:14

14/06/2020

Dense Regression Network for Video Grounding

Runhao Zeng, Haoming Xu, Wenbing Huang and
Peihao Chen, Mingkui Tan, Chuang Gan

Keywords Paper

video grounding, sparse annotations, dense regression, multi-level fusion

0

0

0

0

0:57

05/01/2021

Multimodal Prototypical Networks for Few-Shot Learning

Frederik Pahde, Mihai Puscas, Tassilo Klein, Moin Nabi

Keywords Paper

0

0

0

0

4:56

02/02/2021

Weakly Supervised Temporal Action Localization Through Learning Explicit Subspaces for Action and Context

Ziyi Liu, Le Wang, Wei Tang and
Junsong Yuan, Nanning Zheng, Gang Hua

Keywords Paper

0

0

0

0

19:49

05/01/2021

AdarGCN: Adaptive Aggregation GCN for Few-Shot Learning

Jianhong Zhang, Manli Zhang, Zhiwu Lu, Tao Xiang

Keywords Paper

0

0

0

0

4:45

14/06/2020

Semi-Supervised Learning for Few-Shot Image-to-Image Translation

Yaxing Wang, Salman Khan, Abel Gonzalez-Garcia and
Joost van de Weijer, Fahad Shahbaz Khan

Keywords Paper

image-to-image translation, few-shot image generation, unsupervised image-to-image translation, conditional image generation, semi-supervised image-to-image translation

0

0

0

0

0:58

06/12/2021

Few-Shot Segmentation via Cycle-Consistent Transformer

Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei

Keywords Paper

transformers, vision, few shot learning

0

0

0

0

11:58

02/02/2021

Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning

Jiannan Ge, Hongtao Xie, Shaobo Min, Yongdong Zhang

Keywords Paper

0

0

0

0

16:22

26/04/2020

Meta-Learning without Memorization

Mingzhang Yin, George Tucker, Mingyuan Zhou and
Sergey Levine, Chelsea Finn

Keywords Paper

meta-learning, memorization, regularization, overfitting, mutually-exclusive

0

0

0

0

5:09

07/09/2020

Graph Density-Aware Losses for Novel Compositions in Scene Graph Generation

Boris Knyazev, Harm De Vries, Cătălina Cangea and
Graham Taylor, Aaron Courville, Eugene Belilovsky

Keywords Paper

scene graphs, scene graph generation, graph density, compositional generalization, visual genome, gqa, message passing

0

0

0

0

10:04

14/06/2020

Novel Object Viewpoint Estimation Through Reconstruction Alignment

Mohamed El Banani, Jason J. Corso, David F. Fouhey

Keywords Paper

viewpoint estimation, geometry-aware, alignment, reconstruction, 3d, cross-dataset, generalization

0

0

0

0

1:01

12/07/2020

Semi-Supervised StyleGAN for Disentanglement Learning

Weili Nie, Tero Karras, Animesh Garg and
Shoubhik Debnath, Anjul Patney, Ankit Patel, Anima Anandkumar

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

16:02

19/08/2021

Conditional Self-Supervised Learning for Few-Shot Classification

Yuexuan An, Hui Xue, Xingyu Zhao, Lu Zhang

Keywords Paper

Machine Learning, Classification, Transfer, Adaptation, Multi-task Learning, Unsupervised Learning

0

0

0

0

9:06

06/12/2020

Make One-Shot Video Object Segmentation Efficient Again

Tim Meinhardt, Laura Leal-Taixé

Keywords Paper

0

0

0

0

3:17

06/12/2021

Statistically and Computationally Efficient Linear Meta-representation Learning

Kiran Thekumparampil, Prateek Jain, Praneeth Netrapalli, Sewoong Oh

Keywords Paper

optimization, meta learning, representation learning, few shot learning

1

0

0

1

12:56

22/11/2021

Global Context and Geometric Priors for Effective Non-Local Self-Attention

Sanghyun Woo, Dahun Kim, Joon-Young Lee, In So Kweon

Keywords Paper

self-attention, non-local attention, attention, transformer, context, position encoding

0

0

0

0

3:03

14/06/2020

Real-World Person Re-Identification via Degradation Invariance Learning

Yukun Huang, Zheng-Jun Zha, Xueyang Fu and
Richang Hong, Liang Li

Keywords Paper

disentangled representation learning, person re-identification, generative adversarial network, image degradation, self-supervised learning

0

0

0

0

1:01

19/04/2021

Neural data-to-text generation with LM-based text augmentation

Ernie Chang, Xiaoyu Shen, Dawei Zhu and
Vera Demberg, Hui Su

Keywords Paper

0

0

0

0

7:32

30/11/2020

Raw-Guided Enhancing Reprocess of Low-Light Image via Deep Exposure Adjustment

Haofeng Huang, Wenhan Yang, Yueyu Hu, Jiaying Liu

Keywords Paper

0

0

0

0

7:20

22/11/2021

Deep Image Matting with Flexible Guidance Input

Hang Cheng, Shugong Xu, Xiufeng Jiang, Rongrong Wang

Keywords Paper

matting, data augmentation, guidance information, trimap-free

0

0

0

0

2:58

05/01/2021

Class-Wise Metric Scaling for Improved Few-Shot Classification

Ge Liu, Linglan Zhao, Wei Li and
Dashan Guo, Xiangzhong Fang

Keywords Paper

0

0

0

0

5:01

02/02/2021

A Hybrid Attention Mechanism for Weakly-Supervised Temporal Action Localization

Ashraful Islam, Chengjiang Long, Richard Radke

Keywords Paper

0

0

0

0

16:53

14/06/2020

Weakly-Supervised Salient Object Detection via Scribble Annotations

Jing Zhang, Xin Yu, Aixuan Li and
Peipei Song, Bowen Liu, Yuchao Dai

Keywords Paper

rgb saliency detection, scribble annotation, weakly-supervised

0

0

0

0

1:00

02/02/2021

Deep Metric Learning with Self-Supervised Ranking

Zheren Fu, Yan Li, Zhendong Mao and
Quan Wang, Yongdong Zhang

Keywords Paper

0

0

0

0

12:36

06/12/2020

Generalized Hindsight for Reinforcement Learning

Alex Li, Lerrel Pinto, Pieter Abbeel

Keywords Paper

0

0

0

0

3:20

14/06/2020

Scale-Space Flow for End-to-End Optimized Video Compression

Eirikur Agustsson, David Minnen, Nick Johnston and
Johannes Ballé, Sung Jin Hwang, George Toderici

Keywords Paper

learned video compression, scale-space flow, bilinear warping

0

0

0

0

0:55

02/02/2021

LREN: Low-Rank Embedded Network for Sample-Free Hyperspectral Anomaly Detection

Kai Jiang, Weiying Xie, Jie Lei and
Tao Jiang, Yunsong Li

Keywords Paper

0

0

0

0

12:56

03/05/2021

SaliencyMix: A Saliency Guided Data Augmentation Strategy for Better Regularization

A F M Shahab Uddin, Mst. Sirazam Monira, Wheemyung Shin and
TaeChoong Chung, Sung-Ho Bae

Keywords Paper

Regularization, Data Augmentation, Saliency Guided Data Augmentation, SaliencyMix

0

0

0

0

3:36

22/11/2021

Few-shot Action Recognition with Prototype-centered Attentive Learning

Xiatian Zhu, Antoine S Toisoul, Juan-Manuel Perez-Rua and
Li Zhang, Brais Martinez, Tao Xiang

Keywords Paper

Few-shot learning, Video recognition, Action classification, Small training data, Model pre-training, Meta-learning, Transformer, Self-attention learning, Cross-attention learning, Prototype learning, Prototype-centered learning, Hybrid-attention learning

0

0

0

0

2:22

22/11/2021

Zero-Shot Action Recognition from Diverse Object-Scene Compositions

Carlo Bretti, Pascal Mettes

Keywords Paper

action recognition, zero-shot learning, object-scene compositions

0

0

0

0

2:43