Active Contrastive Learning of Audio-Visual Video Representations

03/05/2021

Shuang Ma, Zhaoyang Zeng, Daniel McDuff, Yale Song

Keywords: video recognition, audio-visual representation, self-supervised learning, active learning, contrastive representation learning

Abstract: Contrastive learning has been shown to produce generalizable representations of audio and visual data by maximizing the lower bound on the mutual information (MI) between different views of an instance. However, obtaining a tight lower bound requires a sample size exponential in MI and thus a large set of negative samples. We can incorporate more samples by building a large queue-based dictionary, but there are theoretical limits to performance improvements even with a large number of negative samples. We hypothesize that random negative sampling leads to a highly redundant dictionary that results in suboptimal representations for downstream tasks. In this paper, we propose an active contrastive learning approach that builds an actively sampled dictionary with diverse and informative items, which improves the quality of negative samples and improves performance on tasks where there is high mutual information in the data, e.g., video classification. Our model achieves state-of-the-art performance on challenging audio and visual downstream benchmarks including UCF101, HMDB51 and ESC50.
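
The link between the number of negatives and the MI lower bound can be illustrated with a minimal sketch of an InfoNCE-style loss in plain Python. This is not the authors' implementation: the function names (`info_nce_loss`, `mi_lower_bound`), the cosine-similarity scoring, and the temperature of 0.07 are assumptions made for illustration only, and the estimate is computed for a single query, whereas the bound itself holds for the loss averaged over many samples.

```python
import math

def dot(u, v):
    """Inner product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    """Scale a non-zero vector to unit L2 norm."""
    norm = math.sqrt(dot(v, v))
    return [a / norm for a in v]

def info_nce_loss(query, positive, negatives, temperature=0.07):
    """Cross-entropy of picking the positive key among 1 + K candidates."""
    q = normalize(query)
    keys = [normalize(positive)] + [normalize(n) for n in negatives]
    logits = [dot(q, k) / temperature for k in keys]
    # Numerically stable log-sum-exp over all candidates.
    m = max(logits)
    log_z = m + math.log(sum(math.exp(l - m) for l in logits))
    return log_z - logits[0]

def mi_lower_bound(loss, num_negatives):
    """InfoNCE estimate log(K + 1) - loss; it can never exceed log(K + 1)."""
    return math.log(num_negatives + 1) - loss

if __name__ == "__main__":
    # Hypothetical 2-D embeddings: one query, its positive, two negatives.
    query = [1.0, 0.0]
    loss = info_nce_loss(query, [0.9, 0.1], [[0.0, 1.0], [-1.0, 0.2]])
    print("loss:", loss)
    print("MI estimate:", mi_lower_bound(loss, 2))
    print("cap log(K + 1):", math.log(3))
```

Because the loss is non-negative, the estimate is capped at log(K + 1) for K negatives, so certifying I nats of mutual information needs on the order of e^I candidates. This is the sense in which the required sample size is exponential in MI.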

This is an embedded video. The talk and the corresponding paper were published at the ICLR 2021 virtual conference.

Similar Papers

19/04/2021

Multilingual neural machine translation with deep encoder and multiple shallow decoders

Xiang Kong, Adithya Renduchintala, James Cross, Yuqing Tang, Jiatao Gu, Xian Li

10:26

06/12/2021

Generalized and Discriminative Few-Shot Object Detection via SVD-Dictionary Enhancement

Aming WU, Suqi Zhao, Cheng Deng, Wei Liu

Keywords: machine learning, vision

9:04

22/11/2021

Siamese Prototypical Contrastive Learning

Shentong Mo, Zhun Sun, Chao Li

Keywords: self-supervised learning, contrastive learning, representation learning

2:50

07/09/2020

Boosting Image and Video Compression via Learning Latent Residual Patterns

Yen-Chung Chen, Keng-Jui Chang, Yi-Hsuan Tsai, Wei-Chen Chiu

Keywords: compression artifacts, image compression, video compression, latent residual

7:48

03/05/2021

Support-set bottlenecks for video-text representation learning

Mandela Patrick, Po-Yao Huang, Yuki Asano, Florian Metze, Alexander G Hauptmann, Joao F. Henriques, Andrea Vedaldi

Keywords: contrastive learning, video-text learning, multi-modal learning, video representation learning

6:40

14/06/2020

Momentum Contrast for Unsupervised Visual Representation Learning

Kaiming He, Haoqi Fan, Yuxin Wu, Saining Xie, Ross Girshick

Keywords: unsupervised learning, representation learning

4:53

04/07/2020

Improving Massively Multilingual Neural Machine Translation and Zero-Shot Translation

Biao Zhang, Philip Williams, Ivan Titov, Rico Sennrich

Keywords: Massively Translation, Zero-Shot Translation, neural translation, NMT

11:47

06/12/2020

Hard Negative Mixing for Contrastive Learning

Yannis Kalantidis, Mert Bulent Sariyildiz, Noe Pion, Philippe Weinzaepfel, Diane Larlus

3:17

05/01/2021

Intra-Class Part Swapping for Fine-Grained Image Classification

Lianbo Zhang, Shaoli Huang, Wei Liu

4:43

18/07/2021

Robust Representation Learning via Perceptual Similarity Metrics

Saeid A Taghanaki, Kristy Choi, Amir Hosein Khasahmadi, Anirudh Goyal

Keywords: Deep Learning, Embedding and Representation learning

4:48

07/09/2020

Learning Effectively from Noisy Supervision for Weakly Supervised Semantic Segmentation

Wenbin Xie, Qiaoqiao Wei, Zheng Li, Hui Zhang

Keywords: Semantic Segmentation, Weakly Supervised Semantic Segmentation, Self Attention

3:46

06/12/2021

Generalized DataWeighting via Class-Level Gradient Manipulation

Can Chen, Shuhao Zheng, Xi Chen, Erqun Dong, Xue (Steve) Liu, Hao Liu, Dejing Dou

Keywords: optimization, machine learning, meta learning

13:13

14/06/2020

Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking

Jin Gao, Weiming Hu, Yan Lu

Keywords: online learning, visual tracking, continual learning, recursive least-squares estimation, deep learning, memory retention, recursive learning, mini-batch sgd, normal equation, mlp layer

5:01

14/06/2020

Forward and Backward Information Retention for Accurate Binary Neural Networks

Haotong Qin, Ruihao Gong, Xianglong Liu, Mingzhu Shen, Ziran Wei, Fengwei Yu, Jingkuan Song

Keywords: model compression, binary neural networks, deep learning, quantization, computer vision

1:00

06/12/2020

All Word Embeddings from One Embedding

Sho Takase, Sosuke Kobayashi

3:11

06/12/2021

Compressive Visual Representations

Kuang-Huei Lee, Anurag Arnab, Sergio Guadarrama, John Canny, Ian Fischer

Keywords: theory, machine learning, robustness, self-supervised learning, contrastive learning

6:30

08/12/2020

Automatic Word Association Norms (AWAN)

Jorge Reyes-Magaña, Gerardo Sierra Martínez, Gemma Bel-Enguix, Helena Gomez-Adorno

14:34

06/12/2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling

Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangyang Wang

Keywords: contrastive learning

10:12

06/12/2020

Learning Diverse and Discriminative Representations via the Principle of Maximal Coding Rate Reduction

Yaodong Yu, Ryan Chan, Chong You, Chaobing Song, Yi Ma

3:20

13/04/2021

Probabilistic sequential matrix factorization

Omer Deniz Akyildiz, Gerrit Burg, Theodoros Damoulas, Mark Steel

2:48

06/12/2020

Multi-label Contrastive Predictive Coding

Jiaming Song, Stefano Ermon

3:10

06/12/2021

Near-optimal Offline and Streaming Algorithms for Learning Non-Linear Dynamical Systems

Suhas Kowshik, Dheeraj Nagaraj, Prateek Jain, Praneeth Netrapalli

Keywords: theory

14:43

25/07/2020

Training effective neural CLIR by bridging the translation gap

Hamed Bonab, Sheikh Muhammad Sarwar, James Allan

Keywords: cross-lingual word embedding, cross-lingual information retrieval, neural clir, translation gap

15:33

14/06/2020

On Vocabulary Reliance in Scene Text Recognition

Zhaoyi Wan, Jielei Zhang, Liang Zhang, Jiebo Luo, Cong Yao

Keywords: scene text recognition, text spotting, document analysis, ocr, scene text detection, sequence recognition, language and vision

1:00

04/07/2020

Towards Robustifying NLI Models Against Lexical Dataset Biases

Xiang Zhou, Mohit Bansal

Keywords: Natural Inference, data augmentation, Robustifying Models, deep models

11:34

14/06/2020

Inflated Episodic Memory With Region Self-Attention for Long-Tailed Visual Recognition

Linchao Zhu, Yi Yang

Keywords: long-tailed visual recognition, region self-attention, inflated episodic memory, long-tailed video classification

1:00

03/05/2021

Supervised Contrastive Learning for Pre-trained Language Model Fine-tuning

Beliz Gunel, Jingfei Du, Alexis Conneau, Veselin Stoyanov

Keywords: supervised contrastive learning, pre-trained language model fine-tuning, natural language understanding, generalization, few-shot learning, robustness

4:44

13/04/2021

Nonlinear functional output regression: A dictionary approach

Dimitri Bouche, Marianne Clausel, François Roueff, Florence d’Alché-Buc

3:04

22/11/2021

Alleviating Noisy-label Effects in Image Classification via Probability Transition Matrix

Ziqi Zhang, Yuexiang Li, Hongxin Wei, Kai Ma, Tao Xu, Yefeng Zheng

Keywords: noisy labels, image classification, instance selection, robust learning, inter-class correlation, soft label, medical image

2:52

26/04/2020

Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification

Yixiao Ge, Dapeng Chen, Hongsheng Li

Keywords: Label Refinery, Unsupervised Domain Adaptation, Person Re-identification

5:03

18/07/2021

Order-Agnostic Cross Entropy for Non-Autoregressive Machine Translation

Cunxiao Du, Zhaopeng Tu, Jing Jiang

Keywords: Applications, Natural Language Processing

17:21

02/02/2021

Non-Autoregressive Coarse-to-Fine Video Captioning

Bang Yang, Yuexian Zou, Fenglin Liu, Can Zhang

18:21

06/12/2021

Consistency Regularization for Variational Auto-Encoders

Samarth Sinha, Adji Bousso Dieng

Keywords: deep learning, machine learning, self-supervised learning, generative model, contrastive learning, representation learning

10:52

07/09/2020

Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding

Binh Nguyen, Binh Nguyen, Gustavo Carneiro, Erman Tjiputra, Quang Tran, Thanh-Toan Do

Keywords: unsupervised deep metric learning, unsupervised feature learning, unsupervised metric loss, negative mining, deep clustering, pseudo labels, reconstruction, centroid representations, retrieval, multi-task

6:18

03/05/2021

Contrastive Learning with Hard Negative Samples

Joshua Robinson, Ching-Yao Chuang, Suvrit Sra, Stefanie Jegelka

Keywords: hard negative sampling, unsupervised representation learning, contrastive learning

5:09

26/04/2020

Residual Energy-Based Models for Text Generation

Yuntian Deng, Anton Bakhtin, Myle Ott, Arthur Szlam, Marc'Aurelio Ranzato

Keywords: energy-based models, text generation

4:59

04/07/2020

Addressing Posterior Collapse with Mutual Information for Improved Variational Neural Machine Translation

Arya D. McCarthy, Xian Li, Jiatao Gu, Ning Dong

Keywords: Variational Translation, posterior collapse, auxiliary task, uncertainty

11:00

18/07/2021

Class2Simi: A Noise Reduction Perspective on Learning with Noisy Labels

Songhua Wu, Xiaobo Xia, Tongliang Liu, Bo Han, Mingming Gong, Nannan Wang, Haifeng Liu, Gang Niu

Keywords: Algorithms, Semi-Supervised Learning

4:54

02/02/2021

Decoupled and Memory-Reinforced Networks: Towards Effective Feature Learning for One-Step Person Search

Chuchu Han, Zhedong Zheng, Changxin Gao, Nong Sang, Yi Yang

10:34

02/02/2021

Deep Semantic Dictionary Learning for Multi-label Image Classification

Fengtao Zhou, Sheng Huang, Yun Xing

15:06