Single-Modal Entropy based Active Learning for Visual Question Answering

22/11/2021

Single-Modal Entropy based Active Learning for Visual Question Answering

Dong-Jin Kim, Jae Won Cho, Jinsoo Choi, Yunjae Jung, In So Kweon

Keywords: Visual Question Answering, Vision and Language, Active Learning

Abstract Paper Similar Papers

Abstract: Constructing a large-scale labeled dataset in the real world, especially for high-level tasks (e.g, Visual Question Answering), can be expensive and time-consuming. In addition, with the ever-growing amounts of data and architecture complexity, Active Learning has become an important aspect of computer vision research. In this work, we address Active Learning in the multi-modal setting of Visual Question Answering (VQA). In light of the multi-modal inputs, image and question, we propose a novel method for effective sample acquisition through the use of ad hoc single-modal branches for each input to leverage its information. Our mutual information based sample acquisition strategy Single-Modal Entropic Measure (SMEM) in addition to our self-distillation technique enables the sample acquisitor to exploit all present modalities and find the most informative samples. Our novel idea is simple to implement, cost-efficient, and readily adaptable to other multi-modal tasks. We confirm our findings on various VQA datasets through state-of-the-art performance by comparing to existing Active Learning baselines.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at BMVC 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Over-MAP: Structural Attention Mechanism and Automated Semantic Segmentation Ensembled for Uncertainty Prediction

Charles A. Kantor, Léonard Boussioux, Brice Rauby, Hugues Talbot

Keywords Paper

0

0

0

0

16:38

18/07/2021

Data Augmentation for Meta-Learning

Renkun Ni, Micah Goldblum, Amr Sharaf and
Kezhi Kong, Tom Goldstein

Keywords Paper

Deep Learning

0

0

0

0

5:09

14/06/2020

Focus on Defocus: Bridging the Synthetic to Real Domain Gap for Depth Estimation

Maxim Maximov, Kevin Galim, Laura Leal-Taixé

Keywords Paper

depth estimation, generalisation, depth from focus, blur estimation, depth

0

0

0

0

1:01

23/08/2020

Spectrum-guided adversarial disparity learning

Zhe Liu, Lina Yao, Lei Bai and
Xianzhi Wang, Can Wang

Keywords Paper

adversarial autoencoder, generative models, intraclass variability, activity recognition

0

0

0

0

14:30

12/07/2020

Revisiting Training Strategies and Generalization Performance in Deep Metric Learning

Karsten Roth, Timo Milbich, Samrath Sinha and
Prateek Gupta, Bjorn Ommer, Joseph Paul Cohen

Keywords Paper

Applications - Computer Vision

0

0

0

0

14:15

06/12/2021

Referring Transformer: A One-step Approach to Multi-task Visual Grounding

Muchen Li, Leonid Sigal

Keywords Paper

transformers, vision

0

0

0

0

7:54

02/02/2021

End-to-End Differentiable Learning to HDR Image Synthesis for Multi-exposure Images

Junghee Kim, Siyeong Lee, Suk-Ju Kang

Keywords Paper

0

0

0

0

15:35

26/04/2020

Frequency-based Search-control in Dyna

Yangchen Pan, Jincheng Mei, Amir-massoud Farahmand

Keywords Paper

Model-based reinforcement learning, search-control, Dyna, frequency of a signal

0

0

0

0

4:32

26/04/2020

Neural Outlier Rejection for Self-Supervised Keypoint Learning

Jiexiong Tang, Hanme Kim, Vitor Guizilini and
Sudeep Pillai, Rares Ambrus

Keywords Paper

Self-Supervised Learning, Keypoint Detection, Outlier Rejection, Deep Learning

0

0

0

0

4:55

05/01/2021

Rotate to Attend: Convolutional Triplet Attention Module

Diganta Misra, Trikay Nalamada, Ajay Uppili Arasanipalai, Qibin Hou

Keywords Paper

0

0

0

0

4:47

03/05/2021

$i$-Mix: A Domain-Agnostic Strategy for Contrastive Representation Learning

Kibok Lee, Yian Zhu, Kihyuk Sohn and
Chun-Liang Li, Jinwoo Shin, Honglak Lee

Keywords Paper

self-supervised learning, unsupervised representation learning, data augmentation, MixUp, contrastive representation learning

0

0

0

0

5:04

03/05/2021

Learning perturbation sets for robust machine learning

Eric Wong, Zico Kolter

Keywords Paper

conditional variational autoencoder, adversarial examples, perturbation sets, robust machine learning

0

1

0

0

5:06

26/04/2020

Decoupling Representation and Classifier for Long-Tailed Recognition

Bingyi Kang, Saining Xie, Marcus Rohrbach and
Zhicheng Yan, Albert Gordo, Jiashi Feng, Yannis Kalantidis

Keywords Paper

long-tailed recognition, classification

0

0

0

1

5:00

06/12/2021

Align before Fuse: Vision and Language Representation Learning with Momentum Distillation

Junnan Li, Ramprasaath Selvaraju, Akhilesh Gotmare and
Shafiq Joty, Caiming Xiong, Steven Chu Hong Hoi

Keywords Paper

transformers, vision, representation learning

0

0

0

0

9:40

14/06/2020

Gold Seeker: Information Gain From Policy Distributions for Goal-Oriented Vision-and-Langauge Reasoning

Ehsan Abbasnejad, Iman Abbasnejad, Qi Wu and
Javen Shi, Anton van den Hengel

Keywords Paper

information-seeking agent vision and language tasks vqa interactive agents reinforcement learning

0

0

0

0

0:59

05/01/2021

SSGP: Sparse Spatial Guided Propagation for Robust and Generic Interpolation

Rene Schuster, Oliver Wasenmuller, Christian Unger, Didier Stricker

Keywords Paper

0

0

0

0

4:53

06/12/2020

Boosting Adversarial Training with Hypersphere Embedding

Tianyu Pang, Xiao Yang, Yinpeng Dong and
Kun Xu, Jun Zhu, Hang Su

Keywords Paper

0

0

0

0

2:59

04/07/2020

Generalizing Natural Language Analysis through Span-relation Representations

Zhengbao Jiang, Wei Xu, Jun Araki, Graham Neubig

Keywords Paper

Natural Analysis, Natural processing, dependency parsing, semantic labeling

0

0

0

0

8:30

06/12/2020

Network-to-Network Translation with Conditional Invertible Neural Networks

Robin Rombach, Patrick Esser, Bjorn Ommer

Keywords Paper

0

0

0

0

3:25

06/12/2020

Self-Learning Transformations for Improving Gaze and Head Redirection

Yufeng Zheng, Seonwook Park, Xucong Zhang and
Shalini De Mello, Otmar Hilliges

Keywords Paper

0

0

0

0

3:20

12/07/2020

TaskNorm: Rethinking Batch Normalization for Meta-Learning

John Bronskill, Jonathan Gordon, James Requeima and
Sebastian Nowozin, Richard Turner

Keywords Paper

Transfer, Multitask and Meta-learning

0

0

0

0

13:56

14/06/2020

Self-Supervised Monocular Trained Depth Estimation Using Self-Attention and Discrete Disparity Volume

Adrian Johnston, Gustavo Carneiro

Keywords Paper

self-supervised depth estimation, self-supervised learning, self-attention, depth estimation, uncertainty

0

0

0

0

1:01

03/08/2020

Walking on Two Legs: Learning Image Segmentation with Noisy Labels

Guohua Cheng, Hongli Ji, Yan Tian

Keywords Paper

0

0

0

0

10:02

05/01/2021

Distillation Multiple Choice Learning for Multimodal Action Recognition

Nuno Cruz Garcia, Sarah Adel Bargal, Vitaly Ablavsky and
Pietro Morerio, Vittorio Murino, Stan Sclaroff

Keywords Paper

0

0

0

1

4:31

06/12/2021

Twins: Revisiting the Design of Spatial Attention in Vision Transformers

Xiangxiang Chu, Zhi Tian, Yuqing Wang and
Bo Zhang, Haibing Ren, Xiaolin Wei, Huaxia Xia, Chunhua Shen

Keywords Paper

deep learning, machine learning, transformers, vision

0

0

0

0

5:29

06/12/2021

Batch Active Learning at Scale

Gui Citovsky, Giulia DeSalvo, Claudio Gentile and
Lazaros Karydas, Anand Rajagopalan, Afshin Rostamizadeh, Sanjiv Kumar

Keywords Paper

active learning

0

0

0

0

12:19

18/07/2021

Quasi-global Momentum: Accelerating Decentralized Deep Learning on Heterogeneous Data

Tao Lin, Praneeth Karimireddy, Sebastian Stich, Martin Jaggi

Keywords Paper

Deep Learning, Optimization for Deep Networks

0

0

0

0

5:14

13/04/2021

Sample elicitation

Jiaheng Wei, Zuyue Fu, Yang Liu and
Xingyu Li, Zhuoran Yang, Zhaoran Wang

Keywords Paper

0

0

0

0

3:16

26/04/2020

Learn to Explain Efficiently via Neural Logic Inductive Learning

Yuan Yang, Le Song

Keywords Paper

inductive logic programming, interpretability, attention

0

0

0

0

5:01

18/07/2021

AutoSampling: Search for Effective Data Sampling Schedules

MING SUN, Haoxuan Dou, Baopu Li and
Junjie Yan, Wanli Ouyang, Lei Cui

Keywords Paper

Algorithms, AutoML

0

0

0

0

6:05

06/12/2020

Deep Transformation-Invariant Clustering

Tom Monnier, Thibault Groueix, Mathieu Aubry

Keywords Paper

0

0

0

0

3:22

14/06/2020

A Unified Object Motion and Affinity Model for Online Multi-Object Tracking

Junbo Yin, Wenguan Wang, Qinghao Meng and
Ruigang Yang, Jianbing Shen

Keywords Paper

mot, multi-task learning, motion, affinity, attention, online

0

0

0

0

1:03

26/04/2020

On the Relationship between Self-Attention and Convolutional Layers

Jean-Baptiste Cordonnier, Andreas Loukas, Martin Jaggi

Keywords Paper

self-attention, attention, transformers, convolution, CNN, image, expressivity, capacity

0

0

0

0

5:18

14/06/2020

Memory Aggregation Networks for Efficient Interactive Video Object Segmentation

Jiaxu Miao, Yunchao Wei, Yi Yang

Keywords Paper

interactive video object segmentation, pixel embedding learning, memory aggregation networks

0

0

0

0

0:59

22/11/2021

Cross-Modal Generative Augmentation for Visual Question Answering

Zixu Wang, Yishu Miao, Lucia Specia

Keywords Paper

visual question answering, data augmentation, generative model, multimodal machine learning

0

0

0

0

2:49

06/12/2020

NVAE: A Deep Hierarchical Variational Autoencoder

Arash Vahdat, Jan Kautz

Keywords Paper

0

0

0

0

3:37

06/12/2020

LAPAR: Linearly-Assembled Pixel-Adaptive Regression Network for Single Image Super-resolution and Beyond

Wenbo Li, Kun Zhou, lu Qi and
Nianjuan Jiang, Jiangbo Lu, Jiaya Jia

Keywords Paper

0

0

0

0

3:09

14/06/2020

Adaptive Dilated Network With Self-Correction Supervision for Counting

Shuai Bai, Zhiqun He, Yu Qiao and
Hanzhe Hu, Wei Wu, Junjie Yan

Keywords Paper

crowd counting, self-correction, convolutional neural network

0

0

0

0

0:59

14/06/2020

Deep Homography Estimation for Dynamic Scenes

Hoang Le, Feng Liu, Shu Zhang, Aseem Agarwala

Keywords Paper

homography estimation, dynamic scenes, motion estimation, multi-task learning, deep learning

0

0

0

0

1:01

14/06/2020

Webly Supervised Knowledge Embedding Model for Visual Reasoning

Wenbo Zheng, Lan Yan, Chao Gou, Fei-Yue Wang

Keywords Paper

visual reasoning, webly supervised learning

0

0

0

0

1:01