Counterfactual Vision and Language Learning

14/06/2020

Ehsan Abbasnejad, Damien Teney, Amin Parvaneh, Javen Shi, Anton van den Hengel

Keywords: counterfactual reasoning vision and language tasks vqa

Abstract: The ongoing success of visual question answering methods has been somewhat surprising given that, at its most general, the problem requires understanding the entire variety of both visual and language stimuli. It is particularly remarkable that this success has been achieved on the basis of comparatively small datasets, given the scale of the problem. One explanation is that this has been accomplished partly by exploiting bias in the datasets rather than developing deeper multi-modal reasoning. This fundamentally limits the generalization of the method, and thus its practical applicability. We propose a method that addresses this problem by introducing counterfactuals in the training. In doing so we leverage structural causal models for counterfactual evaluation to formulate alternatives, for instance, questions that could be asked of the same image set. We show that simulating plausible alternative training data through this process results in better generalization.

The talk and the accompanying paper were published at the CVPR 2020 virtual conference.

Similar Papers

14/06/2020

On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering

Xinyu Wang, Yuliang Liu, Chunhua Shen, Chun Chet Ng, Canjie Luo, Lianwen Jin, Chee Seng Chan, Anton van den Hengel, Liangwei Wang

Keywords: visual question answering, scene text, ocr

1:01

16/11/2020

Learning to Contrast the Counterfactual Samples for Robust Visual Question Answering

Zujie Liang, Weitao Jiang, Haifeng Hu, Jiaying Zhu

Keywords: visual, generating samples, augmentation, self-supervised mechanism

2:00

26/04/2020

I Am Going MAD: Maximum Discrepancy Competition for Comparing Classifiers Adaptively

Haotao Wang, Tianlong Chen, Zhangyang Wang, Kede Ma

Keywords: model comparison

4:53

02/02/2021

A Case Study of the Shortcut Effects in Visual Commonsense Reasoning

Keren Ye, Adriana Kovashka

14:26

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords: reinforcement learning and planning, transformers

8:43

06/12/2021

Designing Counterfactual Generators using Deep Model Inversion

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan, Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords: optimization, representation learning, interpretability

13:28

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

19:54

06/12/2021

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang, Yueting Zhuang, Zhen He, Zhuoye Ding, Yun Xiao, Bo Long

Keywords: generative model

14:54

06/12/2021

Debiased Visual Question Answering from Feature and Sample Perspectives

Zhiquan Wen, Guanghui Xu, Mingkui Tan, Qingyao Wu, Qi Wu

Keywords: vision

11:20

14/06/2020

TA-Student VQA: Multi-Agents Training by Self-Questioning

Peixi Xiong, Ying Wu

Keywords: visual question answering, vqa, visual question generation, generative adversarial network

5:03

05/01/2021

Multimodal Prototypical Networks for Few-Shot Learning

Frederik Pahde, Mihai Puscas, Tassilo Klein, Moin Nabi

4:56

22/11/2021

Text-Based Person Search with Limited Data

Xiao Han, Sen He, Li Zhang, Tao Xiang

Keywords: person re-identification, cross-modal image retrieval, fine-grained image retrieval, text-based person search

3:04

06/12/2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling

Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangyang Wang

Keywords: contrastive learning

10:12

06/12/2021

Learning Debiased Representation via Disentangled Feature Augmentation

Jungsoo Lee, Eungyeup Kim, Juyoung Lee, Jihyeon Lee, Jaegul Choo

Keywords: machine learning, vision

20:06

03/05/2021

Relating by Contrasting: A Data-efficient Framework for Multimodal Generative Models

Yuge Shi, Brooks Paige, Philip Torr, Siddharth N

Keywords: deep generative model, representation learning, multi-modal learning

5:09

06/12/2020

Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

Nanbo Li, Cian Eastwood, Robert Fisher

3:19

14/06/2020

Hierarchically Robust Representation Learning

Qi Qian, Juhua Hu, Hao Li

Keywords: representation learning, hierarchical robustness

1:01

02/02/2021

Explainable Models with Consistent Interpretations

Vipin Pillai, Hamed Pirsiavash

16:20

14/06/2020

Inter-Task Association Critic for Cross-Resolution Person Re-Identification

Zhiyi Cheng, Qi Dong, Shaogang Gong, Xiatian Zhu

Keywords: person re-identification, cross-resolution person re-identification, inter-task, image super-resolution, low-resolution, image retrieval

4:58

06/12/2020

On the Value of Out-of-Distribution Testing: An Example of Goodhart's Law

Damien Teney, Ehsan Abbasnejad, Kushal Kafle, Robik Shrestha, Christopher Kanan, Anton van den Hengel

3:21

14/06/2020

Achieving Robustness in the Wild via Adversarial Mixing With Disentangled Representations

Sven Gowal, Chongli Qin, Po-Sen Huang, Taylan Cemgil, Krishnamurthy Dvijotham, Timothy Mann, Pushmeet Kohli

Keywords: real-world robustness, adversarial examples, disentangled latents, generative models, spurious correlations

1:01

30/11/2020

Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation

Siqi Yang, Lin Wu, Arnold Wiliem, Brian C. Lovell

6:32

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

15:15

06/12/2020

Self-Learning Transformations for Improving Gaze and Head Redirection

Yufeng Zheng, Seonwook Park, Xucong Zhang, Shalini De Mello, Otmar Hilliges

3:20

03/05/2021

What Makes Instance Discrimination Good for Transfer Learning?

Nanxuan Zhao, Zhirong Wu, Rynson W Lau, Stephen Lin

Keywords: unsupervised learning, transfer learning, self-supervised learning

5:10

22/11/2021

Looking at the whole picture: constrained unsupervised anomaly segmentation

Julio Silva-Rodríguez, Valery Naranjo, Jose Dolz

Keywords: unsupervised anomaly localization, brain lesion segmentation, constrained segmentation, size-constrained loss, class-activation maps, CAMs, log-barrier extension, BRATS19

2:57

06/12/2021

Visual Adversarial Imitation Learning using Variational Models

Rafael Rafailov, Tianhe Yu, Aravind Rajeswaran, Chelsea Finn

Keywords: theory, reinforcement learning and planning, adversarial robustness and security, representation learning

7:25

26/04/2020

Reinforced active learning for image segmentation

Arantxa Casanova, Pedro O. Pinheiro, Negar Rostamzadeh, Christopher J. Pal

Keywords: semantic segmentation, active learning, reinforcement learning

5:08

14/06/2020

Counterfactual Samples Synthesizing for Robust Visual Question Answering

Long Chen, Xin Yan, Jun Xiao, Hanwang Zhang, Shiliang Pu, Yueting Zhuang

Keywords: visual question answering, counterfactual, debias, language bias, data augmentation, visual-and-language

1:01

06/12/2020

Dual Manifold Adversarial Robustness: Defense against Lp and non-Lp Adversarial Attacks

Wei-An Lin, Chun Pong Lau, Alexander Levine, Rama Chellappa, Soheil Feizi

3:24

22/11/2021

In-N-Out: Towards Good Initialization for Inpainting and Outpainting

Changho Jo, Woobin Im, Sungeui Yoon

Keywords: inpainting, outpainting, extrapolation, environment map estimation, self-supervised learning, transfer learning

2:33

04/07/2020

Improving Image Captioning Evaluation by Considering Inter References Variance

Yanzhi Yi, Hangyu Deng, Jinglu Hu

Keywords: image evaluation, evaluating captions, system-level tasks, BERTScore

11:31

05/01/2021

Breaking Shortcuts by Masking for Robust Visual Reasoning

Keren Ye, Mingda Zhang, Adriana Kovashka

5:01

16/11/2020

Exposing Shallow Heuristics of Relation Extraction Models with Challenge Data

Shachar Rosenman, Alon Jacovi, Yoav Goldberg

Keywords: data process, re collection, sota models, tacred

5:55

26/04/2020

Neural Outlier Rejection for Self-Supervised Keypoint Learning

Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus

Keywords: self-supervised learning, keypoint detection, outlier rejection, deep learning

4:55

07/09/2020

From Saturation to Zero-Shot Visual Relationship Detection Using Local Context

Nikolaos Gkanatsios, Vassilis Pitsikalis, Petros Maragos

Keywords: visual relationship detection, scene graph generation, zero-shot classification, local context, language bias

7:17

06/12/2021

SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning

Talip Ucar, Ehsan Hajiramezanali, Lindsay Edwards

Keywords: self-supervised learning, contrastive learning, representation learning

13:28

14/06/2020

Domain Decluttering: Simplifying Images to Mitigate Synthetic-Real Domain Shift and Improve Depth Estimation

Yunhan Zhao, Shu Kong, Daeyun Shin, Charless Fowlkes

Keywords: monocular depth prediction, real-synthetic domain shift, synthetic training data, domain adaptation, image inpainting, high-level domain gaps

1:01

04/07/2020

What is Learned in Visually Grounded Neural Syntax Acquisition

Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush, Yoav Artzi

Keywords: Visually Acquisition, bootstrap models, blackbox models, visual components

6:41

26/04/2020

Robust Local Features for Improving the Generalization of Adversarial Training

Chuanbiao Song, Kun He, Jiadong Lin, Liwei Wang, John E. Hopcroft

Keywords: adversarial robustness, adversarial training, adversarial example, deep learning

4:01