Two Causal Principles for Improving Visual Dialog

14/06/2020

Jiaxin Qi, Yulei Niu, Jianqiang Huang, Hanwang Zhang

Keywords: visual dialog, vision and language, causality

Abstract: This paper unravels the design tricks adopted by us, the champion team MReaL-BDAI, for Visual Dialog Challenge 2019: two causal principles for improving Visual Dialog (VisDial). By "improving", we mean that they can promote almost every existing VisDial model to state-of-the-art performance on the leaderboard. Such a major improvement is due solely to our careful inspection of the causality behind the model and data, which finds that the community has overlooked two causalities in VisDial. Intuitively, Principle 1 suggests that we should remove the direct input of the dialog history to the answer model; otherwise, a harmful shortcut bias will be introduced. Principle 2 says that there is an unobserved confounder for history, question, and answer, leading to spurious correlations from training data. In particular, to remove the confounder suggested in Principle 2, we propose several causal intervention algorithms, which make the training fundamentally different from traditional likelihood estimation. Note that the two principles are model-agnostic, so they are applicable to any VisDial model.
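To make the contrast between likelihood estimation and causal intervention concrete, here is a minimal toy sketch. It is not the authors' algorithm: the binary variables, the probability tables, and the function names are all assumptions made up for illustration. The confounder u is also treated as known so that the adjustment can be computed exactly, whereas the abstract describes an unobserved confounder, which this sketch does not address.

```python
# Toy model: a confounder u influences both the question q and the answer a.
# All probabilities below are invented for illustration only.
P_U = {0: 0.5, 1: 0.5}            # prior P(u)
P_Q1_GIVEN_U = {0: 0.1, 1: 0.9}   # P(q=1 | u)
# P(a=1 | q, u): the answer depends on u only, so q has no causal effect on a.
P_A1_GIVEN_QU = {(0, 0): 0.2, (1, 0): 0.2, (0, 1): 0.8, (1, 1): 0.8}


def p_q_given_u(q, u):
    """P(q | u) for binary q."""
    p1 = P_Q1_GIVEN_U[u]
    return p1 if q == 1 else 1.0 - p1


def observational(q):
    """P(a=1 | q): weights each u by the posterior P(u | q)."""
    joint = {u: P_U[u] * p_q_given_u(q, u) for u in P_U}
    norm = sum(joint.values())
    return sum(P_A1_GIVEN_QU[(q, u)] * joint[u] / norm for u in P_U)


def interventional(q):
    """P(a=1 | do(q)): backdoor adjustment, weights each u by the prior P(u)."""
    return sum(P_A1_GIVEN_QU[(q, u)] * P_U[u] for u in P_U)


if __name__ == "__main__":
    for q in (0, 1):
        print(q, observational(q), interventional(q))
```

In this toy, the observational estimate changes with q (about 0.26 for q=0 and 0.74 for q=1) even though q has no causal effect on a, while the interventional estimate stays at about 0.5 for both values. That gap is the kind of spurious correlation a likelihood-trained model would pick up.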

The talk and the respective paper are published at the CVPR 2020 virtual conference.

Similar Papers

30/11/2020

Exploiting Transferable Knowledge for Fairness-aware Image Classification

Sunhee Hwang, Sungho Park, Pilhyeon Lee, Seogkyu Jeon, Dohyung Kim, Hyeran Byun

5:56

02/02/2021

A Case Study of the Shortcut Effects in Visual Commonsense Reasoning

Keren Ye, Adriana Kovashka

14:26

26/04/2020

Mixup Inference: Better Exploiting Mixup to Defend Adversarial Attacks

Tianyu Pang, Kun Xu, Jun Zhu

Keywords: Trustworthy Machine Learning, Adversarial Robustness, Inference Principle, Mixup

4:59

06/12/2021

Interactive Label Cleaning with Example-based Explanations

Stefano Teso, Andrea Bontempelli, Fausto Giunchiglia, Andrea Passerini

Keywords: active learning

12:23

06/12/2020

Pixel-Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

Guoliang Kang, Yunchao Wei, Yi Yang, Yueting Zhuang, Alexander Hauptmann

3:16

14/09/2020

Network Cooperation with Progressive Disambiguation for Partial Label Learning

Yao Yao, Chen Gong, Jiehui Deng, Jian Yang

Keywords: weakly-supervised learning, partial label learning, progressive disambiguation, network cooperation

10:19

06/12/2020

Make One-Shot Video Object Segmentation Efficient Again

Tim Meinhardt, Laura Leal-Taixé

3:17

14/06/2020

Suppressing Uncertainties for Large-Scale Facial Expression Recognition

Kai Wang, Xiaojiang Peng, Jianfei Yang, Shijian Lu, Yu Qiao

Keywords: emotion recognition, self-cure network, uncertainties

1:01

14/06/2020

Auxiliary Training: Towards Accurate and Robust Models

Linfeng Zhang, Muzhou Yu, Tong Chen, Zuoqiang Shi, Chenglong Bao, Kaisheng Ma

Keywords: model robustness, data augmentation, adversarial attack, training method, classification

0:56

18/07/2021

Matrix Sketching for Secure Collaborative Machine Learning

Mengjiao Zhang, Shusen Wang

Keywords: Social Aspects of Machine Learning, Privacy, Anonymity, and Security

4:25

14/06/2020

Achieving Robustness in the Wild via Adversarial Mixing With Disentangled Representations

Sven Gowal, Chongli Qin, Po-Sen Huang, Taylan Cemgil, Krishnamurthy Dvijotham, Timothy Mann, Pushmeet Kohli

Keywords: real-world robustness, adversarial examples, disentangled latents, generative models, spurious correlations

1:01

05/01/2021

Representation Learning With Statistical Independence to Mitigate Bias

Ehsan Adeli, Qingyu Zhao, Adolf Pfefferbaum, Edith V. Sullivan, Li Fei-Fei, Juan Carlos Niebles, Kilian M. Pohl

4:33

06/12/2020

Teaching a GAN What Not to Learn

Siddarth Asokan, Chandra Seelamantula

3:31

12/07/2020

Learning with Multiple Complementary Labels

Lei Feng, Takuo Kaneko, Bo Han, Gang Niu, Bo An, Masashi Sugiyama

Keywords: Unsupervised and Semi-Supervised Learning

10:19

02/02/2021

Deep Open Intent Classification with Adaptive Decision Boundary

Hanlei Zhang, Hua Xu, Ting-En Lin

13:40

06/12/2021

Believe What You See: Implicit Constraint Approach for Offline Multi-Agent Reinforcement Learning

Yiqin Yang, Xiaoteng Ma, Li Chenghao, Zewu Zheng, Qiyuan Zhang, Gao Huang, Jun Yang, Qianchuan Zhao

Keywords: reinforcement learning and planning

13:36

02/02/2021

Few-Shot Lifelong Learning

Pratik Mazumder, Pravendra Singh, Piyush Rai

18:14

19/08/2021

A Structure Self-Aware Model for Discourse Parsing on Multi-Party Dialogues

Ante Wang, Linfeng Song, Hui Jiang, Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords: Natural Language Processing, Dialogue, Discourse, Tagging, Chunking, and Parsing

8:33

14/06/2020

Modeling the Background for Incremental Learning in Semantic Segmentation

Fabio Cermelli, Massimiliano Mancini, Samuel Rota Bulò, Elisa Ricci, Barbara Caputo

Keywords: incremental, learning, semantic, segmentation, continual, catastrophic, forgetting, scene, parsing

1:01

06/12/2020

Differentiable Augmentation for Data-Efficient GAN Training

Shengyu Zhao, Zhijian Liu, Ji Lin, Jun-Yan Zhu, Song Han

3:22

26/04/2020

Adversarially Robust Representations with Smooth Encoders

Taylan Cemgil, Sumedh Ghaisas, Krishnamurthy (Dj) Dvijotham, Pushmeet Kohli

Keywords: Adversarial Learning, Robust Representations, Variational AutoEncoder, Wasserstein Distance, Variational Inference

5:16

06/12/2020

Learning from Failure: De-biasing Classifier from Biased Classifier

Junhyun Nam, Hyuntak Cha, Sungsoo Ahn, Jaeho Lee, Jinwoo Shin

3:21

14/06/2020

Deep Spatial Gradient and Temporal Depth Learning for Face Anti-Spoofing

Zezheng Wang, Zitong Yu, Chenxu Zhao, Xiangyu Zhu, Yunxiao Qin, Qiusheng Zhou, Feng Zhou, Zhen Lei

Keywords: face anti-spoofing, depth supervised learning, multiple frames, detailed discriminative clues, 3d moving faces

4:57

06/12/2020

Improving Generalization in Reinforcement Learning with Mixture Regularization

Kaixin Wang, Bingyi Kang, Jie Shao, Jiashi Feng

3:14

14/06/2020

Reusing Discriminators for Encoding: Towards Unsupervised Image-to-Image Translation

Runfa Chen, Wenbing Huang, Binghui Huang, Fuchun Sun, Bin Fang

Keywords: nice-gan, reusing discriminators for encoding, unsupervised image-to-image translation, decoupled training, multi-scale discriminators, adversarial loss, no independent component for encoding, shared layers, residual attention, cyclegan

1:01

06/12/2021

Curriculum Offline Imitating Learning

Minghuan Liu, Hanye Zhao, Zhengyu Yang, Jian Shen, Weinan Zhang, Li Zhao, Tie-Yan Liu

Keywords: reinforcement learning and planning

14:28

06/12/2021

Supervising the Transfer of Reasoning Patterns in VQA

Corentin Kervadec, Christian Wolf, Grigory Antipov, Moez Baccouche, Madiha Nadri

Keywords: theory, deep learning, vision

12:54

01/07/2020

Simple Compounded-Label Training for Fact Extraction and Verification

Yixin Nie, Lisa Bauer, Mohit Bansal

9:59

07/09/2020

Transferring Pretrained Networks to Small Data via Category Decorrelation

Ying Jin, Zhangjie Cao, Mingsheng Long, Jianmin Wang

Keywords: Category Decorrelation, Under Transfer

8:39

06/12/2021

Rethinking conditional GAN training: An approach using geometrically structured latent manifolds

Sameera Ramasinghe, Moshiur Farazi, Salman H Khan, Nick Barnes, Stephen Gould

Keywords: generative model

8:00

06/12/2021

Data-Efficient Instance Generation from Instance Discrimination

Ceyuan Yang, Yujun Shen, Yinghao Xu, Bolei Zhou

Keywords: machine learning, generative model

6:53

06/12/2021

Conservative Data Sharing for Multi-Task Offline Reinforcement Learning

Tianhe Yu, Aviral Kumar, Yevgen Chebotar, Karol Hausman, Sergey Levine, Chelsea Finn

Keywords: reinforcement learning and planning

14:27

26/04/2020

Mutual Mean-Teaching: Pseudo Label Refinery for Unsupervised Domain Adaptation on Person Re-identification

Yixiao Ge, Dapeng Chen, Hongsheng Li

Keywords: Label Refinery, Unsupervised Domain Adaptation, Person Re-identification

5:03

02/02/2021

Adversarial Training Reduces Information and Improves Transferability

Matteo Terzi, Alessandro Achille, Marco Maggipinto, Gian Antonio Susto

19:54

14/06/2020

Style Normalization and Restitution for Generalizable Person Re-Identification

Xin Jin, Cuiling Lan, Wenjun Zeng, Zhibo Chen, Li Zhang

Keywords: generalizable person re-identification, style normalization and restitution, feature disentanglement, identity-relevant and irrelevant features

1:01

06/12/2020

LoopReg: Self-supervised Learning of Implicit Surface Correspondences, Pose and Shape for 3D Human Mesh Registration

Bharat Lal Bhatnagar, Cristian Sminchisescu, Christian Theobalt, Gerard Pons-Moll

3:21

02/02/2021

Controllable Guarantees for Fair Outcomes via Contrastive Information Estimation

Umang Gupta, Aaron M Ferber, Bistra Dilkina, Greg Ver Steeg

16:48

03/05/2021

Witches' Brew: Industrial Scale Data Poisoning via Gradient Matching

Jonas Geiping, Liam H Fowl, Ronny Huang, Wojciech Czaja, Gavin Taylor, Michael Moeller, Tom Goldstein

Keywords: clean-label, from-scratch, Backdoor Attacks, Gradient Alignment, Large-scale, Data Poisoning, ImageNet, Security

4:40

05/01/2021

ADA-AT/DT: An Adversarial Approach for Cross-Domain and Cross-Task Knowledge Transfer

Ruchika Chavhan, Ankit Jha, Biplab Banerjee, Subhasis Chaudhuri

6:09

26/04/2020

Adversarial AutoAugment

Xinyu Zhang, Qiang Wang, Jian Zhang, Zhao Zhong

Keywords: Automatic Data Augmentation, Adversarial Learning, Reinforcement Learning

4:30