Neuro-Symbolic Visual Reasoning: Disentangling "Visual" from "Reasoning"

12/07/2020

Saeed Amizadeh, Hamid Palangi, Oleksandr Polozov, Yichen Huang, Kazuhito Koishida

Keywords: Applications - Computer Vision

Abstract: Visual reasoning tasks such as visual question answering (VQA) require an interplay of visual perception with reasoning about the question semantics grounded in perception. Challenges like VCR (Zellers et al., 2019) and GQA (Hudson & Manning, 2019) facilitate scientific progress from perception models to visual reasoning. However, recent advances on GQA are still primarily driven by perception improvements (e.g., scene graph generation) rather than reasoning. Neuro-symbolic models such as MAC (Hudson & Manning, 2018) bring the benefits of compositional reasoning to VQA, but they are still entangled with visual representation learning, and thus neural reasoning is hard to improve and assess on its own. To address this, we propose (1) a framework to isolate and evaluate the reasoning aspect of VQA separately from its perception, and (2) a novel top-down calibration technique that allows the model to answer reasoning questions even with imperfect perception. To this end, we introduce a differentiable first-order logic formalism for VQA that explicitly decouples question answering from visual perception. On the challenging GQA dataset, this approach is competitive with non-symbolic neural models while also being interpretable by construction, composable with arbitrary pre-trained visual representation learning, and requiring far fewer parameters.
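
As a rough illustration of what a differentiable relaxation of first-order logic can look like, the sketch below scores the question "Is there a black cat?" over per-object probabilities. It is not the authors' implementation: the product-based connectives, the function names (soft_and, soft_not, soft_exists, soft_forall) and the toy objects table are all assumptions made here for illustration, with the table standing in for the output of any pre-trained perception model.

```python
from math import prod


def soft_and(a, b):
    """Product relaxation of logical AND on probabilities in [0, 1]."""
    return a * b


def soft_not(a):
    """Relaxed negation."""
    return 1.0 - a


def soft_exists(alphas):
    """Relaxed 'there exists': 1 - prod(1 - alpha_i)."""
    return 1.0 - prod(1.0 - a for a in alphas)


def soft_forall(alphas):
    """Relaxed 'for all': prod(alpha_i)."""
    return prod(alphas)


# Hypothetical perception output: per-object predicate probabilities.
# In practice these would come from a visual model; here they are made up.
objects = [
    {"cat": 0.9, "black": 0.8},
    {"cat": 0.1, "black": 0.7},
    {"cat": 0.0, "black": 0.2},
]

# "Is there a black cat?"  ->  exists x. cat(x) AND black(x)
per_object = [soft_and(o["cat"], o["black"]) for o in objects]
answer = soft_exists(per_object)
print(f"P(there is a black cat) ~ {answer:.4f}")
```

Because every operation is a product or a difference, the answer score is differentiable with respect to the perception outputs, which is the kind of property that lets a reasoning layer be examined separately from the visual model feeding it.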

The talk and the respective paper were published at the ICML 2020 virtual conference.

Similar Papers

06/12/2020

Learning Object-Centric Representations of Multi-Object Scenes from Multiple Views

Nanbo Li, Cian Eastwood, Robert Fisher

3:19

06/12/2021

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang, Yueting Zhuang, Zhen He, Zhuoye Ding, Yun Xiao, Bo Long

Keywords: generative model

14:54

02/02/2021

Gaussian Process Priors for View-Aware Inference

Yuxin Hou, Ari Heljakka, Arno Solin

14:48

06/12/2021

Scallop: From Probabilistic Deductive Databases to Scalable Differentiable Reasoning

Jiani Huang, Ziyang Li, Binghong Chen, Karan Samel, Mayur Naik, Le Song, Xujie Si

Keywords: deep learning, transformers, vision

15:02

22/11/2021

Perception Visualization: Seeing Through The Eyes Of a DNN

Loris Giulivi, Mark Carman, Giacomo Boracchi

Keywords: explainable artificial intelligence, saliency maps, convolutional neural networks, latent representations

3:04

19/04/2021

Modeling coreference relations in visual dialog

Mingxiao Li, Marie-Francine Moens

10:33

05/01/2021

Towards Visually Explaining Video Understanding Networks With Perturbation

Zhenqiang Li, Weimin Wang, Zuoyue Li, Yifei Huang, Yoichi Sato

4:53

06/12/2021

Stabilizing Deep Q-Learning with ConvNets and Vision Transformers under Data Augmentation

Nicklas Hansen, Hao Su, Xiaolong Wang

Keywords: reinforcement learning and planning, transformers

8:43

06/12/2021

Designing Counterfactual Generators using Deep Model Inversion

Jayaraman Thiagarajan, Vivek Sivaraman Narayanaswamy, Deepta Rajan, Jia Liang, Akshay Chaudhari, Andreas Spanias

Keywords: optimization, representation learning, interpretability

13:28

06/12/2021

Supervising the Transfer of Reasoning Patterns in VQA

Corentin Kervadec, Christian Wolf, Grigory Antipov, Moez Baccouche, Madiha Nadri

Keywords: theory, deep learning, vision

12:54

14/06/2020

Attack to Explain Deep Representation

Mohammad A. A. K. Jalwana, Naveed Akhtar, Mohammed Bennamoun, Ajmal Mian

Keywords: interpreting deep learning, adversarial attack, explanation attack, explainable ai, image generation

1:01

26/04/2020

Disentangling neural mechanisms for perceptual grouping

Junkyung Kim, Drew Linsley, Kalpit Thakkar, Thomas Serre

Keywords: Perceptual grouping, visual cortex, recurrent feedback, horizontal connections, top-down connections

5:16

22/11/2021

Gradient Frequency Modulation for Visually Explaining Video Understanding Models

Xin Miao Lin, Wentao Bao, Matthew Wright, Yu Kong

Keywords: model explanation, model explainability, explainable AI, video action recognition, Discrete Fourier Transform, video perturbation, interpretable machine learning, video model explanation, frequency modulation, spatiotemporal consistency

2:53

14/06/2020

RL-CycleGAN: Reinforcement Learning Aware Simulation-to-Real

Kanishka Rao, Chris Harris, Alex Irpan, Sergey Levine, Julian Ibarz, Mohi Khansari

Keywords: robotics, sim2real, cyclegan, reinforcement learning, grasping, q-learning

4:55

06/12/2021

Predify: Augmenting deep neural networks with brain-inspired predictive coding dynamics

Bhavin Choksi, Milad Mozafari, Callum Biggs O'May, B. ADOR, Andrea Alamia, Rufin VanRullen

Keywords: deep learning, machine learning, robustness, adversarial robustness and security, neuroscience, vision

11:21

06/12/2020

Reinforcement Learning with Augmented Data

Misha Laskin, Kimin Lee, Adam Stooke, Lerrel Pinto, Pieter Abbeel, Aravind Srinivas

3:33

16/11/2020

DIRL: Domain-Invariant Representation Learning for Sim-to-Real Transfer

Ajay Tanwani

5:07

06/12/2021

Debiased Visual Question Answering from Feature and Sample Perspectives

Zhiquan Wen, Guanghui Xu, Mingkui Tan, Qingyao Wu, Qi Wu

Keywords: vision

11:20

03/05/2021

What Makes Instance Discrimination Good for Transfer Learning?

Nanxuan Zhao, Zhirong Wu, Rynson W Lau, Stephen Lin

Keywords: Unsupervised Learning, Transfer Learning, Self-supervised Learning

5:10

06/12/2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Weili Nie, Zhiding Yu, Lei Mao, Ankit Patel, Yuke Zhu, Anima Anandkumar

3:23

14/06/2020

On the General Value of Evidence, and Bilingual Scene-Text Visual Question Answering

Xinyu Wang, Yuliang Liu, Chunhua Shen, Chun Chet Ng, Canjie Luo, Lianwen Jin, Chee Seng Chan, Anton van den Hengel, Liangwei Wang

Keywords: visual question answering, scene text, ocr

1:01

02/02/2021

Learning to Rationalize for Nonmonotonic Reasoning with Distant Supervision

Faeze Brahman, Vered Shwartz, Rachel Rudinger, Yejin Choi

18:33

26/04/2020

Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning

Kimin Lee, Kibok Lee, Jinwoo Shin, Honglak Lee

Keywords: Deep reinforcement learning, Generalization in visual domains

5:03

02/02/2021

A Continual Learning Framework for Uncertainty-Aware Interactive Image Segmentation

Ervine Zheng, Qi Yu, Rui Li, Pengcheng Shi, Anne Haake

14:21

03/05/2021

NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation

Angtian Wang, Adam Kortylewski, Alan Yuille

Keywords: Contrastive Learning, Render-and-Compare, Robust Deep Learning, Pose Estimation

5:08

14/06/2020

Iteratively-Refined Interactive 3D Medical Image Segmentation With Multi-Agent Reinforcement Learning

Xuan Liao, Wenhao Li, Qisen Xu, Xiangfeng Wang, Bo Jin, Xiaoyun Zhang, Yanfeng Wang, Ya Zhang

Keywords: medical image segmentation, interactive image segmentation, reinforcement learning

1:00

14/06/2020

Domain Decluttering: Simplifying Images to Mitigate Synthetic-Real Domain Shift and Improve Depth Estimation

Yunhan Zhao, Shu Kong, Daeyun Shin, Charless Fowlkes

Keywords: monocular depth prediction, real-synthetic domain shift, synthetic training data, domain adaptation, image inpainting, high-level domain gaps

1:01

16/11/2020

Does my multimodal model learn cross-modal interactions? It's harder to tell than you might think!

Jack Hessel, Lillian Lee

Keywords: modeling interactions, multimodal tasks, visual answering, multimodal learning

12:02

14/06/2020

Instance-Aware Image Colorization

Jheng-Wei Su, Hung-Kuo Chu, Jia-Bin Huang

Keywords: colorization, instance-aware, deep learning, computer vision

1:01

05/01/2021

Breaking Shortcuts by Masking for Robust Visual Reasoning

Keren Ye, Mingda Zhang, Adriana Kovashka

5:01

02/02/2021

Explainable Models with Consistent Interpretations

Vipin Pillai, Hamed Pirsiavash

16:20

16/11/2020

Multi-document Summarization with Maximal Marginal Relevance-guided Reinforcement Learning

Yuning Mao, Yanru Qu, Yiqing Xie, Xiang Ren, Jiawei Han

Keywords: single-document summarization, single-document sds, multi-document summarization, multi-document mds

10:58

14/06/2020

Counterfactual Vision and Language Learning

Ehsan Abbasnejad, Damien Teney, Amin Parvaneh, Javen Shi, Anton van den Hengel

Keywords: counterfactual reasoning vision and language tasks vqa

5:00

19/08/2021

Information Bottleneck Approach to Spatial Attention Learning

Qiuxia Lai, Yu Li, Ailing Zeng, Minhao Liu, Hanqiu Sun, Qiang Xu

Keywords: Computer Vision, 2D and 3D Computer Vision, Classification, Deep Learning

14:42

22/11/2021

Duplicate Latent Representation Suppression for Multi-object Variational Autoencoders

Li Nanbo, Robert B Fisher

Keywords: object-centric representation learning, variational autoencoders, scene representation

2:58

06/12/2021

Few-Shot Segmentation via Cycle-Consistent Transformer

Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei

Keywords: transformers, vision, few shot learning

11:58

06/12/2021

SubTab: Subsetting Features of Tabular Data for Self-Supervised Representation Learning

Talip Ucar, Ehsan Hajiramezanali, Lindsay Edwards

Keywords: self-supervised learning, contrastive learning, representation learning

13:28

14/06/2020

Achieving Robustness in the Wild via Adversarial Mixing With Disentangled Representations

Sven Gowal, Chongli Qin, Po-Sen Huang, Taylan Cemgil, Krishnamurthy Dvijotham, Timothy Mann, Pushmeet Kohli

Keywords: real-world robustness, adversarial examples, disentangled latents, generative models, spurious correlations

1:01

14/06/2020

Transformation GAN for Unsupervised Image Synthesis and Representation Learning

Jiayu Wang, Wengang Zhou, Guo-Jun Qi, Zhongqian Fu, Qi Tian, Houqiang Li

Keywords: gan, unsupervised learning, representation learning

1:00

04/07/2020

What is Learned in Visually Grounded Neural Syntax Acquisition

Noriyuki Kojima, Hadar Averbuch-Elor, Alexander Rush, Yoav Artzi

Keywords: Visually Acquisition, bootstrap models, blackbox models, visual components

6:41