Attention-Based Context Aware Reasoning for Situation Recognition

14/06/2020

Thilini Cooray, Ngai-Man Cheung, Wei Lu

Keywords: situation recognition, visual semantic role labelling, scene understanding, vision and language, action recognition

Abstract: Situation Recognition (SR) is a fine-grained action recognition task in which the model is expected not only to predict the salient action in the image, but also to predict the values of all semantic roles associated with that action. Predicting semantic roles is very challenging because a vast variety of possibilities can match a semantic role. Existing work has focused on dependency modelling architectures to solve this issue. Inspired by the success of query-based visual reasoning (e.g., Visual Question Answering), we propose to address semantic role prediction as a query-based visual reasoning problem. However, existing query-based reasoning methods have not considered the handling of inter-dependent queries, which is a unique requirement of semantic role prediction in SR. Therefore, we propose what is, to the best of our knowledge, the first set of methods to address inter-dependent queries in query-based visual reasoning. Extensive experiments demonstrate the effectiveness of our proposed method, which achieves outstanding performance on the Situation Recognition task. Furthermore, by leveraging query inter-dependency, our methods improve upon a state-of-the-art method that answers queries separately. Our code: https://github.com/thilinicooray/context-aware-reasoning-for-sr

The talk and the corresponding paper were published at the CVPR 2020 virtual conference.

Similar Papers

22/11/2021

OODformer: Out-Of-Distribution Detection Transformer

Rajat Koner, Poulami Sinhamahapatra, Karsten Roscher, Stephan Günnemann, Volker Tresp

Keywords: Out-Of-Distribution Detection, Vision Transformer, Representation Learning

3:19

14/06/2020

Towards Inheritable Models for Open-Set Domain Adaptation

Jogendra Nath Kundu, Naveen Venkat, Ambareesh Revanur, Rahul M V, R. Venkatesh Babu

Keywords: transfer learning, domain adaptation, unsupervised learning, open set recognition, data privacy, hypothesis transfer, entropy minimization

4:56

14/06/2020

Hierarchical Human Parsing With Typed Part-Relation Reasoning

Wenguan Wang, Hailong Zhu, Jifeng Dai, Yanwei Pang, Jianbing Shen, Ling Shao

Keywords: human parsing, part-relation modeling, graph neural network

0:56

19/08/2021

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Yuxuan Han, Jiaolong Yang, Ying Fu

Keywords: Computer Vision, 2D and 3D Computer Vision, Explainable/Interpretable Machine Learning

12:51

07/09/2020

Attribute-Guided Image Generation from Layout

Ke Ma, Bo Zhao, Leonid Sigal

Keywords: conditional image generation, GAN

9:41

05/01/2021

Interpretable and Trustworthy Deepfake Detection via Dynamic Prototypes

Loc Trinh, Michael Tsang, Sirisha Rambhatla, Yan Liu

5:00

19/04/2021

‘just because you are right, doesn’t mean I am wrong’: Overcoming a bottleneck in development and evaluation of open-ended VQA tasks

Man Luo, Shailaja Keyur Sampat, Riley Tallman, Yankai Zeng, Manuha Vancha, Akarshan Sajja, Chitta Baral

7:10

06/12/2021

Few-Shot Segmentation via Cycle-Consistent Transformer

Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei

Keywords: transformers, vision, few shot learning

11:58

26/04/2020

Neural Outlier Rejection for Self-Supervised Keypoint Learning

Jiexiong Tang, Hanme Kim, Vitor Guizilini, Sudeep Pillai, Rares Ambrus

Keywords: Self-Supervised Learning, Keypoint Detection, Outlier Rejection, Deep Learning

4:55

14/06/2020

Vec2Face: Unveil Human Faces From Their Blackbox Features in Face Recognition

Chi Nhan Duong, Thanh-Dat Truong, Khoa Luu, Kha Gia Quach, Hung Bui, Kaushik Roy

Keywords: generative models, bijective metric learning, blackbox face matcher, distillation framework, face synthesis, id preservation, feature-conditional structure, feature reconstruction, dibigan

5:03

04/07/2020

Cross-Modality Relevance for Reasoning on Language and Vision

Chen Zheng, Quan Guo, Parisa Kordjamshidi

Keywords: Cross-Modality Relevance, Language Vision, visual answering, VQA

10:59

02/02/2021

Adversarial Pose Regression Network for Pose-Invariant Face Recognitions

Pengyu Li, Biao Wang, Lei Zhang

15:17

14/06/2020

Vision-Language Navigation With Self-Supervised Auxiliary Reasoning Tasks

Fengda Zhu, Yi Zhu, Xiaojun Chang, Xiaodan Liang

Keywords: computer vision, vision language navigation, reinforcement learning

4:25

05/01/2021

TranstextNet: Transducing Text for Recognizing Unseen Visual Relationships

Gal S. Kenigsfield, Ran El-Yaniv

5:00

30/11/2020

MLIFeat: Multi-level information fusion based deep local features

Yuyang Zhang (Institute of Automation, Chinese Academy of Sciences; University of Chinese Academy of Sciences), Jinge Wang, Shibiao Xu, Xiao Liu, Xiaopeng Zhang

5:28

02/02/2021

Semantic Consistency Networks for 3D Object Detection

Wenwen Wei, Ping Wei, Nanning Zheng

14:06

14/06/2020

A Programmatic and Semantic Approach to Explaining and Debugging Neural Network Based Object Detectors

Edward Kim, Divya Gopinath, Corina Păsăreanu, Sanjit A. Seshia

Keywords: population-level explanation, testing, perception, neural network, blackbox, scenario, object detection, machine learning, autonomous driving

4:58

22/11/2021

Revisiting spatio-temporal layouts for compositional action recognition

Gorjan Radevski, Marie-Francine Moens, Tinne Tuytelaars

Keywords: compositional action recognition, video understanding, something-something, action genome, charades, video transformer, multimodal fusion, spatial reasoning, spatio-temporal action recognition, revisiting spatio-temporal layouts

9:58

06/12/2020

Learning Deep Attribution Priors Based On Prior Knowledge

Ethan Weinberger, Joe Janizek, Su-In Lee

4:20

05/01/2021

Auto-Navigator: Decoupled Neural Architecture Search for Visual Navigation

Tianqi Tang, Xin Yu, Xuanyi Dong, Yi Yang

4:39

30/11/2020

Visualizing Color-wise Saliency of Black-Box Image Classification Models

Yuhki Hatakeyama, Hiroki Sakuma, Yoshinori Konishi, Kohei Suenaga

9:43

03/05/2021

Prototypical Representation Learning for Relation Extraction

Ning Ding, Xiaobin Wang, Yao Fu, Guangwei Xu, Rui Wang, Pengjun Xie, Ying Shen, Fei Huang, Hai-Tao Zheng, Rui Zhang

Keywords: NLP, Representation Learning, Relation Extraction

5:14

30/11/2020

Second Order enhanced Multi-glimpse Attention in Visual Question Answering

Qiang Sun, Binghui Xie, Yanwei Fu

7:20

03/05/2021

Contrastive Syn-to-Real Generalization

Wuyang Chen, Zhiding Yu, Shalini De Mello, Sifei Liu, Jose M. Alvarez, Zhangyang Wang, Anima Anandkumar

Keywords: domain generalization, synthetic-to-real generalization

4:56

14/06/2020

Graph-Structured Referring Expression Reasoning in the Wild

Sibei Yang, Guanbin Li, Yizhou Yu

Keywords: graph-structured reasoning, ref-reasoning dataset, referring expression reasoning, scene graph, neural module, visual grounding, grounding referring expressions

4:58

18/07/2021

A Bit More Bayesian: Domain-Invariant Learning with Uncertainty

Zehao Xiao, Jiayi Shen, Xiantong Zhen, Ling Shao, Cees Snoek

Keywords: Algorithms, Model Selection and Structure Learning, Applications, Computational Biology and Bioinformatics; Applications, Health; Deep Learning, Adversarial Networks; Theory, Deep Learning, Bayesian Deep Learning

5:46

14/06/2020

Exploring Categorical Regularization for Domain Adaptive Object Detection

Chang-Dong Xu, Xing-Ran Zhao, Xin Jin, Xiu-Shen Wei

Keywords: domain adaptive object detection, image-level categorical regularization, categorical consistency regularization, domain adaptive faster r-cnn

1:00

08/12/2020

Probing Multimodal Embeddings for Linguistic Properties: the Visual-Semantic Case

Adam Dahlgren Lindström, Johanna Björklund, Suna Bensch, Frank Drewes

14:20

19/08/2021

Context-Aware Image Inpainting with Learned Semantic Priors

Wendong Zhang, Junwei Zhu, Ying Tai, Yunbo Wang, Wenqing Chu, Bingbing Ni, Chengjie Wang, Xiaokang Yang

Keywords: Computer Vision, 2D and 3D Computer Vision, Deep Learning

13:26

06/12/2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen, Xiaoyan Sun, Houqiang Li

10:51

14/06/2020

Syntax-Aware Action Targeting for Video Captioning

Qi Zheng, Chaoyue Wang, Dacheng Tao

Keywords: video and language, video captioning, action predicting

1:01

14/06/2020

Cascaded Human-Object Interaction Recognition

Tianfei Zhou, Wenguan Wang, Siyuan Qi, Haibin Ling, Jianbing Shen

Keywords: human-object interaction recognition, cascade reasoning, fine-grained relation segmentation

1:01

30/11/2020

Rotation Axis Focused Attention Network (RAFA-Net) for Estimating Head Pose

Ardhendu Behera, Zachary Wharton, Pradeep Hewage, Swagat Kumar

10:19

14/06/2020

Referring Image Segmentation via Cross-Modal Progressive Comprehension

Shaofei Huang, Tianrui Hui, Si Liu, Guanbin Li, Yunchao Wei, Jizhong Han, Luoqi Liu, Bo Li

Keywords: referring segmentation, progressive comprehension, cross-modal, entity perception, relation-aware reasoning

1:01

03/05/2021

Bayesian Context Aggregation for Neural Processes

Michael Volpp, Fabian Flürenbrock, Lukas Grossberger, Christian Daniel, Gerhard Neumann

Keywords: Neural Processes, Multi-task Learning, Deep Sets, Meta Learning, Latent Variable Models, Aggregation Methods

5:04

30/11/2020

3D Guided Weakly Supervised Semantic Segmentation

Weixuan Sun, Jing Zhang, Nick Barnes

9:20

14/06/2020

Image Search With Text Feedback by Visiolinguistic Attention Learning

Yanbei Chen, Shaogang Gong, Loris Bazzani

Keywords: vision and language, image search, text feedback, attention mechanism, transformer, multimodal learning, representation learning, composition, image retrieval, interactive image search

1:00

05/01/2021

Two-Level Adversarial Visual-Semantic Coupling for Generalized Zero-Shot Learning

Shivam Chandhok, Vineeth N Balasubramanian

4:59

23/08/2020

Adversarial infidelity learning for model interpretation

Jian Liang, Bing Bai, Yuren Cao, Kun Bai, Fei Wang

Keywords: infidelity, model interpretation, adversarial learning, black-box explanations

5:34

04/07/2020

Generating Hierarchical Explanations on Text Classification via Feature Interaction Detection

Hanjie Chen, Guangtao Zheng, Yangfeng Ji

Keywords: Text Classification, Generating explanations, natural language processing, model prediction

11:47