Grounded Situation Recognition with Transformers

22/11/2021

Junhyeong Cho, Youngseok Yoon, Hyeonjun Lee, Suha Kwak

Keywords: grounded situation recognition, situation recognition, transformers, scene understanding

Abstract: Grounded Situation Recognition (GSR) is the task of not only classifying a salient action (verb), but also predicting the entities (nouns) associated with semantic roles and their locations in the given image. Inspired by the remarkable success of Transformers in vision tasks, we propose a GSR model based on a Transformer encoder-decoder architecture. The attention mechanism of our model enables accurate verb classification by effectively capturing high-level semantic features of an image, and allows the model to flexibly deal with the complicated and image-dependent relations between entities for improved noun classification and localization. Our model is the first Transformer architecture for GSR, and achieves the state of the art in every evaluation metric on the SWiG benchmark. Our code is available at https://github.com/jhcho99/gsrtr.
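
As a rough illustration of the task described in the abstract, the sketch below shows one way a GSR prediction (a verb, plus a noun and a location for each semantic role) could be represented. It is not taken from the authors' code: the class names, the role names, the example values, the (x_min, y_min, x_max, y_max) box convention, and the use of None for an entity without a box are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Optional, Tuple

# Assumed box convention: (x_min, y_min, x_max, y_max) in pixels.
Box = Tuple[float, float, float, float]


@dataclass
class RolePrediction:
    """Hypothetical container for one semantic role's prediction."""
    noun: str            # entity predicted for the role
    box: Optional[Box]   # assumed: None when no location is predicted


@dataclass
class Situation:
    """Hypothetical container for a full GSR output on one image."""
    verb: str                         # salient action
    roles: Dict[str, RolePrediction]  # semantic role -> noun and location


def describe(situation: Situation) -> str:
    """Render a prediction as a single readable line."""
    parts = [f"verb={situation.verb}"]
    for role, pred in situation.roles.items():
        where = "ungrounded" if pred.box is None else f"box={pred.box}"
        parts.append(f"{role}={pred.noun} ({where})")
    return "; ".join(parts)


# Made-up example values, not results from the paper.
example = Situation(
    verb="feeding",
    roles={
        "agent": RolePrediction("person", (12.0, 30.0, 140.0, 220.0)),
        "food": RolePrediction("bread", None),
    },
)
print(describe(example))
```

Keeping the verb separate from the per-role predictions mirrors the split the abstract draws between verb classification and noun classification with localization.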

The talk and the corresponding paper were published at the BMVC 2021 virtual conference.

Similar Papers

26/04/2020

VL-BERT: Pre-training of Generic Visual-Linguistic Representations

Weijie Su, Xizhou Zhu, Yue Cao, Bin Li, Lewei Lu, Furu Wei, Jifeng Dai

Keywords: Visual-Linguistic, Generic Representation, Pre-training

4:40

02/02/2021

Encoder-Decoder Based Unified Semantic Role Labeling with Label-Aware Syntax

Hao Fei, Fei Li, Bobo Li, Donghong Ji

16:10

16/11/2020

Variational Hierarchical Dialog Autoencoder for Dialog State Tracking Data Augmentation

Kang Min Yoo, Hanbit Lee, Franck Dernoncourt, Trung Bui, Walter Chang, Sang-goo Lee

Keywords: generative augmentation, nlp tasks, dialog tracking, dialog generation

5:34

06/12/2020

Heavy-tailed Representations, Text Polarity Classification & Data Augmentation

Hamid Jalalzai, Pierre Colombo, Chloé Clavel, Eric Gaussier, Giovanna Varni, Emmanuel Vignon, Anne Sabourin

2:57

14/06/2020

Graph-Structured Referring Expression Reasoning in the Wild

Sibei Yang, Guanbin Li, Yizhou Yu

Keywords: graph-structured reasoning, ref-reasoning dataset, referring expression reasoning, scene graph, neural module, visual grounding, grounding referring expressions

4:58

03/05/2021

Structured Prediction as Translation between Augmented Natural Languages

Giovanni Paolini, Ben Athiwaratkun, Jason Krone, Jie Ma, Alessandro Achille, Rishita Anubhai, Cicero Nogueira dos Santos, Bing Xiang, Stefano Soatto

Keywords: sequence to sequence, structured prediction, language models, transfer learning, few-shot learning, multi-task learning, generative modeling

12:16

14/06/2020

Image Search With Text Feedback by Visiolinguistic Attention Learning

Yanbei Chen, Shaogang Gong, Loris Bazzani

Keywords: vision and language, image search, text feedback, attention mechanism, transformer, multimodal learning, representation learning, composition, image retrieval, interactive image search

1:00

06/12/2021

ViTAE: Vision Transformer Advanced by Exploring Intrinsic Inductive Bias

Yufei Xu, Qiming Zhang, Jing Zhang, Dacheng Tao

Keywords: machine learning, transformers, vision

10:16

04/07/2020

NILE : Natural Language Inference with Faithful Natural Language Explanations

Sawan Kumar, Partha Talukdar

Keywords: Natural Inference, NLP tasks, internal making, NLI

10:55

16/11/2020

Top-Rank-Focused Adaptive Vote Collection for the Evaluation of Domain-Specific Semantic Models

Pierangelo Lombardo, Alessio Boiardi, Luca Colombo, Angelo Schiavone, Nicolò Tamagnone

Keywords: content-based recommenders, construction, top-rank evaluation, semantic models

12:03

06/12/2021

SOLQ: Segmenting Objects by Learning Queries

Bin Dong, Fangao Zeng, Tiancai Wang, Xiangyu Zhang, Yichen Wei

Keywords: machine learning, transformers

7:12

02/02/2021

Entity Structure Within and Throughout: Modeling Mention Dependencies for Document-Level Relation Extraction

Benfeng Xu, Quan Wang, Yajuan Lyu, Yong Zhu, Zhendong Mao

14:48

14/06/2020

Attention-Based Context Aware Reasoning for Situation Recognition

Thilini Cooray, Ngai-Man Cheung, Wei Lu

Keywords: situation recognition, visual semantic role labelling, scene understanding, vision and language, action recognition

1:00

26/04/2020

StructBERT: Incorporating Language Structures into Pre-training for Deep Language Understanding

Wei Wang, Bin Bi, Ming Yan, Chen Wu, Jiangnan Xia, Zuyi Bao, Liwei Peng, Luo Si

5:34

14/06/2020

Interpretable and Accurate Fine-grained Recognition via Region Grouping

Zixuan Huang, Yin Li

Keywords: interpretable deep model, fine-grained recognition, region-based recognition

4:58

16/11/2020

BERT-enhanced Relational Sentence Ordering Network

Baiyun Cui, Yingming Li, Zhongfei Zhang

Keywords: bert-enhanced network, brson, bert, coherence modeling

11:33

05/12/2020

Exploiting WordNet synset and hypernym representations for answer selection

Weikang Li, Yunfang Wu

7:04

06/12/2020

Knowledge Augmented Deep Neural Networks for Joint Facial Expression and Action Unit Recognition

Zijun Cui, Tengfei Song, Yuru Wang, Qiang Ji

3:19

06/12/2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen, Xiaoyan Sun, Houqiang Li

10:51

06/12/2021

Grammar-Based Grounded Lexicon Learning

Jiayuan Mao, Freda Shi, Jiajun Wu, Roger Levy, Josh Tenenbaum

Keywords: deep learning

13:41

19/04/2021

Attention-based relational graph convolutional network for target-oriented opinion words extraction

Junfeng Jiang, An Wang, Akiko Aizawa

8:40

06/12/2021

Probabilistic Attention for Interactive Segmentation

Prasad Gabbur, Manjot Bilkhu, Javier Movellan

Keywords: transformers, vision

13:20

04/07/2020

Decomposing Generalization: Models of Generic, Habitual and Episodic Statements

Venkata Subrahmanyan Govindarajan, Benjamin Van Durme, Aaron Steven White

Keywords: linguistic generalization, predicting generalization, expressions generalization, Decomposing Generalization

12:26

14/06/2020

Unsupervised Domain Adaptation With Hierarchical Gradient Synchronization

Lanqing Hu, Meina Kan, Shiguang Shan, Xilin Chen

Keywords: domain adaptation, distribution alignment gradient consistency, adversarial learning

1:00

05/01/2021

Mutual Information Maximization on Disentangled Representations for Differential Morph Detection

Sobhan Soleymani, Ali Dabouei, Fariborz Taherkhani, Jeremy Dawson, Nasser M. Nasrabadi

4:41

19/08/2021

Disentangled Face Attribute Editing via Instance-Aware Latent Space Search

Yuxuan Han, Jiaolong Yang, Ying Fu

Keywords: Computer Vision, 2D and 3D Computer Vision, Explainable/Interpretable Machine Learning

12:51

16/11/2020

Neural Mask Generator: Learning to Generate Adaptive Word Maskings for Language Model Adaptation

Minki Kang, Moonsu Han, Sung Ju Hwang

Keywords: self-supervised pre-training, question answering, task, reinforcement learning

12:00

16/11/2020

LUKE: Deep Contextualized Entity Representations with Entity-aware Self-attention

Ikuya Yamada, Akari Asai, Hiroyuki Shindo, Hideaki Takeda, Yuji Matsumoto

Keywords: natural tasks, pretraining task, transformer, entity-related tasks

11:15

16/11/2020

Unified Feature and Instance Based Domain Adaptation for Aspect-Based Sentiment Analysis

Chenggong Gong, Jianfei Yu, Rui Xia

Keywords: aspect-based analysis, absa task, feature-based adaptation, auxiliary tasks

12:12

04/07/2020

Contrastive Self-Supervised Learning for Commonsense Reasoning

Tassilo Klein, Moin Nabi

Keywords: Commonsense Reasoning, Pronoun problems, pronoun disambiguation, commonsense tasks

7:10

02/02/2021

Exploring Auxiliary Reasoning Tasks for Task-oriented Dialog Systems with Meta Cooperative Learning

Bowen Qin, Min Yang, Lidong Bing, Qingshan Jiang, Chengming Li, Ruifeng Xu

15:41

14/06/2020

Attention-Guided Hierarchical Structure Aggregation for Image Matting

Yu Qiao, Yuhao Liu, Xin Yang, Dongsheng Zhou, Mingliang Xu, Qiang Zhang, Xiaopeng Wei

Keywords: image matting, attention, hierarchical, aggregation, appearance cues

0:59

16/11/2020

Coreferential Reasoning Learning for Language Representation

Deming Ye, Yankai Lin, Jiaju Du, Zhenghao Liu, Peng Li, Maosong Sun, Zhiyuan Liu

Keywords: downstream tasks, coreferential reasoning, common tasks, language models

7:30

08/12/2020

BME-TUW at SR’20: Lexical grammar induction for surface realization

Gábor Recski, Ádám Kovács, Kinga Gémes, Judit Ács, Andras Kornai

15:32

02/02/2021

Self-Attention Attribution: Interpreting Information Interactions Inside Transformer

Yaru Hao, Li Dong, Furu Wei, Ke Xu

16:26

08/12/2020

IIE-NLP-NUT at SemEval-2020 Task 4: Guiding PLM with Prompt Template Reconstruction Strategy for ComVE

Luxi Xing, Yuqiang Xie, Yue Hu, Wei Peng

18:57

06/12/2020

HOI Analysis: Integrating and Decomposing Human-Object Interaction

Yong-Lu Li, Xinpeng Liu, Xiaoqian Wu, Yizhuo Li, Cewu Lu

Keywords: Deep Learning -> Generative Models

3:19

03/05/2021

InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective

Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu

Keywords: adversarial training, QA, NLI, BERT, information theory, adversarial robustness

5:21

14/06/2020

Embodied Language Grounding With 3D Visual Feature Representations

Mihir Prabhudesai, Hsiao-Yu Fish Tung, Syed Ashar Javed, Maximilian Sieb, Adam W. Harley, Katerina Fragkiadaki

Keywords: affordance in language, geometry-aware neural network, language grounding, 3d feature representations, grounding referential expression in images, instruction following, language-to-image generation

1:01

15/11/2020

Formulog: Datalog for SMT-Based Static Analysis

Aaron Bembenek, Michael Greenberg, Stephen Chong

Keywords: Datalog, SMT solving

15:05