RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection

19/08/2021

RR-Net: Injecting Interactive Semantics in Human-Object Interaction Detection

Dongming Yang, Yuexian Zou, Can Zhang, Meng Cao, Jie Chen

Keywords: Computer Vision, Recognition, Action Recognition, Video

Abstract Paper Similar Papers

Abstract: Human-Object Interaction (HOI) detection devotes to learn how humans interact with surrounding objects. Latest end-to-end HOI detectors are short of relation reasoning, which leads to inability to learn HOI-specific interactive semantics for predictions. In this paper, we therefore propose novel relation reasoning for HOI detection. We first present a progressive Relation-aware Frame, which brings a new structure and parameter sharing pattern for interaction inference. Upon the frame, an Interaction Intensifier Module and a Correlation Parsing Module are carefully designed, where: a) interactive semantics from humans can be exploited and passed to objects to intensify interactions, b) interactive correlations among humans, objects and interactions are integrated to promote predictions. Based on modules above, we construct an end-to-end trainable framework named Relation Reasoning Network (abbr. RR-Net). Extensive experiments show that our proposed RR-Net sets a new state-of-the-art on both V-COCO and HICO-DET benchmarks and improves the baseline about 5.5% and 9.8% relatively, validating that this first effort in exploring relation reasoning and integrating interactive semantics has brought obvious improvement for end-to-end HOI detection.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at IJCAI 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

Learning Human-Object Interaction Detection Using Interaction Points

Tiancai Wang, Tong Yang, Martin Danelljan and
Fahad Shahbaz Khan, Xiangyu Zhang, Jian Sun

Keywords Paper

human-object interaction, interaction point, interaction grouping, keypoint detection

0

0

0

0

0:58

14/06/2020

Cascaded Human-Object Interaction Recognition

Tianfei Zhou, Wenguan Wang, Siyuan Qi and
Haibin Ling, Jianbing Shen

Keywords Paper

human-object interaction recognition, cascade reasoning, fine-grained relation segmentation

0

0

0

0

1:01

06/12/2020

HOI Analysis: Integrating and Decomposing Human-Object Interaction

Yong-Lu Li, Xinpeng Liu, Xiaoqian Wu and
Yizhuo Li, Cewu Lu

Keywords Paper

, Deep Learning -> Generative Models

0

0

0

0

3:19

14/06/2020

A Programmatic and Semantic Approach to Explaining and Debugging Neural Network Based Object Detectors

Edward Kim, Divya Gopinath, Corina Păsăreanu, Sanjit A. Seshia

Keywords Paper

population-level explanation, testing, perception, neural network, blackbox, scenario, object detection, machine learning, autonomous driving

0

0

0

0

4:58

02/02/2021

Exploring Explainable Selection to Control Abstractive Summarization

Haonan Wang, Yang Gao, Yu Bai and
Mirella Lapata, Heyan Huang

Keywords Paper

0

0

0

0

18:50

14/06/2020

Hierarchical Human Parsing With Typed Part-Relation Reasoning

Wenguan Wang, Hailong Zhu, Jifeng Dai and
Yanwei Pang, Jianbing Shen, Ling Shao

Keywords Paper

human parsing, part-relation modeling, graph neural network

0

0

0

0

0:56

14/06/2020

Discovering Human Interactions With Novel Objects via Zero-Shot Learning

Suchen Wang, Kim-Hui Yap, Junsong Yuan, Yap-Peng Tan

Keywords Paper

human-object interaction detection, zero-shot learning, zero shot interaction, interacting object detection

0

0

0

0

1:01

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio and
R Devon Hjelm, Shikhar Sharma

Keywords Paper

0

0

0

0

17:44

06/12/2021

Explanation-based Data Augmentation for Image Classification

Sandareka Wickramanayake, Wynne Hsu, Mong Li Lee

Keywords Paper

deep learning, machine learning, vision, interpretability

0

0

0

0

14:23

06/12/2021

Learning to Generate Visual Questions with Noisy Supervision

Shen Kai, Lingfei Wu, Siliang Tang and
Yueting Zhuang, zhen he, Zhuoye Ding, Yun Xiao, Bo Long

Keywords Paper

generative model

0

0

0

0

14:54

02/02/2021

Exploiting Relationship for Complex-scene Image Generation

Tianyu Hua, Hongdong Zheng, Yalong Bai and
Wei Zhang, Xiao-Ping Zhang, Tao Mei

Keywords Paper

0

0

0

0

15:01

19/08/2021

A Description Logic for Analogical Reasoning

Steven Schockaert, Yazmin Ibanez-Garcia, Victor Gutierrez-Basulto

Keywords Paper

Knowledge Representation and Reasoning, Common-Sense Reasoning, Description Logics and Ontologies

0

0

0

0

12:47

16/11/2020

Birds have four legs?! NumerSense: Probing Numerical Commonsense Knowledge of Pre-Trained Language Models

Bill Yuchen Lin, Seyeon Lee, Rahul Khanna, Xiang Ren

Keywords Paper

probing task, fine-tuning, testing, pre-trained models

0

0

0

0

6:56

16/11/2020

Towards Interpretable Reasoning over Paragraph Effects in Situation

Mucheng Ren, Xiubo Geng, Tao Qin and
Heyan Huang, Daxin Jiang

Keywords Paper

reasoning process, sequential approach, neural modules, reasoning modules

0

0

0

0

10:30

12/07/2020

Probing Emergent Semantics in Predictive Agents via Question Answering

Abhishek Das, Federico Carnevale, Hamza Merzic and
Laura Rimell, Rosalia Schneider, Josh Abramson, Alden Hung, Arun Ahuja, Stephen Clark, Greg Wayne, Feilx Hill

Keywords Paper

Applications - Neuroscience, Cognitive Science, Biology and Health

0

0

0

0

12:31

07/09/2020

Mixup-CAM: Weakly-supervised Semantic Segmentation via Uncertainty Regularization

Yu-Ting Chang, Qiaosong Wang, Wei-Chih Hung and
Robinson Piramuthu, Yi-Hsuan Tsai, Ming-Hsuan Yang

Keywords Paper

semantic segmentation, weakly-supervised learning, class activatin map, mixup augmentation, entropy regularization

0

0

0

0

8:22

02/02/2021

DIRV: Dense Interaction Region Voting for End-to-End Human-Object Interaction Detection

Hao-Shu Fang, Yichen Xie, Dian Shao, Cewu Lu

Keywords Paper

0

0

0

0

5:11

02/02/2021

Interpretable NLG for Task-oriented Dialogue Systems with Heterogeneous Rendering Machines

Yangming Li, Kaisheng Yao

Keywords Paper

0

0

0

0

17:05

04/07/2020

ERASER: A Benchmark to Evaluate Rationalized NLP Models

Jay DeYoung, Sarthak Jain, Nazneen Fatema Rajani and
Eric Lehman, Caiming Xiong, Richard Socher, Byron C. Wallace

Keywords Paper

NLP, Evaluating Reasoning, ERASER, Rationalized Models

0

0

0

0

9:04

22/11/2021

Human-object Interaction Detection without Alignment Supervision

Mert Kilickaya, Arnold W.M. Smeulders

Keywords Paper

human-object interactions, visual relationship detection, weakly supervised learning, visual transformers

0

0

0

0

2:55

14/06/2020

BANet: Bidirectional Aggregation Network With Occlusion Handling for Panoptic Segmentation

Yifeng Chen, Guangchen Lin, Songyuan Li and
Omar Bourahla, Yiming Wu, Fangfang Wang, Junyi Feng, Mingliang Xu, Xi Li

Keywords Paper

panoptic segmentation, instance segmentation, semantic segmentation, occlusion handling, patch-recovering operator

0

0

0

0

5:00

05/01/2021

Proposal Learning for Semi-Supervised Object Detection

Peng Tang, Chetan Ramaiah, Yan Wang and
Ran Xu, Caiming Xiong

Keywords Paper

0

0

0

0

4:51

02/02/2021

Inference Fusion with Associative Semantics for Unseen Object Detection

Yanan Li, Pengyang Li, Han Cui, Donghui Wang

Keywords Paper

0

0

0

0

14:57

06/12/2021

Collaborative Uncertainty in Multi-Agent Trajectory Forecasting

Bohan Tang, Yiqi Zhong, Ulrich Neumann and
Gang Wang, Siheng Chen, Ya Zhang

Keywords Paper

deep learning

0

0

0

0

7:15

03/05/2021

Explaining the Efficacy of Counterfactually Augmented Data

Divyansh Kaushik, Amrith Setlur, Eduard H Hovy, Zachary Lipton

Keywords Paper

sentiment analysis, text classification, natural language inference, annotation artifacts, humans in the loop

0

0

0

0

5:11

19/08/2021

A Structure Self-Aware Model for Discourse Parsing on Multi-Party Dialogues

Ante Wang, Linfeng Song, Hui Jiang and
Shaopeng Lai, Junfeng Yao, Min Zhang, Jinsong Su

Keywords Paper

Natural Language Processing, Dialogue, Discourse, Tagging, Chunking, and Parsing

0

0

0

0

8:33

22/11/2021

Towards Dynamic and Scalable Active Learning with Neural Architecture Adaption for Object Detection

Fuhui Tang, ChenHan Jiang, Dafeng Wei and
Hang Xu, Andi Zhang, Wei Zhang, Hongtao Lu, Chunjing Xu

Keywords Paper

active learning, neural architecture adaption, object detection, dirichlet calibration, clustering sampling, network morphism modifications, uncertainty, dimension reduction, sample diversity, swap-expand strategy

0

0

0

0

2:40

14/06/2020

Attention-Based Context Aware Reasoning for Situation Recognition

Thilini Cooray, Ngai-Man Cheung, Wei Lu

Keywords Paper

situation recognition, visual semantic role labelling, scene understanding, vision and language, action recognition

0

0

0

0

1:00

30/11/2020

CLASS: Cross-Level Attention and Supervision for Salient Objects Detection

Lv Tang, Bo Li

Keywords Paper

0

0

0

0

7:04

04/07/2020

Neural Topic Modeling with Bidirectional Adversarial Training

Rui Wang, Xuemeng Hu, Deyu Zhou and
Yulan He, Yuxuan Xiong, Chenchen Ye, Haiyang Xu

Keywords Paper

automatic extraction, model inference, neural modeling, topic inference

0

0

0

0

11:17

25/07/2020

HME: A hyperbolic metric embedding approach for next-POI recommendation

Shanshan Feng, Lucas Vinh Tran, Gao Cong and
Lisi Chen, Jing Li, Fan Li

Keywords Paper

metric embedding, hyperbolic space, next-poi recommendation

0

0

0

0

17:18

23/08/2020

Adversarial infidelity learning for model interpretation

Jian Liang, Bing Bai, Yuren Cao and
Kun Bai, Fei Wang

Keywords Paper

infidelity, model interpretation, adversarial learning, black-box explanations

0

0

0

0

5:34

06/12/2020

Bongard-LOGO: A New Benchmark for Human-Level Concept Learning and Reasoning

Weili Nie, Zhiding Yu, Lei Mao and
Ankit Patel, Yuke Zhu, Anima Anandkumar

Keywords Paper

0

0

0

0

3:23

30/11/2020

Unsupervised Domain Adaptive Object Detection using Forward-Backward Cyclic Adaptation

Siqi Yang, Lin Wu, Arnold Wiliem, Brian C. Lovell

Keywords Paper

0

0

0

0

6:32

05/12/2020

Point-of-interest oriented question answering with joint inference of semantic matching and distance correlation

Yifei Yuan, Jingbo Zhou, Wai Lam

Keywords Paper

0

0

0

0

13:14

02/02/2021

DecAug: Augmenting HOI Detection via Decomposition

Hao-Shu Fang, Yichen Xie, Dian Shao and
Yong-Lu Li, Cewu Lu

Keywords Paper

0

0

0

0

9:02

06/12/2021

Few-Shot Segmentation via Cycle-Consistent Transformer

Gengwei Zhang, Guoliang Kang, Yi Yang, Yunchao Wei

Keywords Paper

transformers, vision, few shot learning

0

0

0

0

11:58

02/02/2021

Generative Partial Visual-Tactile Fused Object Clustering

Tao Zhang, Yang Cong, Gan Sun and
Jiahua Dong, Yuyang Liu, Zhengming Ding

Keywords Paper

0

0

0

0

15:49

05/01/2021

TranstextNet: Transducing Text for Recognizing Unseen Visual Relationships

Gal S. Kenigsfield, Ran El-Yaniv

Keywords Paper

0

0

0

0

5:00

14/06/2020

PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection

Shaoshuai Shi, Chaoxu Guo, Li Jiang and
Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li

Keywords Paper

3d object detection, point cloud, 3d scene understanding, lidar, autonomous driving, kitti dataset, waymo open dataset

0

0

0

0

1:01