Visual-Semantic Matching by Exploring High-Order Attention and Distraction

14/06/2020

Yongzhi Li, Duo Zhang, Yadong Mu

Keywords: visual semantic matching, cross modal retrieval, scene graph, visual distraction, graph matching, gcn

Abstract: Cross-modality semantic matching is a vital task in computer vision and has attracted increasing attention in recent years. Existing methods mainly explore object-based alignment between image objects and text words. In this work, we address this task from two previously ignored aspects: high-order semantic information (e.g., object-predicate-subject triplets and object-attribute pairs) and visual distraction (i.e., despite being highly relevant to the textual query, images may also contain many prominent distracting objects or visual relations). Specifically, we build scene graphs for both the visual and textual modalities. Our technical contributions are twofold. First, we formulate the visual-semantic matching task as an attention-driven cross-modality scene graph matching problem. Graph convolutional networks (GCNs) are used to extract high-order information from the two scene graphs. A novel cross-graph attention mechanism is proposed to contextually reweigh graph elements and calculate the inter-graph similarity. Second, some top-ranked samples are in fact false matches due to the co-occurrence of both highly relevant and distracting information. We devise an information-theoretic measure for estimating semantic distraction and re-ranking the initial retrieval results. Comprehensive experiments and ablation studies on two large public datasets (MS-COCO and Flickr30K) demonstrate the superiority of the proposed method and the effectiveness of both high-order attention and distraction.
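
To make the idea of attention-based inter-graph similarity concrete, here is a minimal sketch in plain Python. It is not the authors' formulation: it assumes each scene-graph node has already been embedded as a fixed-length vector (for example by a GCN), and it uses a generic softmax cross-attention over cosine similarities. The function names (`cosine`, `softmax`, `cross_graph_similarity`) and the `temperature` parameter are hypothetical choices made for illustration only.

```python
import math


def cosine(u, v):
    """Cosine similarity of two equal-length vectors; 0.0 if either is all zeros."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u > 0 and norm_v > 0 else 0.0


def softmax(scores, temperature=1.0):
    """Numerically stable softmax; a lower temperature gives sharper weights."""
    top = max(scores)
    exps = [math.exp((s - top) / temperature) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_graph_similarity(text_nodes, image_nodes, temperature=0.1):
    """Generic cross-attention similarity between two sets of node embeddings.

    Assumption: this is an illustrative stand-in, not the mechanism from the paper.
    Each text node attends over all image nodes, and the graph-level score is the
    mean cosine similarity between each text node and its attended image summary.
    """
    per_node = []
    for t in text_nodes:
        # Attention weights of this text node over every image node.
        weights = softmax([cosine(t, v) for v in image_nodes], temperature)
        # Attention-weighted average of the image node embeddings.
        attended = [
            sum(w * v[d] for w, v in zip(weights, image_nodes))
            for d in range(len(t))
        ]
        per_node.append(cosine(t, attended))
    return sum(per_node) / len(per_node)


# Toy embeddings (hypothetical values): one image graph matches the text, one does not.
text_graph = [[1.0, 0.0], [0.0, 1.0]]
matching_image = [[1.0, 0.0], [0.0, 1.0]]
mismatched_image = [[-1.0, 0.0], [0.0, -1.0]]

print(cross_graph_similarity(text_graph, matching_image))
print(cross_graph_similarity(text_graph, mismatched_image))
```

In this toy setting the matching image graph scores close to 1 and the mismatched one close to 0, so ranking images by this score would place the matching image first. The distraction-based re-ranking step mentioned in the abstract is not modeled here.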

The talk and the respective paper were published at the CVPR 2020 virtual conference.

Similar Papers

02/02/2021

Multi-scale Graph Fusion for Co-saliency Detection

Rongyao Hu, Zhenyun Deng, Xiaofeng Zhu

16:52

14/06/2020

Graph Structured Network for Image-Text Matching

Chunxiao Liu, Zhendong Mao, Tianzhu Zhang, Hongtao Xie, Bin Wang, Yongdong Zhang

Keywords: image-text matching, graph network, cross-modal, fine-grained correspondence, visual-semantic

1:01

18/07/2021

Two Heads are Better Than One: Hypergraph-Enhanced Graph Reasoning for Visual Event Ratiocination

Wenbo Zheng, Lan Yan, Chao Gou, Fei-Yue Wang

Keywords: Applications, Computer Vision

5:14

02/02/2021

Deep Graph-neighbor Coherence Preserving Network for Unsupervised Cross-modal Hashing

Jun Yu, Hao Zhou, Yibing Zhan, Dacheng Tao

14:01

02/02/2021

Scene Graph Embeddings Using Relative Similarity Supervision

Paridhi Maheshwari, Ritwick Chaudhry, Vishwa Vinay

18:53

02/02/2021

Similarity Reasoning and Filtration for Image-Text Matching

Haiwen Diao, Ying Zhang, Lin Ma, Huchuan Lu

16:34

02/02/2021

Dual Adversarial Graph Neural Networks for Multi-label Cross-modal Retrieval

Shengsheng Qian, Dizhan Xue, Huaiwen Zhang, Quan Fang, Changsheng Xu

15:24

02/02/2021

Deep Metric Learning with Graph Consistency

Binghui Chen, Pengyu Li, Zhaoyi Yan, Biao Wang, Lei Zhang

14:36

22/11/2021

BI-GCN: Boundary-Aware Input-Dependent Graph Convolution Network for Biomedical Image Segmentation

Yanda Meng, Hongrun Zhang, Dongxu Gao, Yitian Zhao, Xiaoyun Yang, Xuesheng Qian, Xiaowei Huang, Yalin Zheng

Keywords: Medical Image Segmentation, Graph Convolution Network

7:43

02/02/2021

GraphMSE: Efficient Meta-path Selection in Semantically Aligned Feature Space for Graph Neural Networks

Yi Li, Yilun Jin, Guojie Song, Zihao Zhu, Chuan Shi, Yiming Wang

15:25

26/04/2020

Adaptive Structural Fingerprints for Graph Attention Networks

Kai Zhang, Yaokang Zhu, Jun Wang, Jie Zhang

Keywords: Graph attention networks, graph neural networks, node classification

3:47

19/08/2021

CogTree: Cognition Tree Loss for Unbiased Scene Graph Generation

Jing Yu, Yuan Chai, Yujing Wang, Yue Hu, Qi Wu

Keywords: Computer Vision, Language and Vision

15:08

12/07/2020

Graph Optimal Transport for Cross-Domain Alignment

Liqun Chen, Zhe Gan, Yu Cheng, Linjie Li, Lawrence Carin, Jingjing Liu

Keywords: Deep Learning - Algorithms

12:20

04/07/2020

Bridging the Structural Gap Between Encoding and Decoding for Data-To-Text Generation

Chao Zhao, Marilyn Walker, Snigdha Chaturvedi

Keywords: Data-To-Text Generation, faithful generation, Encoding, Decoding

12:13

15/06/2020

Fast graph simplification for interleaved dyck-reachability

Yuanbo Li, Qirun Zhang, Thomas Reps

Keywords: Static Analysis, CFL-Reachability

16:46

12/07/2020

Progressive Graph Learning for Open-Set Domain Adaptation

Yadan Luo, Zijian Wang, Zi Huang, Mahsa Baktashmotlagh

Keywords: Applications - Computer Vision

15:40

13/04/2021

Hyperbolic graph embedding with enhanced semi-implicit variational inference.

Ali Lotfi Rezaabad, Rahi Kalantari, Sriram Vishwanath, Mingyuan Zhou, Jonathan Tamir

3:09

14/06/2020

Hierarchical Graph Attention Network for Visual Relationship Detection

Li Mi, Zhenzhong Chen

Keywords: visual relationship detection, graph attention network, prior knowledge, attention mechanism

0:58

02/02/2021

Exploiting Relationship for Complex-scene Image Generation

Tianyu Hua, Hongdong Zheng, Yalong Bai, Wei Zhang, Xiao-Ping Zhang, Tao Mei

15:01

14/06/2020

Auto-Encoding Twin-Bottleneck Hashing

Yuming Shen, Jie Qin, Jiaxin Chen, Mengyang Yu, Li Liu, Fan Zhu, Fumin Shen, Ling Shao

Keywords: image hashing, data retrieval, unsupervised learning, graph neural networks

1:00

07/09/2020

Advancing weakly supervised cross-domain alignment with optimal transport

Siyang Yuan, Ke Bai, Liqun Chen, Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin Duke

Keywords: Optimal Transport, Cross Domain Alignment

10:04

14/06/2020

Composing Good Shots by Exploiting Mutual Relations

Debang Li, Junge Zhang, Kaiqi Huang, Ming-Hsuan Yang

Keywords: image composition, good views, graph neural network, relation mining

1:01

02/02/2021

Dual-level Collaborative Transformer for Image Captioning

Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun, Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji

14:58

14/06/2020

Self-Learning With Rectification Strategy for Human Parsing

Tao Li, Zhiyuan Liang, Sanyuan Zhao, Jiahao Gong, Jianbing Shen

Keywords: human parsing, semi-supervised learning, graph reasoning, self-learning, pseudo-labels

1:01

05/01/2021

TranstextNet: Transducing Text for Recognizing Unseen Visual Relationships

Gal S. Kenigsfield, Ran El-Yaniv

5:00

08/12/2020

Generalized Shortest-Paths Encoders for AMR-to-Text Generation

Lisa Jin, Daniel Gildea

14:57

19/08/2021

TextGTL: Graph-based Transductive Learning for Semi-supervised Text Classification via Structure-Sensitive Interpolation

Chen Li, Xutan Peng, Hao Peng, Jianxin Li, Lihong Wang

Keywords: Machine Learning, Semi-Supervised Learning, Mining Graphs, Semi Structured Data, Complex Data

13:14

01/07/2020

Toward General Scene Graph: Integration of Visual Semantic Knowledge with Entity Synset Alignment

Woo Suk Choi, Kyoung-Woon On, Yu-Jung Heo, Byoung-Tak Zhang

5:28

02/02/2021

Group-Wise Semantic Mining for Weakly Supervised Semantic Segmentation

Xueyi Li, Tianfei Zhou, Jianwu Li, Yi Zhou, Zhaoxiang Zhang

14:47

14/06/2020

Adaptive Graph Convolutional Network With Attention Graph Clustering for Co-Saliency Detection

Kaihua Zhang, Tengpeng Li, Shiwen Shen, Bo Liu, Jin Chen, Qingshan Liu

Keywords: co-saliency detection, deep learning, end-to-end, graph convolution, group attention, graph clustering, joint training, foreground attention, consistency, state-of-the-art

1:01

14/06/2020

Hypergraph Attention Networks for Multimodal Learning

Eun-Sol Kim, Woo Young Kang, Kyoung-Woon On, Yu-Jung Heo, Byoung-Tak Zhang

Keywords: multimodal learning, graph neural network, deep learning, visual question answering, graph question answering, bilinear attention mechanism

1:01

06/12/2021

Coupled Segmentation and Edge Learning via Dynamic Graph Propagation

Zhiding Yu, Rui Huang, Wonmin Byeon, Sifei Liu, Guilin Liu, Thomas Breuel, Anima Anandkumar, Jan Kautz

Keywords: robustness, vision, graph learning

9:46

06/12/2021

Local Hyper-Flow Diffusion

Kimon Fountoulakis, Pan Li, Shenghao Yang

Keywords: optimization, graph learning, clustering

14:24

03/05/2021

Retrieval-Augmented Generation for Code Summarization via Hybrid GNN

Shangqing Liu, Yu Chen, Xiaofei Xie, Siow Jing Kai, Yang Liu

Keywords: Generation, Retrieval, Code Summarization, Graph Neural Network

9:37

19/08/2021

Graph Deformer Network

Wenting Zhao, Yuan Fang, Zhen Cui, Tong Zhang, Jian Yang

Keywords: Data Mining, Mining Graphs, Semi Structured Data, Complex Data, Feature Extraction, Selection and Dimensionality Reduction, Classification

11:35

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R Devon Hjelm, Shikhar Sharma

17:44

04/07/2020

Line Graph Enhanced AMR-to-Text Generation with Mix-Order Graph Attention Networks

Yanbin Zhao, Lu Chen, Zhi Chen, Ruisheng Cao, Su Zhu, Kai Yu

Keywords: Line Generation, AMR-to-text generation, graph-to-sequence task, graph-to-sequence modeling

10:56

22/11/2021

Image-Text Alignment using Adaptive Cross-attention with Transformer Encoder for Scene Graphs

Juyong Song, Sunghyun Choi

Keywords: cross-attention, multi-modal, retrieval, scene-graphs, graph neural networks, contrastive loss

3:01

02/02/2021

Unsupervised Domain Adaptation for Person Re-identification via Heterogeneous Graph Alignment

Minying Zhang, Kai Liu, Yidong Li, Shihui Guo, Hongtao Duan, Yimin Long, Yi Jin

16:37

08/12/2020

Integrating knowledge graph embeddings to improve mention representation for bridging anaphora resolution

Onkar Pandit, Pascal Denis, Liva Ralaivola

17:39