Text-based Person Search via Multi-Granularity Embedding Learning

19/08/2021

Chengji Wang, Zhiming Luo, Yaojin Lin, Shaozi Li

Keywords: Computer Vision, Language and Vision, Recognition

Abstract: Most existing text-based person search methods depend heavily on exploring the correspondence between image regions and the words in the sentence. However, these methods correlate image regions and words at the same semantic granularity, which 1) produces irrelevant correspondences between image and text and 2) causes an ambiguous embedding problem. In this study, we propose a novel multi-granularity embedding learning model for text-based person search. It generates multi-granularity embeddings of partial person bodies in a coarse-to-fine manner by revisiting the person image at different spatial scales. Specifically, we distill the partial knowledge from image strips to guide the model to select the semantically relevant words from the text description, so that it can learn discriminative and modality-invariant visual-textual embeddings. In addition, we integrate the partial embeddings at each granularity and perform multi-granularity image-text matching. Extensive experiments validate the effectiveness of our method, which achieves new state-of-the-art performance through the learned discriminative partial embeddings.
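
The coarse-to-fine idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the granularities (1, 2, 3), the average pooling over horizontal strips, the cosine-similarity score, and all function names are assumptions made for illustration, and the text-side part embeddings are assumed to be given rather than selected by the distillation step the abstract describes.

```python
import math


def split_into_strips(feature_rows, num_strips):
    """Average-pool the rows of a feature map into horizontal strips.

    feature_rows is a list of row vectors (top to bottom of the image).
    Strip boundaries use integer arithmetic so every row is assigned.
    """
    n = len(feature_rows)
    dim = len(feature_rows[0])
    strips = []
    for s in range(num_strips):
        chunk = feature_rows[s * n // num_strips:(s + 1) * n // num_strips]
        strips.append([sum(row[d] for row in chunk) / len(chunk) for d in range(dim)])
    return strips


def multi_granularity_embeddings(feature_rows, granularities=(1, 2, 3)):
    """Hypothetical helper: one list of part embeddings per granularity."""
    return {g: split_into_strips(feature_rows, g) for g in granularities}


def cosine(a, b):
    """Cosine similarity of two vectors; 0.0 if either has zero norm."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def matching_score(image_parts, text_parts):
    """Mean cosine similarity over aligned parts at every granularity."""
    sims = [
        cosine(img_vec, txt_vec)
        for g, img_vecs in image_parts.items()
        for img_vec, txt_vec in zip(img_vecs, text_parts[g])
    ]
    return sum(sims) / len(sims)


# Toy 6x2 "feature map"; real features would come from an image backbone.
rows = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0], [0.0, 1.0], [1.0, 1.0], [1.0, 1.0]]
image_parts = multi_granularity_embeddings(rows)
# Stand-in for text-side embeddings: reuse the image parts for a sanity check.
score = matching_score(image_parts, image_parts)
```

Matching a set of parts against itself gives a score of 1 up to floating-point error, which serves only as a sanity check of the scoring function.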

The talk and the respective paper were published at the IJCAI 2021 virtual conference.

Similar Papers

02/02/2021

FILTER: An Enhanced Fusion Method for Cross-lingual Language Understanding

Yuwei Fang, Shuohang Wang, Zhe Gan, Siqi Sun, Jingjing Liu

17:39

05/01/2021

TranstextNet: Transducing Text for Recognizing Unseen Visual Relationships

Gal S. Kenigsfield, Ran El-Yaniv

5:00

30/11/2020

Show, Conceive and Tell: Image Captioning with Prospective Linguistic Information

Yiqing Huang, Jiansheng Chen

7:08

02/02/2021

Fake it Till You Make it: Self-Supervised Semantic Shifts for Monolingual Word Embedding Tasks

Maurício Gruppi, Pin-Yu Chen, Sibel Adali

19:35

22/11/2021

Text-Based Person Search with Limited Data

Xiao Han, Sen He, Li Zhang, Tao Xiang

Keywords: person re-identification, cross-modal image retrieval, fine-grained image retrieval, text-based person search

3:04

02/02/2021

Learning Intact Features by Erasing-Inpainting for Few-shot Classification

Junjie Li, Zilei Wang, Xiaoming Hu

15:15

04/07/2020

Improving Disentangled Text Representation Learning with Information-Theoretic Guidance

Pengyu Cheng, Martin Renqiang Min, Dinghan Shen, Christopher Malon, Yizhe Zhang, Yitong Li, Lawrence Carin

Keywords: Learning language, NLP tasks, conditional generation, style transfer

9:56

26/04/2020

Masked Based Unsupervised Content Transfer

Ron Mokady, Sagie Benaim, Lior Wolf, Amit Bermano

4:38

06/12/2021

Implicit Semantic Response Alignment for Partial Domain Adaptation

Wenxiao Xiao, Zhengming Ding, Hongfu Liu

Keywords: domain adaptation, transfer learning

11:43

02/02/2021

Task-Independent Knowledge Makes for Transferable Representations for Generalized Zero-Shot Learning

Chaoqun Wang, Xuejin Chen, Shaobo Min, Xiaoyan Sun, Houqiang Li

14:56

16/11/2020

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Milan Aggarwal, Hiresh Gupta, Mausoom Sarkar, Balaji Krishnamurthy

Keywords: document extraction, semantic task, image resolution, structure extraction

11:26

02/02/2021

Non-Autoregressive Coarse-to-Fine Video Captioning

Bang Yang, Yuexian Zou, Fenglin Liu, Can Zhang

18:21

06/12/2020

Hierarchical Granularity Transfer Learning

Shaobo Min, Hongtao Xie, Hantao Yao, Xuran Deng, Zheng-Jun Zha, Yongdong Zhang

3:07

07/09/2020

Deep Metric Learning Meets Deep Clustering: An Novel Unsupervised Approach for Feature Embedding

Binh Nguyen, Binh Nguyen, Gustavo Carneiro, Erman Tjiputra, Quang Tran, Thanh-Toan Do

Keywords: unsupervised deep metric learning, unsupervised feature learning, unsupervised metric loss, negative mining, deep clustering, pseudo labels, reconstruction, centroid representations, retrieval, multi-task

6:18

14/06/2020

IMRAM: Iterative Matching With Recurrent Attention Memory for Cross-Modal Image-Text Retrieval

Hui Chen, Guiguang Ding, Xudong Liu, Zijia Lin, Ji Liu, Jungong Han

Keywords: cross-modal image text retrieval, iterative matching, recurrent attention memory

1:04

30/11/2020

Image Captioning through Image Transformer

Sen He, Wentong Liao, Hamed R. Tavakoli, Michael Yang, Bodo Rosenhahn, Nicolas Pugeault

9:49

06/12/2021

HSVA: Hierarchical Semantic-Visual Adaptation for Zero-Shot Learning

Shiming Chen, Guosen Xie, Yang Liu, Qinmu Peng, Baigui Sun, Hao Li, Xinge You, Ling Shao

Keywords: generative model, domain adaptation

9:19

06/12/2021

Explainable Semantic Space by Grounding Language to Vision with Cross-Modal Contrastive Learning

Yizhen Zhang, Minkyu Choi, Kuan Han, Zhongming Liu

Keywords: contrastive learning, language

14:34

14/06/2020

Multi-Modality Cross Attention Network for Image and Sentence Matching

Xi Wei, Tianzhu Zhang, Yan Li, Yongdong Zhang, Feng Wu

Keywords: cross modal, retrieval, transformer, attention, intra-modality, inter-modality

0:59

03/05/2021

Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis

Zhipeng Bao, Yu-Xiong Wang, Martial Hebert

Keywords: adversarial training, computer vision, object recognition, few-shot learning, generative models

5:11

08/12/2020

Learning distributed sentence vectors with bi-directional 3D convolutions

Bin Liu, Liang Wang, Guosheng Yin

3:07

12/07/2020

On Variational Learning of Controllable Representations for Text without Supervision

Peng Xu, Jackie Chi Kit Cheung, Yanshuai Cao

Keywords: Representation Learning

14:51

14/06/2020

Nested Scale-Editing for Conditional Image Synthesis

Lingzhi Zhang, Jiancong Wang, Yinshuang Xu, Jie Min, Tarmily Wen, James C. Gee, Jianbo Shi

Keywords: scale editing, identity recovery, image synthesis, super-resolution, image outpainting, text2image, cross-modal translation

1:01

02/02/2021

Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition

Siteng Huang, Min Zhang, Yachen Kang, Donglin Wang

17:04

02/02/2021

Visual Boundary Knowledge Translation for Foreground Segmentation

Zunlei Feng, Lechao Cheng, Xinchao Wang, Xiang Wang, Ya Jie Liu, Xiangtong Du, Mingli Song

14:28

19/08/2021

Deep Unified Cross-Modality Hashing by Pairwise Data Alignment

Yimu Wang, Bo Xue, Quan Cheng, Yuhui Chen, Lijun Zhang

Keywords: Computer Vision, Recognition, Information Retrieval, Deep Learning

13:11

02/02/2021

Semantic-guided Reinforced Region Embedding for Generalized Zero-Shot Learning

Jiannan Ge, Hongtao Xie, Shaobo Min, Yongdong Zhang

16:22

14/06/2020

ManiGAN: Text-Guided Image Manipulation

Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip H.S. Torr

Keywords: image manipulation, natural language, generative adversarial networks, gan

1:01

22/11/2021

Multi-Granularity Hypergraphs and Adversarial Complementary Learning for Person Re-identification

Yi Ma, Tian Bai, Wenyu Zhang, Jian Hu

Keywords: Person Re-Identification, Hypergraphs Learning, Adversarial Complementary Learning

2:40

05/12/2020

Named entity recognition in multi-level contexts

Yubo Chen, Chuhan Wu, Tao Qi, Zhigang Yuan, Yongfeng Huang

14:10

08/12/2020

Automatic Word Association Norms (AWAN)

Jorge Reyes-Magaña, Gerardo Sierra Martínez, Gemma Bel-Enguix, Helena Gomez-Adorno

14:34

19/08/2021

TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning

Zhihao Fan, Zhongyu Wei, Siyuan Wang, Ruize Wang, Zejun Li, Haijun Shan, Xuanjing Huang

Keywords: Computer Vision, Language and Vision, Natural Language Generation

10:46

06/12/2021

TopicNet: Semantic Graph-Guided Topic Discovery

Zhibin Duan, Yishi Xu, Bo Chen, Dongsheng Wang, Chaojie Wang, Mingyuan Zhou

Keywords: optimization, generative model, graph learning

10:15

19/08/2021

Dependent Multi-Task Learning with Causal Intervention for Image Captioning

Wenqing Chen, Jidong Tian, Caoyun Fan, Hao He, Yaohui Jin

Keywords: Machine Learning, Transfer, Adaptation, Multi-task Learning, Natural Language Generation, Language and Vision

12:02

30/11/2020

Query by Strings and Return Ranking Word Regions with Only One Look

Peng Zhao, Wenyuan Xue, Qingyong Li, Siqi Cai

6:46

02/02/2021

Train a One-Million-Way Instance Classifier for Unsupervised Visual Representation Learning

Yu Liu, Lianghua Huang, Pan Pan, Bin Wang, Yinghui Xu, Rong Jin

15:15

02/02/2021

Deep Metric Learning with Self-Supervised Ranking

Zheren Fu, Yan Li, Zhendong Mao, Quan Wang, Yongdong Zhang

12:36

22/11/2021

Deep Video Decaptioning

Pengpeng Chu, Weize Quan, Tong Wang, Pan Wang, Peiran Ren, Dong-Ming Yan

Keywords: video decaptioning, caption mask extraction, frame attention, real time

2:59

04/07/2020

Enabling Language Models to Fill in the Blanks

Chris Donahue, Mina Lee, Percy Liang

Keywords: text infilling, predicting text, writing tools, language modeling

7:01

06/12/2020

Learning Semantic-aware Normalization for Generative Adversarial Networks

Heliang Zheng, Jianlong Fu, Yanhong Zeng, Jiebo Luo, Zheng-Jun Zha

3:11