14/06/2020

Cross-Modal Cross-Domain Moment Alignment Network for Person Search

Ya Jing, Wei Wang, Liang Wang, Tieniu Tan

Keywords: cross-domain adaptation, text-based person search, moment alignment network, cross-modal retrieval, unsupervised learning

Abstract: Text-based person search has drawn increasing attention due to its wide applications in video surveillance. However, most existing models depend heavily on paired image-text data, which is very expensive to acquire. Moreover, they suffer a large performance drop when directly applied to new domains. To overcome this problem, we make the first attempt to adapt the model to new target domains in the absence of pairwise labels, which combines the challenges of both cross-modal (text-based) person search and cross-domain person search. Specifically, we propose a moment alignment network (MAN) to solve the cross-modal cross-domain person search task. The idea is to learn three effective moment alignments: domain alignment (DA), cross-modal alignment (CA), and exemplar alignment (EA), which together learn domain-invariant and semantically aligned cross-modal representations to improve model generalization. Extensive experiments are conducted on the CUHK Person Description dataset (CUHK-PEDES) and the Richly Annotated Pedestrian dataset (RAP). Experimental results show that our proposed model achieves state-of-the-art performance on five transfer tasks.
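The abstract does not spell out the concrete form of the three alignment losses. As a rough illustration of how moment-based alignment is commonly implemented, the PyTorch sketch below matches first- and second-order feature moments across domains (DA) and across modalities (CA), plus a paired exemplar term (EA). All function names, arguments, and loss weights here are illustrative assumptions, not the paper's actual formulation.

import torch

def moment_alignment_loss(feat_a: torch.Tensor, feat_b: torch.Tensor) -> torch.Tensor:
    # Hypothetical moment-matching loss: align the mean (first moment) and
    # covariance (second moment) of two (batch, dim) feature matrices,
    # e.g. source vs. target images (DA) or images vs. texts (CA).
    mean_a, mean_b = feat_a.mean(dim=0), feat_b.mean(dim=0)
    ca = feat_a - mean_a  # centered features for covariance estimation
    cb = feat_b - mean_b
    cov_a = ca.t() @ ca / max(feat_a.size(0) - 1, 1)
    cov_b = cb.t() @ cb / max(feat_b.size(0) - 1, 1)
    loss_mean = (mean_a - mean_b).pow(2).sum()
    loss_cov = (cov_a - cov_b).pow(2).sum()
    return loss_mean + loss_cov

def man_total_loss(img_src, img_tgt, txt_src, txt_tgt,
                   lambda_da=1.0, lambda_ca=1.0, lambda_ea=1.0):
    # Hypothetical combined objective: domain alignment between source and
    # target image features, cross-modal alignment between target image and
    # text features, and an exemplar-level term pulling each labeled source
    # image toward its paired text description.
    l_da = moment_alignment_loss(img_src, img_tgt)
    l_ca = moment_alignment_loss(img_tgt, txt_tgt)
    l_ea = (img_src - txt_src).pow(2).sum(dim=1).mean()  # paired exemplars
    return lambda_da * l_da + lambda_ca * l_ca + lambda_ea * l_ea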

This talk and paper were presented at the CVPR 2020 virtual conference.
