Cross-Modal Hierarchical Modelling for Fine-Grained Sketch Based Image Retrieval

07/09/2020

Cross-Modal Hierarchical Modelling for Fine-Grained Sketch Based Image Retrieval

Aneeshan Sain, Ayan Kumar Bhunia, Yongxin Yang, Tao Xiang, Yi-Zhe Song

Keywords: cross-modal co-attention, sketch hierarchy, cross-modal retrieval, sketch based image retrieval

Abstract Paper Similar Papers

Abstract: Sketch as an image search query is an ideal alternative to text in capturing the fine-grained visual details. Prior successes on fine-grained sketch-based image retrieval (FG-SBIR) have demonstrated the importance of tackling the unique traits of sketches as opposed to photos, e.g., temporal vs. static, strokes vs. pixels, and abstract vs. pixel-perfect. In this paper, we study a further trait of sketches that has been overlooked to date, that is, they are hierarchical in terms of the levels of detail -- a person typically sketches up to various extents of detail to depict an object. This hierarchical structure is often visually distinct. In this paper, we design a novel network that is capable of cultivating sketch-specific hierarchies and exploiting them to match sketch with photo at corresponding hierarchical levels. In particular, features from a sketch and a photo are enriched using cross-modal co-attention, coupled with hierarchical node fusion at every level to form a better embedding space to conduct retrieval. Experiments on common benchmarks show our method to outperform state-of-the-arts by a significant margin.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at BMVC 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

19/08/2021

Norm-guided Adaptive Visual Embedding for Zero-Shot Sketch-Based Image Retrieval

Wenjie Wang, Yufeng Shi, Shiming Chen and
Qinmu Peng, Feng Zheng, Xinge You

Keywords Paper

Computer Vision, Recognition, Deep Learning, Multi-instance; Multi-label; Multi-view learning

0

0

0

0

10:53

06/12/2020

Generative View Synthesis: From Single-view Semantics to Novel-view Images

Tewodros Amberbir Habtegebrial, Varun Jampani, Orazio Gallo, Didier Stricker

Keywords Paper

0

0

0

0

3:20

14/06/2020

Sketch Less for More: On-the-Fly Fine-Grained Sketch-Based Image Retrieval

Ayan Kumar Bhunia, Yongxin Yang, Timothy M. Hospedales and
Tao Xiang, Yi-Zhe Song

Keywords Paper

fine-grained sketch based image retrieval, on-the-fly retrieval, reinforcement learning, cross-modal retrieval, proximal policy optimization

0

0

0

0

4:56

02/02/2021

Self-Supervised Sketch-to-Image Synthesis

Bingchen Liu, Yizhe Zhu, Kunpeng Song, Ahmed Elgammal

Keywords Paper

0

0

0

0

14:42

22/11/2021

Contour-guided Image Completion with Perceptual Grouping

Morteza Rezanejad, Sidharth Gupta, Chandra Gummaluru and
Ryan Marten, John Wilder, Michael Gruninger, Dirk B. Walther

Keywords Paper

Image Completion, Inpainting, Perceptual Grouping, Stochastic Completion Fields, Contour Completion, Good Continuation, Perceptual Organisation

0

0

0

0

3:02

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio and
R Devon Hjelm, Shikhar Sharma

Keywords Paper

0

0

0

0

17:44

17/08/2020

DeepFaceDrawing: Deep generation of face images from sketches

Shu-Yu Chen, Wanchao Su, Lin Gao and
Shihong Xia, Hongbo Fu

Keywords Paper

sketch-based generation, feature embedding, image-to-image translation, face synthesis

0

0

0

0

18:07

19/08/2021

Domain-Smoothing Network for Zero-Shot Sketch-Based Image Retrieval

Zhipeng Wang, Hao Wang, Jiexi Yan and
Aming Wu, Cheng Deng

Keywords Paper

Computer Vision, Recognition, Transfer, Adaptation, Multi-task Learning

0

0

0

0

10:05

05/01/2021

SHAD3S: A Model to Sketch, Shade and Shadow

Raghav Brahmadesam Venkataramaiyer, Abhishek Joshi, Saisha Narang, Vinay P. Namboodiri

Keywords Paper

0

0

0

0

3:49

05/01/2021

One-Shot Image Recognition Using Prototypical Encoders With Reduced Hubness

Chenxi Xiao, Naveen Madapana, Juan Wachs

Keywords Paper

0

0

0

0

4:29

30/11/2020

D2D: Keypoint Extraction with Describe to Detect Approach

Yurun Tian, Vassileios Balntas, Tony Ng and
Axel Barroso-Laguna, Yiannis Demiris, Krystian Mikolajczyk

Keywords Paper

0

0

0

0

4:34

07/09/2020

SketchHealer: A Graph-to-Sequence Network for Recreating Partial Human Sketches

Guoyao Su, Yonggang Qi, Kaiyue Pang and
Jie Yang, Yi-Zhe Song

Keywords Paper

sketch healing, sketch synthesis, graph-to-sequence network, GCN

0

0

0

0

8:34

14/06/2020

SketchyCOCO: Image Generation From Freehand Scene Sketches

Chengying Gao, Qi Liu, Qi Xu and
Limin Wang, Jianzhuang Liu, Changqing Zou

Keywords Paper

image generation, freehand scene sketches, composite scene-level dataset, sequential stages, cross-domain latent space, sketchycoco, edgegan

0

0

0

0

5:00

03/05/2021

Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis

Zhipeng Bao, Yu-Xiong Wang, Martial Hebert

Keywords Paper

adversarial training, computer vision, object recognition, few-shot learning, generative models

0

0

0

0

5:11

30/11/2020

DeepVoxels++: Enhancing the Fidelity of Novel View Synthesis from 3D Voxel Embeddings

Tong He, John Collomosse, Hailin Jin, Stefano Soatto

Keywords Paper

0

0

0

0

7:47

22/11/2021

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and
Oliver Wang, Kwang In Kim, James Tompkin

Keywords Paper

structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

0

0

0

0

2:49

19/08/2021

A Sketch-Transformer Network for Face Photo-Sketch Synthesis

Mingrui Zhu, Changcheng Liang, Nannan Wang and
Xiaoyu Wang, Zhifeng Li, Xinbo Gao

Keywords Paper

Computer Vision, 2D and 3D Computer Vision, Biometrics, Face and Gesture Recognition

0

0

0

0

12:42

06/12/2021

Dual Progressive Prototype Network for Generalized Zero-Shot Learning

Chaoqun Wang, Shaobo Min, Xuejin Chen and
Xiaoyan Sun, Houqiang Li

Keywords Paper

0

0

0

0

10:51

14/06/2020

Local Class-Specific and Global Image-Level Generative Adversarial Networks for Semantic-Guided Scene Generation

Hao Tang, Dan Xu, Yan Yan and
Philip H.S. Torr, Nicu Sebe

Keywords Paper

generative adversarial networks, local, global, semantic guided, scene generation, semantic image synthesis, cross-view image generation, class-specific feature representation, attention fusion

0

0

0

0

1:00

02/02/2021

Tailoring Embedding Function to Heterogeneous Few-Shot Tasks by Global and Local Feature Adaptors

Su Lu, Han-Jia Ye, De-Chuan Zhan

Keywords Paper

0

0

0

0

14:21

14/06/2020

3D Sketch-Aware Semantic Scene Completion via Semi-Supervised Structure Prior

Xiaokang Chen, Kwan-Yee Lin, Chen Qian and
Gang Zeng, Hongsheng Li

Keywords Paper

semantic scene completion, depth embedding

0

0

0

0

1:01

30/11/2020

Sketch-to-Art: Synthesizing Stylized Art Images From Sketches

Bingchen Liu, Kunpeng Song, Yizhe Zhu, Ahmed Elgammal

Keywords Paper

0

0

0

0

9:12

14/06/2020

ManiGAN: Text-Guided Image Manipulation

Bowen Li, Xiaojuan Qi, Thomas Lukasiewicz, Philip H.S. Torr

Keywords Paper

image manipulation, natural language, generative adversarial networks, gan

0

0

0

0

1:01

06/12/2020

Hard Example Generation by Texture Synthesis for Cross-domain Shape Similarity Learning

Huan Fu, Shunming Li, Rongfei Jia and
Mingming Gong, Binqiang Zhao, Dacheng Tao

Keywords Paper

0

0

0

0

3:21

22/11/2021

Multimodal Semi-Supervised Learning for 3D Objects

Zhimin Chen, Longlong Jing, Yang Liang and
YingLi Tian, Bing Li

Keywords Paper

Semi-supervised learning, Multimodal learning, Representation learning

0

0

0

0

2:56

05/01/2021

Coarse-to-Fine Gaze Redirection With Numerical and Pictorial Guidance

Jingjing Chen, Jichao Zhang, Enver Sangineto and
Tao Chen, Jiayuan Fan, Nicu Sebe

Keywords Paper

0

0

0

0

4:34

30/11/2020

3D Guided Weakly Supervised Semantic Segmentation

Weixuan Sun, Jing Zhang, Nick Barnes

Keywords Paper

0

0

0

0

9:20

02/02/2021

Sketch Generation with Drawing Process Guided by Vector Flow and Grayscale

Zhengyan Tong, Xuanhong Chen, Bingbing Ni, Xiaohang Wang

Keywords Paper

0

0

0

0

18:15

14/06/2020

Unsupervised Learning of Intrinsic Structural Representation Points

Nenglun Chen, Lingjie Liu, Zhiming Cui and
Runnan Chen, Duygu Ceylan, Changhe Tu, Wenping Wang

Keywords Paper

3d point cloud learning, structure point, unsupervised learning

0

0

0

0

1:00

02/02/2021

Dual-level Collaborative Transformer for Image Captioning

Yunpeng Luo, Jiayi Ji, Xiaoshuai Sun and
Liujuan Cao, Yongjian Wu, Feiyue Huang, Chia-Wen Lin, Rongrong Ji

Keywords Paper

0

0

0

0

14:58

14/06/2020

Fashion Editing With Adversarial Parsing Learning

Haoye Dong, Xiaodan Liang, Yixuan Zhang and
Xujie Zhang, Xiaohui Shen, Zhenyu Xie, Bowen Wu, Jian Yin

Keywords Paper

fashion editing, image generation, image synthesis, gan, generative adversarial network, image manipulation, human parsing, segmentation, image editing, virtual try-on

0

0

0

0

1:00

02/02/2021

An Unsupervised Sampling Approach for Image-Sentence Matching Using Document-level Structural Information

Zejun Li, Zhongyu Wei, Zhihao Fan and
Haijun Shan, Xuanjing Huang

Keywords Paper

0

0

0

0

18:39

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao and
Jakob Verbeek, Juho Kannala

Keywords Paper

visual localization, camera relocalization, scene coordinate regression

0

0

0

0

1:01

18/07/2021

SketchEmbedNet: Learning Novel Concepts by Imitating Drawings

Alexander Wang, Mengye Ren, Richard Zemel

Keywords Paper

Neuroscience and Cognitive Science, Human or Animal Learning, Neuroscience and Cognitive Science, Memory; Optimization, Combinatorial Optimization; Optimization, Submodular Optimizati, Deep Learning, Embedding and Representation learning

0

0

0

0

5:47

14/06/2020

Cross-Domain Correspondence Learning for Exemplar-Based Image Translation

Pan Zhang, Bo Zhang, Dong Chen and
Lu Yuan, Fang Wen

Keywords Paper

exemplar based image translation, correspondence, gan, weak supervision learning

0

0

0

0

5:01

02/02/2021

Learning Intact Features by Erasing-Inpainting for Few-shot Classification

Junjie Li, Zilei Wang, Xiaoming Hu

Keywords Paper

0

0

0

0

15:15

05/01/2021

Saliency Driven Perceptual Image Compression

Yash Patel, Srikar Appalaraju, R. Manmatha

Keywords Paper

0

0

0

0

4:58

06/12/2021

Deep Marching Tetrahedra: a Hybrid Representation for High-Resolution 3D Shape Synthesis

Tianchang Shen, Jun Gao, Kangxue Yin and
Ming-Yu Liu, Sanja Fidler

Keywords Paper

deep learning, optimization, generative model

0

0

0

0

14:32

14/06/2020

SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization

Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker

Keywords Paper

differentiable rendering, signed distance field, image-based 3d reconstruction, 3d shape optimization, deep learning, inverse graphics

0

0

0

0

5:01

06/12/2020

Geo-PIFu: Geometry and Pixel Aligned Implicit Functions for Single-view Human Reconstruction

Tong He, John Collomosse, Hailin Jin, Stefano Soatto

Keywords Paper

0

0

0

0

3:16