Recognizing Vector Graphics without Rasterization

06/12/2021

Xinyang Jiang, Lu Liu, Caihua Shan, Yifei Shen, Xuanyi Dong, Dongsheng Li

Keywords: deep learning, machine learning, vision, graph learning

Abstract: In this paper, we consider a different data format for images: vector graphics. In contrast to raster graphics, which are widely used in image recognition, vector graphics can be scaled up or down to any resolution without aliasing or information loss, thanks to the analytic representation of the primitives in the document. Furthermore, vector graphics provide extra structural information on how low-level elements group together to form high-level shapes or structures. These merits of vector graphics have not been fully leveraged in existing methods. To explore this data format, we target the fundamental recognition tasks of object localization and classification. We propose an efficient CNN-free pipeline, called YOLaT (You Only Look at Text), that does not render the graphic into pixels (i.e., rasterization) and instead takes the textual document of the vector graphics as input. YOLaT builds multi-graphs to model the structural and spatial information in vector graphics, and a dual-stream graph neural network is proposed to detect objects from the graph. Our experiments show that by directly operating on vector graphics, YOLaT outperforms raster-graphic based object detection baselines in terms of both average precision and efficiency.
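
To make the idea of reading a vector graphic as text rather than pixels concrete, the following is a minimal sketch and not the authors' YOLaT implementation. It assumes an SVG input containing only `<line>` and `<rect>` primitives, and the function name `svg_to_graph` is hypothetical. Endpoints become graph nodes and straight segments become edges, with no rendering step.

```python
import xml.etree.ElementTree as ET

SVG_NS = "{http://www.w3.org/2000/svg}"


def svg_to_graph(svg_text):
    """Build (coords, edges) from <line> and <rect> primitives in SVG text.

    Nodes are unique (x, y) endpoints; edges are index pairs joining the
    endpoints of each straight segment. No pixels are rendered.
    This is an illustrative simplification, not the multi-graph from the paper.
    """
    root = ET.fromstring(svg_text)
    nodes = {}  # (x, y) -> node index
    edges = []  # (i, j) index pairs

    def node(x, y):
        key = (float(x), float(y))
        return nodes.setdefault(key, len(nodes))

    for el in root.iter():
        tag = el.tag.replace(SVG_NS, "")
        if tag == "line":
            edges.append((node(el.get("x1"), el.get("y1")),
                          node(el.get("x2"), el.get("y2"))))
        elif tag == "rect":
            x, y = float(el.get("x", 0)), float(el.get("y", 0))
            w, h = float(el.get("width")), float(el.get("height"))
            corners = [(x, y), (x + w, y), (x + w, y + h), (x, y + h)]
            ids = [node(cx, cy) for cx, cy in corners]
            # Connect consecutive corners, closing the rectangle.
            edges.extend((ids[k], ids[(k + 1) % 4]) for k in range(4))

    coords = sorted(nodes, key=nodes.get)  # coordinates ordered by node index
    return coords, edges


doc = """<svg xmlns="http://www.w3.org/2000/svg">
  <rect x="0" y="0" width="4" height="2"/>
  <line x1="4" y1="2" x2="6" y2="5"/>
</svg>"""

coords, edges = svg_to_graph(doc)
print(len(coords), "nodes,", len(edges), "edges")
```

Because the coordinates are kept as analytic values, rescaling the graphic amounts to multiplying `coords` by a factor, and the graph structure in `edges` is unchanged.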

The talk and the corresponding paper were published at the NeurIPS 2021 virtual conference.

Similar Papers

02/02/2021

PGNet: Real-time Arbitrarily-Shaped Text Spotting with Point Gathering Network

Pengfei Wang, Chengquan Zhang, Fei Qi, Shanshan Liu, Xiaoqiang Zhang, Pengyuan Lyu, Junyu Han, Jingtuo Liu, Errui Ding, Guangming Shi

18:06

06/12/2021

A Multi-Implicit Neural Representation for Fonts

Pradyumna Reddy, Zhifei Zhang, Matthew Fisher, Hailin Jin, Zhaowen Wang, Niloy Mitra

Keywords: deep learning, representation learning

8:42

02/02/2021

Visual Concept Reasoning Networks

Taesup Kim, Sungwoong Kim, Yoshua Bengio

13:01

14/06/2020

DIST: Rendering Deep Implicit Signed Distance Function With Differentiable Sphere Tracing

Shaohui Liu, Yinda Zhang, Songyou Peng, Boxin Shi, Marc Pollefeys, Zhaopeng Cui

Keywords: differentiable rendering, 3d reconstruction, implicit representations, multi-view reconstruction, depth completion, 3d deep learning

1:01

14/06/2020

Probabilistic Structural Latent Representation for Unsupervised Embedding

Mang Ye, Jianbing Shen

Keywords: unsupervised embedding learning, latent representation, instance feature, adaptable softmax

0:59

06/12/2021

OctField: Hierarchical Implicit Functions for 3D Modeling

Jia-Heng Tang, Weikai Chen, Jie Yang, Bo Wang, Songrun Liu, Bo Yang, Lin Gao

13:25

06/12/2020

Neural Unsigned Distance Fields for Implicit Function Learning

Julian Chibane, Mohamad Aymen Mir, Gerard Pons-Moll

Keywords: Applications; Data, Challenges, Implementations, and Software; Data, Challenges, Implementations, and Software -> Benchmarks; R, Data, Challenges, Implementations, and Software -> Data Sets or Data Repositories

3:16

06/12/2020

Evolving Normalization-Activation Layers

Hanxiao Liu, Andy Brock, Karen Simonyan, Quoc V Le

2:32

14/06/2020

Say As You Wish: Fine-Grained Control of Image Caption Generation With Abstract Scene Graphs

Shizhe Chen, Qin Jin, Peng Wang, Qi Wu

Keywords: image captioning, controllability, diversity, graph neural network

4:54

06/12/2021

Low-Rank Subspaces in GANs

Jiapeng Zhu, Ruili Feng, Yujun Shen, Deli Zhao, Zheng-Jun Zha, Jingren Zhou, Qifeng Chen

Keywords: generative model

11:41

14/06/2020

Density-Aware Feature Embedding for Face Clustering

Senhui Guo, Jing Xu, Dapeng Chen, Chao Zhang, Xiaogang Wang, Rui Zhao

Keywords: representation learning, clustering, face clustering, density, density-aware, gcn

0:55

30/11/2020

3D Object Detection and Pose Estimation of Unseen Objects in Color Images with Local Surface Embeddings

Giorgia Pitteri, Aurélie Bugeau, Slobodan Ilic, Vincent Lepetit

9:17

02/02/2021

Explicitly Modeled Attention Maps for Image Classification

Andong Tan, Duc Tam Nguyen, Maximilian Dax, Matthias Nießner, Thomas Brox

16:59

04/07/2020

pyBART: Evidence-based Syntactic Transformations for IE

Aryeh Tiktinsky, Yoav Goldberg, Reut Tsarfaty

Keywords: IE, machine-learned tasks, downstream applications, data-driven transformations

14:02

06/12/2020

Learning Deformable Tetrahedral Meshes for 3D Reconstruction

Jun Gao, Wenzheng Chen, Tommy Xiang, Alec Jacobson, Morgan McGuire, Sanja Fidler

Keywords: Applications -> Computer Vision; Deep Learning -> CNN Architectures, Applications -> Body Pose, Face, and Gesture Analysis

3:22

03/05/2021

Anchor & Transform: Learning Sparse Embeddings for Large Vocabularies

Paul Pu Liang, Manzil Zaheer, Yuan Wang, Amr Ahmed

Keywords: text classification, recommendation systems, large vocabularies, sparse embeddings, language modeling

7:03

02/02/2021

SPIN: Structure-Preserving Inner Offset Network for Scene Text Recognition

Chengwei Zhang, Yunlu Xu, Zhanzhan Cheng, Shiliang Pu, Yi Niu, Fei Wu, Futai Zou

14:11

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

Keywords: visual localization, camera relocalization, scene coordinate regression

1:01

03/05/2021

GAN "Steerability" without optimization

Nurit Spingarn Eliezer, Ron Banner, Tomer Michaeli

Keywords: nonlinear walk, semantic directions in latent space, Generative Adversarial Network

9:42

07/09/2020

Advancing weakly supervised cross-domain alignment with optimal transport

Siyang Yuan, Ke Bai, Liqun Chen, Yizhe Zhang, Chenyang Tao, Chunyuan Li, Guoyin Wang, Ricardo Henao, Lawrence Carin Duke

Keywords: Optimal Transport, Cross Domain Alignment

10:04

06/12/2020

Learning Physical Graph Representations from Visual Scenes

Daniel Bear, Chaofei Fan, Damian Mrowca, Yunzhu Li, Seth Alter, Aran Nayebi, Jeremy Schwartz, Li Fei-Fei, Jiajun Wu, Josh Tenenbaum, Daniel Yamins

3:19

14/06/2020

Unsupervised Intra-Domain Adaptation for Semantic Segmentation Through Self-Supervision

Fei Pan, Inkyu Shin, Francois Rameau, Seokju Lee, In So Kweon

Keywords: domain adaptation, semantic segmentation, self-supervised learning, unsupervised learning, transfer learning

4:58

14/06/2020

Unsupervised Learning of Intrinsic Structural Representation Points

Nenglun Chen, Lingjie Liu, Zhiming Cui, Runnan Chen, Duygu Ceylan, Changhe Tu, Wenping Wang

Keywords: 3d point cloud learning, structure point, unsupervised learning

1:00

06/12/2021

Capacity and Bias of Learned Geometric Embeddings for Directed Graphs

Michael Boratko, Dongxu Zhang, Nicholas Monath, Luke Vilnis, Kenneth L Clarkson, Andrew McCallum

Keywords: machine learning, graph learning, representation learning

14:56

22/11/2021

Extended Differentiable Marching Cubes by Manifold-Preserving Shape Inflation

Kiichi Itoh, Tatsuya Yatagawa, Yutaka Ohtake, Suzuki Hiromasa

Keywords: surface reconstruction, marching cubes, normalizing flow, deep learning

3:01

26/04/2020

Unrestricted Adversarial Examples via Semantic Manipulation

Anand Bhattad, Min Jin Chong, Kaizhao Liang, Bo Li, D. A. Forsyth

Keywords: Adversarial Examples, Semantic Manipulation, Image Colorization, Texture Transfer

3:51

06/12/2020

DeepSVG: A Hierarchical Generative Network for Vector Graphics Animation

Alexandre Carlier, Martin Danelljan, Alexandre Alahi, Radu Timofte

3:23

06/12/2021

Learning to Compose Visual Relations

Nan Liu, Shuang Li, Yilun Du, Josh Tenenbaum, Antonio Torralba

Keywords: deep learning, graph learning

10:15

14/06/2020

Learning Unsupervised Hierarchical Part Decomposition of 3D Objects From a Single RGB Image

Despoina Paschalidou, Luc Van Gool, Andreas Geiger

Keywords: 3d reconstruction, primitive-based representations, structure-aware representations, part-based decomposition, primitives, semantic shape abstractions, single-view 3d reconstruction, unsupervised learning, 3d deep learning

1:01

06/12/2021

ResT: An Efficient Transformer for Visual Recognition

Qinglong Zhang, Yu-Bin Yang

Keywords: machine learning, transformers, vision

12:23

14/06/2020

PFCNN: Convolutional Neural Networks on 3D Surfaces Using Parallel Frames

Yuqi Yang, Shilin Liu, Hao Pan, Yang Liu, Xin Tong

Keywords: surface meshes, convolution neural networks, parallel frames, translation equivariance, local euclidean structure, non-rigid shape, classification, segmentation, registration

0:58

14/06/2020

SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization

Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker

Keywords: differentiable rendering, signed distance field, image-based 3d reconstruction, 3d shape optimization, deep learning, inverse graphics

5:01

03/05/2021

Bowtie Networks: Generative Modeling for Joint Few-Shot Recognition and Novel-View Synthesis

Zhipeng Bao, Yu-Xiong Wang, Martial Hebert

Keywords: adversarial training, computer vision, object recognition, few-shot learning, generative models

5:11

02/02/2021

Object-Centric Image Generation from Layouts

Tristan Sylvain, Pengchuan Zhang, Yoshua Bengio, R Devon Hjelm, Shikhar Sharma

17:44

14/06/2020

Spatial Pyramid Based Graph Reasoning for Semantic Segmentation

Xia Li, Yibo Yang, Qijie Zhao, Tiancheng Shen, Zhouchen Lin, Hong Liu

Keywords: semantic segmentation, graph convolution, spatial pyramid, scene parsing

1:01

02/02/2021

ASHF-Net: Adaptive Sampling and Hierarchical Folding Network for Robust Point Cloud Completion

Daoming Zong, Shiliang Sun, Jing Zhao

16:49

03/08/2020

Locally Masked Convolution for Autoregressive Models

Ajay Jain, Pieter Abbeel, Deepak Pathak

8:28

03/05/2021

Counterfactual Generative Networks

Axel Sauer, Andreas Geiger

Keywords: Generative Models, Data Augmentation, Image Classification, Counterfactuals, Robustness, Causality

5:25

18/07/2021

Sharf: Shape-conditioned Radiance Fields from a Single View

Konstantinos Rematas, Ricardo Martin-Brualla, Vittorio Ferrari

Keywords: Applications, Computer Vision

5:11

06/12/2020

Convolutional Generation of Textured 3D Meshes

Dario Pavllo, Graham Spinks, Thomas Hofmann, Marie-Francine Moens, Aurelien Lucchi

3:16