G2L-Net: Global to Local Network for Real-Time 6D Pose Estimation With Embedding Vector Features

14/06/2020

G2L-Net: Global to Local Network for Real-Time 6D Pose Estimation With Embedding Vector Features

Wei Chen, Xi Jia, Hyung Jin Chang, Jinming Duan, Aleš Leonardis

Keywords: object pose estimation, point cloud, embedding vector features, real time

Abstract Paper Similar Papers

Abstract: In this paper, we propose a novel real-time 6D object pose estimation framework, named G2L-Net. Our network operates on point clouds from RGB-D detection in a divide-and-conquer fashion. Specifically, our network consists of three steps. First, we extract the coarse object point cloud from the RGB-D image by 2D detection. Second, we feed the coarse object point cloud to a translation localization network to perform 3D segmentation and object translation prediction. Third, via the predicted segmentation and translation, we transfer the fine object point cloud into a local canonical coordinate, in which we train a rotation localization network to estimate initial object rotation. In the third step, we define point-wise embedding vector features to capture viewpoint-aware information. To calculate more accurate rotation, we adopt a rotation residual estimator to estimate the residual between initial rotation and ground truth, which can boost initial pose estimation performance. Our proposed G2L-Net is real-time despite the fact multiple steps are stacked via the proposed coarse-to-fine framework. Extensive experiments on two benchmark datasets show that G2L-Net achieves state-of-the-art performance in terms of both accuracy and speed.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at CVPR 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

14/06/2020

FroDO: From Detections to 3D Objects

Martin Rünz, Kejie Li, Meng Tang and
Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

Keywords Paper

reconstruction, shape embedding, 3d vision, object detection, shape prior, object representation, monocular, sdf, pointcloud, inference

0

0

0

0

1:01

02/02/2021

R3Det: Refined Single-Stage Detector with Feature Refinement for Rotating Object

Xue Yang, Junchi Yan, Ziming Feng, Tao He

Keywords Paper

0

0

0

0

14:39

14/06/2020

Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation

Dengsheng Chen, Jun Li, Zheng Wang, Kai Xu

Keywords Paper

object pose estimation, 3d vision

0

0

0

0

1:01

02/02/2021

EMLight: Lighting Estimation via Spherical Distribution Approximation

Fangneng Zhan, Changgong Zhang, Yingchen Yu and
Yuan Chang, Shijian Lu, Feiying Ma, Xuansong Xie

Keywords Paper

0

0

0

0

15:44

14/06/2020

PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection

Shaoshuai Shi, Chaoxu Guo, Li Jiang and
Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li

Keywords Paper

3d object detection, point cloud, 3d scene understanding, lidar, autonomous driving, kitti dataset, waymo open dataset

0

0

0

0

1:01

22/11/2021

AniFormer: Data-driven 3D Animation with Transformer

Haoyu Chen, Hao Tang, Nicu Sebe, Guoying Zhao

Keywords Paper

3D motion, 3D generation, 3D style transfer, Transformer, 3D animation

0

0

0

0

2:51

05/01/2021

A Pose Proposal and Refinement Network for Better 6D Object Pose Estimation

Ameni Trabelsi, Mohamed Chaabane, Nathaniel Blanchard, Ross Beveridge

Keywords Paper

0

0

0

0

4:36

06/12/2021

3D Siamese Voxel-to-BEV Tracker for Sparse Point Clouds

Le Hui, Lingpeng Wang, Mingmei Cheng and
Jin Xie, Jian Yang

Keywords Paper

0

0

0

0

8:30

14/06/2020

MotionNet: Joint Perception and Motion Prediction for Autonomous Driving Based on Bird’s Eye View Maps

Pengxiang Wu, Siheng Chen, Dimitris N. Metaxas

Keywords Paper

autonomous driving, perception, motion prediction, bird's eye view map

0

0

0

0

1:00

14/06/2020

DeepEMD: Few-Shot Image Classification With Differentiable Earth Mover’s Distance and Structured Classifiers

Chi Zhang, Yujun Cai, Guosheng Lin, Chunhua Shen

Keywords Paper

few-shot classification, meta learning, few-shot learning, classification, metric learning, convex, optimization, image retrieval

0

0

0

0

5:00

14/06/2020

Density-Based Clustering for 3D Object Detection in Point Clouds

Syeda Mariam Ahmed, Chee Meng Chew

Keywords Paper

3d object detection, edge-aware pointnet, instance segmentation, unsupervised clustering, cascaded modules, semantic segmentation, amodal bounding box detection

0

0

0

0

0:51

14/06/2020

Sequential 3D Human Pose and Shape Estimation From Point Clouds

Kangkan Wang, Jin Xie, Guofeng Zhang and
Lei Liu, Jian Yang

Keywords Paper

3d reconstruction, 3d human pose and shape estimation, point clouds, depth sensor, sequential modeling, spatial-temporal features, mesh convolution, attention model, weakly-supervised fine-tuning, deep learning

0

0

0

0

1:00

16/11/2020

Range Conditioned Dilated Convolutions for Scale Invariant 3D Object Detection

Alex Bewley, Pei Sun, Thomas Mensink and
Dragomir Anguelov, Cristian Sminchisescu

Keywords Paper

0

0

0

0

5:06

06/12/2020

PIE-NET: Parametric Inference of Point Cloud Edges

Xiaogang Wang, Yuelang Xu, Kevin Xu and
Andrea Tagliasacchi, Bin Zhou, Ali Mahdavi-Amiri, Hao Zhang

Keywords Paper

Theory -> Game Theory and Computational Economics, Applications -> Fairness, Accountability, and Transparency

0

0

0

0

3:11

14/06/2020

Cost Volume Pyramid Based Depth Inference for Multi-View Stereo

Jiayu Yang, Wei Mao, Jose M. Alvarez, Miaomiao Liu

Keywords Paper

multi-view stereo, depth estimation, cost volume, coarse-to-fine, depth map, deep learning

0

0

0

0

4:58

14/06/2020

PointGroup: Dual-Set Point Grouping for 3D Instance Segmentation

Li Jiang, Hengshuang Zhao, Shaoshuai Shi and
Shu Liu, Chi-Wing Fu, Jiaya Jia

Keywords Paper

instance segmentation, point cloud, 3d, scene understanding, indoor scenes, bottom-up, grouping, dual-set, scannet, s3dis

0

0

0

0

5:01

30/11/2020

Project to Adapt: Domain Adaptation for Depth Completion from Noisy and Sparse Sensor Data

Adrian Lopez-Rodriguez, Benjamin Busam, Krystian Mikolajczyk

Keywords Paper

0

0

0

0

10:00

02/02/2021

Inferring Camouflaged Objects by Texture-Aware Interactive Guidance Network

Jinchao Zhu, Xiaoyu Zhang, Shuo Zhang, Junnan Liu

Keywords Paper

0

0

0

0

14:04

14/06/2020

Extreme Relative Pose Network Under Hybrid Representations

Zhenpei Yang, Siming Yan, Qixing Huang

Keywords Paper

relative pose estimation, rgb-d registration, few shot reconstruction, hybrid representation

0

0

0

0

4:56

14/06/2020

Reconstruct Locally, Localize Globally: A Model Free Method for Object Pose Estimation

Ming Cai, Ian Reid

Keywords Paper

object pose estimation, convolutional neural networks, multi-view geometry, self-supervised learning, 3d reconstruction.

0

0

0

0

1:02

14/06/2020

GPS-Net: Graph Property Sensing Network for Scene Graph Generation

Xin Lin, Changxing Ding, Jinquan Zeng, Dacheng Tao

Keywords Paper

scene graph generation, message passing, visual genome dataset, visual relationship detection, open images dataset

0

0

0

0

4:56

30/11/2020

Novel-View Human Action Synthesis

Mohamed Ilyes Lakhal, Davide Boscaini, Fabio Poiesi and
Oswald Lanz, Andrea Cavallaro

Keywords Paper

0

0

0

0

4:34

06/12/2021

Sparse Steerable Convolutions: An Efficient Learning of SE(3)-Equivariant Features for Estimation and Tracking of Object Poses in 3D Space

Jiehong Lin, Hongyang Li, Ke Chen and
Jiangbo Lu, Kui Jia

Keywords Paper

vision

0

0

0

0

12:29

14/06/2020

AugFPN: Improving Multi-Scale Feature Learning for Object Detection

Chaoxu Guo, Bin Fan, Qian Zhang and
Shiming Xiang, Chunhong Pan

Keywords Paper

object detection, augfpn, consistent supervision, residual feature augmentation, soft roi selection

0

0

0

0

1:00

06/12/2021

Manifold Topology Divergence: a Framework for Comparing Data Manifolds.

Serguei Barannikov, Ilya Trofimov, Grigorii Sotnikov and
Ekaterina Trimbach, Alexander Korotin, Alexander Filippov, Evgeny Burnaev

Keywords Paper

generative model

0

0

0

0

15:01

12/07/2020

Hypernetwork approach to generating point clouds

Przemysław Spurek, Sebastian Winczowski, Jacek Tabor and
Maciej Zamorski, Maciej Zieba, Tomasz Trzcinski

Keywords Paper

Deep Learning - Generative Models and Autoencoders

0

0

0

0

13:04

06/12/2021

Learning High-Precision Bounding Box for Rotated Object Detection via Kullback-Leibler Divergence

Xue Yang, Xiaojiang Yang, Jirui Yang and
Qi Ming, Wentao Wang, Qi Tian, Junchi Yan

Keywords Paper

optimization, vision

0

0

0

0

14:14

22/11/2021

Monocular Arbitrary Moving Object Discovery and Segmentation

Michal Neoral, Jan Sochman, Jiri Matas

Keywords Paper

motion segmentation, instance motion segmentation

0

0

0

0

2:55

03/05/2021

Heating up decision boundaries: isocapacitory saturation, adversarial scenarios and generalization bounds

Bogdan Georgiev, Lukas Franken, Mayukh Mukherjee

Keywords Paper

generalization bounds, adversarial attacks/defenses, deep learning theory, curvature estimates, decision boundary geometry, Brownian motion

0

0

0

0

5:11

05/01/2021

MART: Motion-Aware Recurrent Neural Network for Robust Visual Tracking

Heng Fan, Haibin Ling

Keywords Paper

0

0

0

0

4:55

22/11/2021

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and
Oliver Wang, Kwang In Kim, James Tompkin

Keywords Paper

structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

0

0

0

0

2:49

14/06/2020

LiDAR-Based Online 3D Video Object Detection With Graph-Based Message Passing and Spatiotemporal Transformer Attention

Junbo Yin, Jianbing Shen, Chenye Guan and
Dingfu Zhou, Ruigang Yang

Keywords Paper

3d object detection, point cloud, video, graph, attention, autonomous driving

0

0

0

0

1:02

14/06/2020

Smooth Shells: Multi-Scale Shape Registration With Functional Maps

Marvin Eisenberger, Zorah Lähner, Daniel Cremers

Keywords Paper

shape correspondence, functional maps, shape registration, non-rigid correspondence, interclass matching

0

0

0

0

4:56

14/06/2020

Joint Spatial-Temporal Optimization for Stereo 3D Object Tracking

Peiliang Li, Jieqi Shi, Shaojie Shen

Keywords Paper

3d object tracking, stereo cameras, autonomous driving

0

0

0

0

1:01

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao and
Jakob Verbeek, Juho Kannala

Keywords Paper

visual localization, camera relocalization, scene coordinate regression

0

0

0

0

1:01

06/12/2021

Voxel-based 3D Detection and Reconstruction of Multiple Objects from a Single Image

Feng Liu, Xiaoming Liu

Keywords Paper

vision

0

0

0

0

9:19

14/06/2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

Yancheng Wang, Yang Xiao, Fu Xiong and
Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Keywords Paper

3d action recognition, point cloud, 3d motion, temporal rank pooling, pointnet++, multi-stream network

0

0

0

0

1:01

14/06/2020

Neural Point Cloud Rendering via Multi-Plane Projection

Peng Dai, Yinda Zhang, Zhuwen Li and
Shuaicheng Liu, Bing Zeng

Keywords Paper

neural rendering, point cloud, multi-plane representation, novel view synthesis

0

0

0

0

1:00

05/01/2021

IGSSTRCF: Importance Guided Sparse Spatio-Temporal Regularized Correlation Filters for Tracking

Monika Jain, A. V. Subramanyam, Simon Denman and
Sridha Sridharan, Clinton Fookes

Keywords Paper

0

0

0

0

4:25

06/12/2021

3DP3: 3D Scene Perception via Probabilistic Programming

Nishad Gothoskar, Marco Cusumano-Towner, Ben Zinberg and
Matin Ghavamizadeh, Falk Pollok, Austin Garrett, Josh Tenenbaum, Dan Gutfreund, Vikash Mansinghka

Keywords Paper

deep learning, vision, generative model, graph learning

0

0

0

0

6:16