Learning Canonical Shape Space for Category-Level 6D Object Pose and Size Estimation

Abstract: We present a novel approach to category-level 6D object pose and size estimation. To tackle intra-class shape variations, we learn canonical shape space (CASS), a unified representation for a large variety of instances of a certain object category. In particular, CASS is modeled as the latent space of a deep generative model of canonical 3D shapes with normalized pose. We train a variational auto-encoder (VAE) for generating 3D point clouds in the canonical space from an RGBD image. The VAE is trained in a cross-category fashion, exploiting the publicly available large 3D shape repositories. Since the 3D point cloud is generated in normalized pose (with actual size), the encoder of the VAE learns view-factorized RGBD embedding. It maps an RGBD image in arbitrary view into a poseindependent 3D shape representation. Object pose is then estimated via contrasting it with a pose-dependent feature of the input RGBD extracted with a separate deep neural networks. We integrate the learning of CASS and pose and size estimation into an end-to-end trainable network, achieving the state-of-the-art performance.

14/06/2020

FroDO: From Detections to 3D Objects

Martin Rünz, Kejie Li, Meng Tang and
Lingni Ma, Chen Kong, Tanner Schmidt, Ian Reid, Lourdes Agapito, Julian Straub, Steven Lovegrove, Richard Newcombe

Keywords Paper

reconstruction, shape embedding, 3d vision, object detection, shape prior, object representation, monocular, sdf, pointcloud, inference

1:01

06/12/2020

Serguei Barannikov, Ilya Trofimov, Grigorii Sotnikov and
Ekaterina Trimbach, Alexander Korotin, Alexander Filippov, Evgeny Burnaev

structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

2:49

12/07/2020

category level pose estimation, articulated object, 3d vision, point cloud, object part, object joint, segmentation, kinematic constraints

5:00

30/11/2020

3d reconstruction, 3d human pose and shape estimation, point clouds, depth sensor, sequential modeling, spatial-temporal features, mesh convolution, attention model, weakly-supervised fine-tuning, deep learning

1:00

14/06/2020

few-shot classification, meta learning, few-shot learning, classification, metric learning, convex, optimization, image retrieval

5:00

26/04/2020

instance segmentation, point cloud, 3d, scene understanding, indoor scenes, bottom-up, grouping, dual-set, scannet, s3dis

5:01

14/06/2020

6d pose estimation, keypoints detection, relative pose, object detection, deep learning, computer vision, multi-task learning, metric learning, multi-view learning, epipolar geometry

1:01

19/08/2021

single image view synthesis, view synthesis, differentiable rendering, point cloud, convolutional neural networks, generative networks

4:58

02/02/2021