Lightweight Multi-View 3D Pose Estimation Through Camera-Disentangled Representation

14/06/2020

Edoardo Remelli, Shangchen Han, Sina Honari, Pascal Fua, Robert Wang

Keywords: 3d pose estimation, multi-view fusion, disentangled representation learning, direct linear triangulation

Abstract: We present a lightweight solution to recover 3D pose from multi-view images captured with spatially calibrated cameras. Building upon recent advances in interpretable representation learning, we exploit 3D geometry to fuse input images into a unified latent representation of pose, which is disentangled from camera viewpoints. This allows us to reason effectively about 3D pose across different views without using compute-intensive volumetric grids. Our architecture then conditions the learned representation on camera projection operators to produce accurate per-view 2D detections, which can be simply lifted to 3D via a differentiable Direct Linear Transform (DLT) layer. To do so efficiently, we propose a novel implementation of DLT that is orders of magnitude faster on GPU architectures than standard SVD-based triangulation methods. We evaluate our approach on two large-scale human pose datasets (H36M and Total Capture): our method outperforms or performs comparably to state-of-the-art volumetric methods while, unlike them, yielding real-time performance.
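
The lifting step mentioned in the abstract (per-view 2D detections plus known camera projections giving one 3D point) can be illustrated with a small sketch. This is not the authors' GPU implementation, and it is not SVD-based either: it is an inhomogeneous least-squares variant of DLT that fixes the homogeneous coordinate to 1 and solves the resulting 3x3 normal equations in pure Python. The function names `project`, `solve3`, and `triangulate_dlt`, the toy camera matrices, and the test point are all assumptions made for illustration.

```python
def project(P, X):
    """Project a 3D point X = (x, y, z) with a 3x4 camera matrix P to pixel (u, v)."""
    xh = (X[0], X[1], X[2], 1.0)
    u, v, w = (sum(P[r][c] * xh[c] for c in range(4)) for r in range(3))
    return (u / w, v / w)


def solve3(M, b):
    """Solve the 3x3 system M x = b by Gaussian elimination with partial pivoting."""
    A = [list(M[i]) + [b[i]] for i in range(3)]  # augmented matrix [M | b]
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(A[r][col]))
        if abs(A[pivot][col]) < 1e-12:
            raise ValueError("degenerate camera configuration")
        A[col], A[pivot] = A[pivot], A[col]
        for r in range(col + 1, 3):
            f = A[r][col] / A[col][col]
            for c in range(col, 4):
                A[r][c] -= f * A[col][c]
    x = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):  # back substitution
        s = A[i][3] - sum(A[i][j] * x[j] for j in range(i + 1, 3))
        x[i] = s / A[i][i]
    return x


def triangulate_dlt(cameras, points_2d):
    """Triangulate one 3D point from two or more calibrated views.

    Each view with projection P and detection (u, v) contributes two rows,
    u * P[2] - P[0] and v * P[2] - P[1], of a linear system A [x, y, z, 1]^T = 0.
    """
    rows = []
    for P, (u, v) in zip(cameras, points_2d):
        rows.append([u * P[2][c] - P[0][c] for c in range(4)])
        rows.append([v * P[2][c] - P[1][c] for c in range(4)])
    # Move the constant column to the right-hand side and form the
    # normal equations (A3^T A3) x = -A3^T a4 for the first three columns A3.
    AtA = [[sum(r[i] * r[j] for r in rows) for j in range(3)] for i in range(3)]
    Atb = [-sum(r[i] * r[3] for r in rows) for i in range(3)]
    return solve3(AtA, Atb)


# Toy setup (assumed): two identity-intrinsics cameras, one unit apart along x.
cams = [
    [[1.0, 0.0, 0.0, 0.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]],
    [[1.0, 0.0, 0.0, -1.0], [0.0, 1.0, 0.0, 0.0], [0.0, 0.0, 1.0, 0.0]],
]
X_true = (0.5, 0.2, 4.0)
detections = [project(P, X_true) for P in cams]
X_hat = triangulate_dlt(cams, detections)
print(X_hat)
```

With noisy detections the same call returns a least-squares estimate. Fixing the homogeneous coordinate to 1 breaks down for points at or near infinity, which is one reason the homogeneous SVD formulation is the more common baseline.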

The talk and the corresponding paper were published at the CVPR 2020 virtual conference.

Similar Papers

14/06/2020

What You See is What You Get: Exploiting Visibility for 3D Object Detection

Peiyun Hu, Jason Ziglar, David Held, Deva Ramanan

Keywords: freespace reasoning, 3d object detection, lidar processing, autonomous driving

5:01

06/12/2021

A Shading-Guided Generative Implicit Model for Shape-Accurate 3D-Aware Image Synthesis

Xingang Pan, Xudong Xu, Chen Change Loy, Christian Theobalt, Bo Dai

Keywords: generative model

11:15

05/01/2021

SMPLpix: Neural Avatars From 3D Human Models

Sergey Prokudin, Michael J. Black, Javier Romero

4:55

06/12/2021

Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

Denys Rozumnyi, Martin R. Oswald, Vittorio Ferrari, Marc Pollefeys

Keywords: optimization

10:44

14/06/2020

PVN3D: A Deep Point-Wise 3D Keypoints Voting Network for 6DoF Pose Estimation

Yisheng He, Wei Sun, Haibin Huang, Jianran Liu, Haoqiang Fan, Jian Sun

Keywords: 6d pose estimation, 3d instance segmentation, 3d semantic segmentation, 3d keypoint, 3d scene understanding, vision for robotics, 3d single view, rgbd, 3d computer vision

1:01

22/11/2021

TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation

Haoyu Ma, Liangjian Chen, Deying Kong, Zhe Wang, Xingwei Liu, Hao Tang, Xiangyi Yan, Yusheng Xie, Shih-Yao Lin, Xiaohui Xie

Keywords: multi-view, 3D pose estimation, epipolar line, vision transformer

3:01

14/06/2020

Towards Unsupervised Learning of Generative Models for 3D Controllable Image Synthesis

Yiyi Liao, Katja Schwarz, Lars Mescheder, Andreas Geiger

Keywords: image synthesis, generative adversarial network, 3d controllability, unsupervised learning, 3d representation, disentangled representation, differentiable rendering, neural rendering

1:01

14/06/2020

Self-Supervised Monocular Scene Flow Estimation

Junhwa Hur, Stefan Roth

Keywords: monocular scene flow, self-supervised learning, 3d scene flow, optical flow, monocular depth estimation

5:00

17/08/2020

Consistent video depth estimation

Xuan Luo, Jia-Bin Huang, Richard Szeliski, Kevin Matzen, Johannes Kopf

Keywords: video, depth estimation

12:43

14/06/2020

3D Photography Using Context-Aware Layered Depth Inpainting

Meng-Li Shih, Shih-Yang Su, Johannes Kopf, Jia-Bin Huang

Keywords: computational photography, novel view synthesis

1:01

14/06/2020

OccuSeg: Occupancy-Aware 3D Instance Segmentation

Lei Han, Tian Zheng, Lan Xu, Lu Fang

Keywords: instance segmentation, multi-task learning, occupancy

1:02

07/09/2020

Making a Case for 3D Convolutions for Object Segmentation in Videos

Sabarinath Mahadevan, Ali Athar, Aljosa Osep, Laura Leal-Taixé, Bastian Leibe, Sebastian Hennen

Keywords: object tracking, video segmentation, video object segmentation, video scene understanding, object segmentation

8:16

06/12/2021

Generative Occupancy Fields for 3D Surface-Aware Image Synthesis

Xudong Xu, Xingang Pan, Dahua Lin, Bo Dai

Keywords: generative model

6:47

17/08/2020

Radiative backpropagation: An adjoint method for lightning-fast differentiable rendering

Merlin Nimier-David, Sébastien Speierer, Benoît Ruiz, Wenzel Jakob

Keywords: ray tracing, global illumination, differentiable rendering

17:54

06/12/2021

Sparse Steerable Convolutions: An Efficient Learning of SE(3)-Equivariant Features for Estimation and Tracking of Object Poses in 3D Space

Jiehong Lin, Hongyang Li, Ke Chen, Jiangbo Lu, Kui Jia

Keywords: vision

12:29

14/06/2020

Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction

Ruixu Liu, Ju Shen, He Wang, Chen Chen, Sen-ching Cheung, Vijayan Asari

Keywords: 3d human pose, attention mechanism, multi-scale dilation convolution, monocular motion reconstruction

5:01

14/06/2020

SDFDiff: Differentiable Rendering of Signed Distance Fields for 3D Shape Optimization

Yue Jiang, Dantong Ji, Zhizhong Han, Matthias Zwicker

Keywords: differentiable rendering, signed distance field, image-based 3d reconstruction, 3d shape optimization, deep learning, inverse graphics

5:01

14/06/2020

Deep 3D Capture: Geometry and Reflectance From Sparse Multi-View Images

Sai Bi, Zexiang Xu, Kalyan Sunkavalli, David Kriegman, Ravi Ramamoorthi

Keywords: appearance acquisition, 3d reconstruction, multi-view stereo

1:01

14/06/2020

Joint Texture and Geometry Optimization for RGB-D Reconstruction

Yanping Fu, Qingan Yan, Jie Liao, Chunxia Xiao

Keywords: rgb-d reconstruction, 3d reconstruction, texture optimization, geometry optimization, joint texture and geometry optimization

0:57

14/06/2020

RoutedFusion: Learning Real-Time Depth Map Fusion

Silvan Weder, Johannes Schönberger, Marc Pollefeys, Martin R. Oswald

Keywords: depth map fusion, online 3d reconstruction, deep learning, real-time applications, 3d geometry

5:00

17/08/2020

One shot 3D photography

Johannes Kopf, Kevin Matzen, Suhib Alsisan, Ocean Quigley, Francis Ge, Yangming Chong, Josh Patterson, Jan-Michael Frahm, Shu Wu, Matthew Yu, Peizhao Zhang, Zijian He, Peter Vajda, Ayush Saraf, Michael Cohen

Keywords: 3D photography, depth estimation

15:00

14/06/2020

On Joint Estimation of Pose, Geometry and svBRDF From a Handheld Scanner

Carolin Schmitt, Simon Donné, Gernot Riegler, Vladlen Koltun, Andreas Geiger

Keywords: 3d reconstruction, mobile lightstage, multiview photometric stereo, svbrdf estimation, shape from shading, material segmentation, handheld 3d sensor, non-lambertian surfaces

1:01

26/04/2020

Deep 3D Pan via Local adaptive "t-shaped" convolutions with global and local adaptive dilations

Juan Luis Gonzalez Bello, Munchurl Kim

Keywords: Deep learning, Stereoscopic view synthesis, Monocular depth, Deep 3D Pan

5:01

30/11/2020

D2D: Keypoint Extraction with Describe to Detect Approach

Yurun Tian, Vassileios Balntas, Tony Ng, Axel Barroso-Laguna, Yiannis Demiris, Krystian Mikolajczyk

4:34

14/06/2020

Cross-View Tracking for Multi-Human 3D Pose Estimation at Over 100 FPS

Long Chen, Haizhou Ai, Rui Chen, Zijie Zhuang, Shuang Liu

Keywords: 3d pose estimation, multi-view, multi-human, cross-view tracking, triangulation

1:01

06/12/2021

Panoptic 3D Scene Reconstruction From a Single RGB Image

Manuel Dahnert, Ji Hou, Matthias Niessner, Angela Dai

Keywords: vision

6:06

14/06/2020

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

Yancheng Wang, Yang Xiao, Fu Xiong, Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Keywords: 3d action recognition, point cloud, 3d motion, temporal rank pooling, pointnet++, multi-stream network

1:01

14/06/2020

Plug-and-Play Algorithms for Large-Scale Snapshot Compressive Imaging

Xin Yuan, Yang Liu, Jinli Suo, Qionghai Dai

Keywords: snapshot compressive image, plug-and-play, large-scale, video compressive sensing, convergence, coded aperture compressive temporal imaging (cacti), gap, admm, real data

5:01

22/11/2021

Unsupervised View-Invariant Human Posture Representation

Faegheh Sardari, Bjorn Ommer, Majid Mirmehdi

Keywords: Representation Learning, Self-supervised Learning, Unsupervised 3D Pose Estimation, View-Invariant Pose Estimation, View-Invariant Action Recognition, View-Invariant Action Assessment, View-Invariant Human Movement Assessment, Human Posture Representation, Unsupervised Action Recognition, Unsupervised Action Assessment

2:59

14/06/2020

PV-RCNN: Point-Voxel Feature Set Abstraction for 3D Object Detection

Shaoshuai Shi, Chaoxu Guo, Li Jiang, Zhe Wang, Jianping Shi, Xiaogang Wang, Hongsheng Li

Keywords: 3d object detection, point cloud, 3d scene understanding, lidar, autonomous driving, kitti dataset, waymo open dataset

1:01

14/06/2020

Novel View Synthesis of Dynamic Scenes With Globally Coherent Depths From a Monocular Camera

Jae Shin Yoon, Kihwan Kim, Orazio Gallo, Hyun Soo Park, Jan Kautz

Keywords: view synthesis, depth estimation, dynamic scene, depth fusion, globally coherent depth, monocular camera

1:00

17/08/2020

Langevin monte carlo rendering with gradient-based adaptation

Fujun Luan, Shuang Zhao, Kavita Bala, Ioannis Gkioulekas

Keywords: global illumination, langevin Monte Carlo, photorealistic rendering

17:36

14/06/2020

Hierarchical Scene Coordinate Classification and Regression for Visual Localization

Xiaotian Li, Shuzhe Wang, Yi Zhao, Jakob Verbeek, Juho Kannala

Keywords: visual localization, camera relocalization, scene coordinate regression

1:01

22/11/2021

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan, Oliver Wang, Kwang In Kim, James Tompkin

Keywords: structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

2:49

02/02/2021

Self-Paced Two-dimensional PCA

Jiangxin Li, Zhao Kang, Chong Peng, Wenyu Chen

17:01

22/11/2021

Towards Monocular Shape from Refraction

Antonin Sulc, Imari Sato, Bastian Goldluecke, Tali Treibitz

Keywords: shape from x, refraction, optimisation

8:27

06/12/2020

An Unsupervised Information-Theoretic Perceptual Quality Metric

Sangnie Bhardwaj, Ian Fischer, Johannes Ballé, Troy Chinen

3:08

02/02/2021

Graph and Temporal Convolutional Networks for 3D Multi-person Pose Estimation in Monocular Videos

Yu Cheng, Bo Wang, Bo Yang, Robby T. Tan

15:02

14/06/2020

HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation From a Single Depth Map

Jameel Malik, Ibrahim Abdelaziz, Ahmed Elhayek, Soshi Shimada, Sk Aziz Ali, Vladislav Golyanik, Christian Theobalt, Didier Stricker

Keywords: 3d computer vision, 3d pose, 3d shape, depth image, convolutional neural network, voxelized hand shape, hand surface, depth map synthesizers, weak supervision, shape registration

1:01

22/11/2021

Learning to Deblur and Rotate Motion-Blurred Faces

Givi Meishvili, Attila Szabo, Simon Jenni, Paolo Favaro

Keywords: deblurring, face, multi-view, video, blur, GAN, novel view synthesis, inversion, deep learning, dataset

2:50