06/12/2021

ViSER: Video-Specific Surface Embeddings for Articulated 3D Shape Reconstruction

Gengshan Yang, Deqing Sun, Varun Jampani, Daniel Vlasic, Forrester Cole, Ce Liu, Deva Ramanan

Keywords:

Abstract: We introduce ViSER, a method for recovering articulated 3D shapes and dense 3D trajectories from monocular videos. Previous work on high-quality reconstruction of dynamic 3D shapes typically relies on multiple camera views, strong category-specific priors, or 2D keypoint supervision. We show that none of these are required if one can reliably estimate long-range correspondences in a video, making use of only 2D object masks and two-frame optical flow as inputs. ViSER infers correspondences by matching 2D pixels to a canonical, deformable 3D mesh via video-specific surface embeddings that capture the pixel appearance of each surface point. These embeddings behave as a continuous set of keypoint descriptors defined over the mesh surface, which can be used to establish dense long-range correspondences across pixels. The surface embeddings are implemented as coordinate-based MLPs that are fit to each video via consistency and contrastive reconstruction losses. Experimental results show that ViSER compares favorably against prior work on challenging videos of humans with loose clothing and unusual poses, as well as animal videos from DAVIS and YTVOS. Our code is available at viser-shape.github.io.
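To make the core idea concrete, the sketch below shows what a coordinate-based MLP surface embedding and a softmax matching between pixel features and surface embeddings could look like. This is a minimal illustration of the concepts named in the abstract, not the authors' released implementation: the class and function names, layer sizes, positional encoding, and temperature are all assumptions.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F


class SurfaceEmbedding(nn.Module):
    """Coordinate-based MLP mapping canonical 3D surface points to descriptors.

    Hypothetical sketch: the positional encoding, depth, and widths are
    assumptions chosen for illustration, not the paper's exact architecture.
    """

    def __init__(self, dim=16, hidden=256, num_freqs=6):
        super().__init__()
        self.num_freqs = num_freqs
        in_dim = 3 * 2 * num_freqs  # sin/cos encoding of (x, y, z)
        self.mlp = nn.Sequential(
            nn.Linear(in_dim, hidden), nn.ReLU(),
            nn.Linear(hidden, hidden), nn.ReLU(),
            nn.Linear(hidden, dim),
        )

    def positional_encoding(self, x):
        # x: (..., 3) canonical surface coordinates
        freqs = 2.0 ** torch.arange(self.num_freqs, dtype=x.dtype, device=x.device)
        ang = x[..., None] * freqs                      # (..., 3, F)
        enc = torch.cat([ang.sin(), ang.cos()], dim=-1)  # (..., 3, 2F)
        return enc.flatten(-2)                           # (..., 6F)

    def forward(self, x):
        emb = self.mlp(self.positional_encoding(x))
        # Unit-norm so embeddings act like continuous keypoint descriptors.
        return F.normalize(emb, dim=-1)


def soft_match(pixel_feats, surf_emb, temperature=0.1):
    """Soft assignment of pixels to surface points via scaled dot products.

    pixel_feats: (P, d) unit-norm features of P foreground pixels
    surf_emb:    (V, d) unit-norm embeddings of V sampled surface points
    Returns a (P, V) soft assignment; a cross-entropy on these scores
    against reprojection-consistent targets would give one plausible form
    of contrastive matching loss.
    """
    logits = pixel_feats @ surf_emb.t() / temperature
    return logits.softmax(dim=-1)
```

For example, `soft_match(feats, SurfaceEmbedding()(points))` yields, for each pixel, a distribution over mesh surface points; chaining these assignments across frames is one way such embeddings can induce long-range pixel correspondences.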


