Detailed 2D-3D Joint Representation for Human-Object Interaction

Abstract: Human-Object Interaction (HOI) detection lies at the core of action understanding. Besides 2D information such as human/object appearance and locations, 3D pose is also usually utilized in HOI learning since its view-independence. However, rough 3D body joints just carry sparse body information and are not sufficient to understand complex interactions. Thus, we need detailed 3D body shape to go further. Meanwhile, the interacted object in 3D is also not fully studied in HOI learning. In light of these, we propose a detailed 2D-3D joint representation learning method. First, we utilize the single-view human body capture method to obtain detailed 3D body, face and hand shapes. Next, we estimate the 3D object location and size with reference to the 2D human-object spatial configuration and object category priors. Finally, a joint learning framework and cross-modal consistency tasks are proposed to learn the joint HOI representation. To better evaluate the 2D ambiguity processing capacity of models, we propose a new benchmark named Ambiguous-HOI consisting of hard ambiguous images. Extensive experiments in large-scale HOI benchmark and Ambiguous-HOI show impressive effectiveness of our method. Code and data are available at https://github.com/DirtyHarryLYL/DJ-RN.

Detailed 2D-3D Joint Representation for Human-Object Interaction

Yong-Lu Li, Xinpeng Liu, Han Lu, Shiyi Wang, Junqi Liu, Jiefeng Li, Cewu Lu

Comments

Similar Papers

Learning Deep Network for Detecting 3D Object Keypoints and 6D Poses

Wanqing Zhao, Shaobo Zhang, Ziyu Guan and Wei Zhao, Jinye Peng, Jianping Fan

Keywords Abstract Paper

6d pose estimation, keypoints detection, relative pose, object detection, deep learning, computer vision, multi-task learning, metric learning, multi-view learning, epipolar geometry

Anatomy and Geometry Constrained One-Stage Framework for 3D Human Pose Estimation

Xin Cao, Xu Zhao

Keywords Abstract Paper

PC-HMR: Pose Calibration for 3D Human Mesh Recovery from 2D Images/Videos

Tianyu Luan, Yali Wang, Junhao Zhang and Zhe Wang, Zhipeng Zhou, Yu Qiao

Keywords Abstract Paper

Three-Dimensional Reconstruction of Human Interactions

Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata and Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

Keywords Abstract Paper

human, interaction, 3d, reconstruction, contact, dataset, pose, shape, body, person

BiHand: Recovering Hand Mesh with Multi-stage Bisected Hourglass Networks

Lixin Yang, Jiasen Li, Wenqiang Xu and Yiqun Diao, Cewu Lu

Keywords Abstract Paper

hand pose estimation, 3d hand, hand reconstruction, hand mesh

LatentFusion: End-to-End Differentiable Reconstruction and Rendering for Unseen Object Pose Estimation

Keunhong Park, Arsalan Mousavian, Yu Xiang, Dieter Fox

Keywords Abstract Paper

pose estimation, pose, neural rendering, zero-shot, shape learning, 3d reconstruction, datasets, generative models, multi-view, robotics

Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos

Liangjian Chen, Shih-Yao Lin, Yusheng Xie and Yen-Yu Lin, Xiaohui Xie

Keywords Abstract Paper

Novel Object Viewpoint Estimation Through Reconstruction Alignment

Mohamed El Banani, Jason J. Corso, David F. Fouhey

Keywords Abstract Paper

viewpoint estimation, geometry-aware, alignment, reconstruction, 3d, cross-dataset, generalization

Unsupervised Learning of Visual 3D Keypoints for Control

Boyuan Chen, Pieter Abbeel, Deepak Pathak

Keywords Abstract Paper

Reinforcement Learning and Planning, Deep RL

Explicit Residual Descent for 3D Human Pose Estimation from 2D Joint Locations

Yangyuxuan Kang, Anbang Yao, Shandong Wang and Ming Lu, Yurong Chen, Enhua Wu

Keywords Abstract Paper

3D human pose estimation, pose lifting network, feedback optimization, deep neural network, supervised learning

Active 3D Shape Reconstruction from Vision and Touch

Edward Smith, David Meger, Luis Pineda and Roberto Calandra, Jitendra Malik, Adriana Romero Soriano, Michal Drozdzal

Keywords Abstract Paper

deep learning, reinforcement learning and planning

Implicit Functions in Feature Space for 3D Shape Reconstruction and Completion

Julian Chibane, Thiemo Alldieck, Gerard Pons-Moll

Keywords Abstract Paper

shape reconstruction, shape completion, implicit function learning, 3d scene understanding, single-view reconstruction, point cloud completion, voxel super-resolution, representation learning, surface reconstruction, 3d vision

GaussiGAN: Controllable Image Synthesis with 3D Gaussians from Unposed Silhouettes

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and Oliver Wang, Kwang In Kim, James Tompkin

Keywords Abstract Paper

structured representation, 3D representation, 3D Gaussians, image generation, image synthesis, image editing, controlled generation, GANs

Amodal 3D Reconstruction for Robotic Manipulation via Stability and Connectivity

William Agnew, Christopher Xie, Aaron Walsman and Octavian Murad, Yubo Wang, Pedro Domingos, Siddhartha Srinivasa

Keywords Abstract Paper

Self-Supervised 3D Human Pose Estimation via Part Guided Novel Image Synthesis

Jogendra Nath Kundu, Siddharth Seth, Varun Jampani and Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty

Keywords Abstract Paper

3d human pose estimation, self-supervised learning, disentangling factors of variation, human puppet model, pose transfer, novel view synthesis, human part segmentation

Leveraging Photometric Consistency Over Time for Sparsely Supervised Hand-Object Reconstruction

Yana Hasson, Bugra Tekin, Federica Bogo and Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Abstract Paper

hand-object reconstruction, pose estimation, object manipulation, photometric consistency, self-supervised learning, hands, objects, 3d reconstruction, manipulation

Learning Complex 3D Human Self-Contact

Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata and Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

Keywords Abstract Paper

Unsupervised View-Invariant Human Posture Representation

Faegheh Sardari, Bjorn Ommer, Majid Mirmehdi

Keywords Abstract Paper

Object-Occluded Human Shape and Pose Estimation From a Single Color Image

Tianshu Zhang, Buzhen Huang, Yangang Wang

Keywords Abstract Paper

human shape and pose estimation, occlusion, 3d human dataset, representation for 3d human

Epipolar Transformers

Yihui He, Rui Yan, Katerina Fragkiadaki, Shoou-I Yu

Keywords Abstract Paper

keypoint detection, pose estimation, epipolar geometry, attention, transformer, non-local networks

3DV: 3D Dynamic Voxel for Action Recognition in Depth Video

Yancheng Wang, Yang Xiao, Fu Xiong and Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Wanqing Zhao, Shaobo Zhang, Ziyu Guan and
Wei Zhao, Jinye Peng, Jianping Fan

Keywords Paper

Keywords Paper

Tianyu Luan, Yali Wang, Junhao Zhang and
Zhe Wang, Zhipeng Zhou, Yu Qiao

Keywords Paper

Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata and
Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

Keywords Paper

Lixin Yang, Jiasen Li, Wenqiang Xu and
Yiqun Diao, Cewu Lu

Keywords Paper

Keywords Paper

Liangjian Chen, Shih-Yao Lin, Yusheng Xie and
Yen-Yu Lin, Xiaohui Xie

Keywords Paper

Keywords Paper

Keywords Paper

Yangyuxuan Kang, Anbang Yao, Shandong Wang and
Ming Lu, Yurong Chen, Enhua Wu

Keywords Paper

Edward Smith, David Meger, Luis Pineda and
Roberto Calandra, Jitendra Malik, Adriana Romero Soriano, Michal Drozdzal

Keywords Paper

Keywords Paper

Youssef Alami Mejjati, Isa Milefchik, Aaron K Gokaslan and
Oliver Wang, Kwang In Kim, James Tompkin

Keywords Paper

William Agnew, Christopher Xie, Aaron Walsman and
Octavian Murad, Yubo Wang, Pedro Domingos, Siddhartha Srinivasa

Keywords Paper

Jogendra Nath Kundu, Siddharth Seth, Varun Jampani and
Mugalodi Rakesh, R. Venkatesh Babu, Anirban Chakraborty

Keywords Paper

Yana Hasson, Bugra Tekin, Federica Bogo and
Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Paper

Mihai Fieraru, Mihai Zanfir, Elisabeta Oneata and
Alin-Ionut Popa, Vlad Olaru, Cristian Sminchisescu

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Yancheng Wang, Yang Xiao, Fu Xiong and
Wenxiang Jiang, Zhiguo Cao, Joey Tianyi Zhou, Junsong Yuan

Keywords Paper

Yunjie Wu, Zhengxing Sun, Youcheng Song and
Yunhan Sun, YiJie Zhong, Jinlong Shi

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Huan Fu, Shunming Li, Rongfei Jia and
Mingming Gong, Binqiang Zhao, Dacheng Tao

Keywords Paper

Shichao Li, Lei Ke, Kevin Pratama and
Yu-Wing Tai, Chi-Keung Tang, Kwang-Ting Cheng

Keywords Paper

Jingwei Xu, Zhenbo Yu, Bingbing Ni and
Jiancheng Yang, Xiaokang Yang, Wenjun Zhang

Keywords Paper

Keywords Paper

Yan Zhang, Mohamed Hassan, Heiko Neumann and
Michael J. Black, Siyu Tang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Riccardo Spezialetti, Federico Stella, Marlon Marcon and
Luciano Silva, Samuele Salti, Luigi Di Stefano

Keywords Paper

Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov and
David Barker, Rugile Pevceviciute, Thomas Roth ̈orl, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

Keywords Paper