Understanding Human Hands in Contact at Internet Scale

Abstract: Hands are the central means by which humans manipulate their world and being able to reliably extract hand state information from Internet videos of humans engaged in their hands has the potential to pave the way to systems that can learn from petabytes of video data. This paper proposes steps towards this by inferring a rich representation of hands engaged in interaction method that includes: hand location, side, contact state, and a box around the object in contact. To support this effort, we gather a large-scale dataset of hands in contact with objects consisting of 131 days of footage as well as a 100K annotated hand-contact video frame dataset. The learned model on this dataset can serve as a foundation for hand-contact understanding in videos. We quantitatively evaluate it both on its own and in service of predicting and learning from 3D meshes of human hands.

Understanding Human Hands in Contact at Internet Scale

Dandan Shan, Jiaqi Geng, Michelle Shu, David F. Fouhey

Comments

Similar Papers

GanHand: Predicting Human Grasp Affordances in Multi-Object Scenes

Enric Corona, Albert Pumarola, Guillem Alenyà and Francesc Moreno-Noguer, Grégory Rogez

Keywords Abstract Paper

generative model, grasp prediction, generative model, large-scale dataset, hand pose estimation, hand shape estimation, affordances, augmented reality, robotics, human-object interaction

COHESIV: Contrastive Object and Hand Embedding Segmentation In Video

Dandan Shan, Richard Higgins, David Fouhey

Keywords Abstract Paper

deep learning, contrastive learning

Leveraging Photometric Consistency Over Time for Sparsely Supervised Hand-Object Reconstruction

Yana Hasson, Bugra Tekin, Federica Bogo and Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Abstract Paper

hand-object reconstruction, pose estimation, object manipulation, photometric consistency, self-supervised learning, hands, objects, 3d reconstruction, manipulation

Learning Object Manipulation Skills via Approximate State Estimation from Real Videos

Vladimír Petrík, Makarand Tapaswi, Ivan Laptev, Josef Sivic

Keywords Abstract Paper

Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild

Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos and Michael M. Bronstein, Stefanos Zafeiriou

Keywords Abstract Paper

hand pose estimation, hand reconstruction, mesh reconstruction, geometric deep learning, graph neural networks, weak supervision

Learning Predictive Representations for Deformable Objects Using Contrastive Estimation

Wilson Yan, Ashwin Vangipuram, Pieter Abbeel, Lerrel Pinto

Keywords Abstract Paper

Learning rich touch representations through cross-modal self-supervision

Martina Zambelli, Yusuf Aytar, Francesco Visin and Yuxiang Zhou, Raia Hadsell

Keywords Abstract Paper

RealSmileNet: A Deep End-To-End Network for Spontaneous and Posed Smile Recognition

Yan Yang, Md Zakir Hossain, Tom Gedeon, Shafin Rahman

Keywords Abstract Paper

Towards Holistic Real-time Human 3D Pose Estimation using MocapNETs

Ammar Qammaz, Antonis A Argyros

Keywords Abstract Paper

3d hand pose estimation, 3d body pose estimation, 3d pose estimation, human perception, ensemble, BVH, hierarchical coordinate descent, eNSRM, RGB, mocap

Modeling Long-horizon Tasks as Sequential Interaction Landscapes

Soeren Pirk, Karol Hausman, Alexander Toshev, Mohi Khansari

Keywords Abstract Paper

Unsupervised Co-part Segmentation through Assembly

Qingzhe Gao, Bin Wang, Libin Liu, Baoquan Chen

Keywords Abstract Paper

Applications, Computer Vision

Attention Distillation for Learning Video Representations

Miao Liu, Xin Chen, Yun Zhang and Yin Li, James Rehg

Keywords Abstract Paper

Action Recognition, Deep Learning, Representation Learning

Scene Restoring for Narrative Machine Reading Comprehension

Zhixing Tian, Yuanzhe Zhang, Kang Liu and Jun Zhao, Yantao Jia, Zhicheng Sheng

Keywords Abstract Paper

machine comprehension, graph network, graph gdin, gdin

Feedback-guided Attributed Graph Embedding for Relevant Video Recommendation

Taofeng Xue, Xinzhou Dong, Wei Zhuo and Beihong Jin, He Chen, Wenhai Pan, Beibei Li, Xuejian Zhang

Keywords Abstract Paper

recommender system, representation learning, graph embedding, user behavior mining

Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning

Zhenfang Chen, Jiayuan Mao, Jiajun Wu and Kwan-Yee K Wong, Joshua B Tenenbaum, Chuang Gan

Keywords Abstract Paper

Visual Reasoning, Video Reasoning, Neuro-Symbolic Learning, Concept Learning

Explicit Knowledge Distillation for 3D Hand Pose Estimation from Monocular RGB

Yumeng Zhang, Li Chen, Yufeng Liu and Wen Zheng, JunHai Yong

Keywords Abstract Paper

3D hand pose estimation, knowledge distillation

HOPE-Net: A Graph-Based Model for Hand-Object Pose Estimation

Bardia Doosti, Shujon Naha, Majid Mirbagheri, David J. Crandall

Keywords Abstract Paper

hand pose estimation, object pose estimation, graph convolution, adaptive graph convolution, trainable pooling, trainable unpooling, graph u-net, 2d pose, 3d pose, resnet10

Hand-Object Contact Prediction via Motion-Based Pseudo-Labeling and Guided Progressive Label Correction

Takuma Yagi, Md Tasnimul Hasan, Yoichi Sato

Keywords Abstract Paper

hand-object interaction, contact prediction, learning from noisy labels, label correction, pseudo-labeling, first-person vision, egocentric vision

HandTailor: Towards High-Precision Monocular 3D Hand Recovery

Jun Lv, Wenqiang Xu, Lixin Yang and Sucheng Qian, Chongzhao Mao, Cewu Lu

Keywords Abstract Paper

hand pose estimation, hand shape reconstruction

Reinforcement Learning with Videos: Combining Offline Observations with Interaction

Karl Schmeckpeper, Oleh Rybkin, Kostas Daniilidis and Sergey Levine, Chelsea Finn

Keywords Abstract Paper

Detecting Hands and Recognizing Physical Contact in the Wild

Supreeth Narasimhaswamy, Trung Nguyen, Minh Hoai Nguyen

Enric Corona, Albert Pumarola, Guillem Alenyà and
Francesc Moreno-Noguer, Grégory Rogez

Keywords Paper

Keywords Paper

Yana Hasson, Bugra Tekin, Federica Bogo and
Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Paper

Keywords Paper

Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos and
Michael M. Bronstein, Stefanos Zafeiriou

Keywords Paper

Keywords Paper

Martina Zambelli, Yusuf Aytar, Francesco Visin and
Yuxiang Zhou, Raia Hadsell

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Miao Liu, Xin Chen, Yun Zhang and
Yin Li, James Rehg

Keywords Paper

Zhixing Tian, Yuanzhe Zhang, Kang Liu and
Jun Zhao, Yantao Jia, Zhicheng Sheng

Keywords Paper

Taofeng Xue, Xinzhou Dong, Wei Zhuo and
Beihong Jin, He Chen, Wenhai Pan, Beibei Li, Xuejian Zhang

Keywords Paper

Zhenfang Chen, Jiayuan Mao, Jiajun Wu and
Kwan-Yee K Wong, Joshua B Tenenbaum, Chuang Gan

Keywords Paper

Yumeng Zhang, Li Chen, Yufeng Liu and
Wen Zheng, JunHai Yong

Keywords Paper

Keywords Paper

Keywords Paper

Jun Lv, Wenqiang Xu, Lixin Yang and
Sucheng Qian, Chongzhao Mao, Cewu Lu

Keywords Paper

Karl Schmeckpeper, Oleh Rybkin, Kostas Daniilidis and
Sergey Levine, Chelsea Finn

Keywords Paper

Keywords Paper

Seung Wook Kim, Yuhao Zhou, Jonah Philion and
Antonio Torralba, Sanja Fidler

Keywords Paper

Keywords Paper

Yizhak Ben-Shabat, Xin Yu, Fatemeh Saleh and
Dylan Campbell, Cristian Rodriguez-Opazo, Hongdong Li, Stephen Gould

Keywords Paper

Keywords Paper

Jean-Baptiste Alayrac, Adria Recasens, Rosalia Schneider and
Relja Arandjelović, Jason Ramapuram, Jeffrey De Fauw, Lucas Smaira, Sander Dieleman, Andrew Zisserman

Keywords Paper

Jannik Kossen, Karl Stelzner, Marcel Hussing and
Claas Voelcker, Kristian Kersting

Keywords Paper

Miao Liao, Sibo Zhang, Peng Wang and
Hao Zhu, Xinxin Zuo, Ruigang Yang

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Kiana Ehsani, Daniel Gordon, Thomas H Nguyen and
Roozbeh Mottaghi, Ali Farhadi

Keywords Paper

Kunpeng Li, Chen Fang, Zhaowen Wang and
Seokhwan Kim, Hailin Jin, Yun Fu

Keywords Paper

AJ Piergiovanni, Anelia Angelova, Michael S Ryoo (Google and
Stony Brook University), Irfan Essa

Keywords Paper

Yu Deng, Jiaolong Yang, Dong Chen and
Fang Wen, Xin Tong

Keywords Paper

Jiaying Liu, Jing Ren, Wenqing Zheng and
Lianhua Chi, Ivan Lee, Feng Xia

Keywords Paper