First-Person View Hand Segmentation of Multi-Modal Hand Activity Video Dataset

Abstract: First-person-view videos of hands interacting with tools are widely used in the computer vision industry. However, creating a dataset with pixel-wise segmentation of hands is challenging since most videos are captured with fingertips occluded by the hand dorsum and grasped tools. Current methods often rely on manually segmenting hands to create annotations, which is inefficient and costly. To relieve this challenge, we create a method that utilizes thermal information of hands for efficient pixel-wise hand segmentation to create a multi-modal activity video dataset. Our method is not affected by fingertip and joint occlusions and does not require hand pose ground truth. We show our method to be 24 times faster than the traditional polygon labeling method while maintaining high quality. With the segmentation method, we propose a multi-modal hand activity video dataset with 790 sequences and 401,765 frames of "hands using tools" videos captured by thermal and RGB-D cameras with hand segmentation data. We analyze multiple models for hand segmentation performance and benchmark four segmentation networks. We show that our multi-modal dataset with fusing Long-Wave InfraRed~(LWIR) and RGB-D frames achieves 5% better hand IoU performance than using RGB frames.

First-Person View Hand Segmentation of Multi-Modal Hand Activity Video Dataset

Sangpil Kim, Hyung-gun Chi, Xiao Hu, Anirudh Vegesana, Karthik Ramani

Comments

Similar Papers

MEgATrack: Monochrome egocentric articulated hand-tracking for virtual reality

Shangchen Han, Beibei Liu, Randi Cabezas and Christopher D. Twigg, Peizhao Zhang, Jeff Petkau, Tsz-Ho Yu, Chun-Jung Tai, Muzaffer Akbay, Zheng Wang, Asaf Nitzan, Gang Dong, Yuting Ye, Lingling Tao, Chengde Wan, Robert Wang

Keywords Abstract Paper

hand tracking, virtual reality, motion capture

Two-Hand Global 3D Pose Estimation Using Monocular RGB

Fanqing Lin, Connor Wilhelm, Tony Martinez

Keywords Abstract Paper

MVHM: A Large-Scale Multi-View Hand Mesh Benchmark for Accurate 3D Hand Pose Estimation

Liangjian Chen, Shih-Yao Lin, Yusheng Xie and Yen-Yu Lin, Xiaohui Xie

Keywords Abstract Paper

Whose Hand Is This? Person Identification From Egocentric Hand Gestures

Satoshi Tsutsui, Yanwei Fu, David J. Crandall

Keywords Abstract Paper

Leveraging Photometric Consistency Over Time for Sparsely Supervised Hand-Object Reconstruction

Yana Hasson, Bugra Tekin, Federica Bogo and Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Abstract Paper

hand-object reconstruction, pose estimation, object manipulation, photometric consistency, self-supervised learning, hands, objects, 3d reconstruction, manipulation

XNect: Real-time multi-person 3D motion capture with a single RGB camera

Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller and Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt

Keywords Abstract Paper

human body pose, motion capture, real-time, RGB, monocular

Weakly-Supervised Domain Adaptation via GAN and Mesh Model for Estimating 3D Hand Poses Interacting Objects

Seungryul Baek, Kwang In Kim, Tae-Kyun Kim

Keywords Abstract Paper

hand pose estimation, 3d pose estimation, hand-object interaction, weak supervision, domain adaptation, generative adversarial network, 3d mesh model, mano, hand gesture recognition, data synthesis

HOnnotate: A Method for 3D Annotation of Hand and Object Poses

Shreyas Hampali, Mahdi Rad, Markus Oberweger, Vincent Lepetit

Keywords Abstract Paper

3d pose estimation, bayesian formulation, hand-object pose, joint-optimization, markerless dataset, single frame pose estimation

Temporal-Aware Self-Supervised Learning for 3D Hand Pose and Mesh Estimation in Videos

Liangjian Chen, Shih-Yao Lin, Yusheng Xie and Yen-Yu Lin, Xiaohui Xie

Keywords Abstract Paper

GIF Thumbnails: Attract More Clicks to Your Videos

Yi Xu, Fan Bai, Yingxuan Shi and Qiuyu Chen, Longwen Gao, Kai Tian, Shuigeng Zhou, Huyang Sun

Keywords Abstract Paper

Decoupled Representation Learning for Skeleton-Based Gesture Recognition

Jianbo Liu, Yongcheng Liu, Ying Wang and Véronique Prinet, Shiming Xiang, Chunhong Pan

Keywords Abstract Paper

gesture recognition, skeleton-based, decoupled, two-stream, 3d cnn

GanHand: Predicting Human Grasp Affordances in Multi-Object Scenes

Enric Corona, Albert Pumarola, Guillem Alenyà and Francesc Moreno-Noguer, Grégory Rogez

Keywords Abstract Paper

generative model, grasp prediction, generative model, large-scale dataset, hand pose estimation, hand shape estimation, affordances, augmented reality, robotics, human-object interaction

Speech2Video Synthesis with 3D Skeleton Regularization and Expressive Body Poses

Miao Liao, Sibo Zhang, Peng Wang and Hao Zhu, Xinxin Zuo, Ruigang Yang

Keywords Abstract Paper

How We Type: Eye and Finger Movement Strategies in Mobile Typing

Xinhui Jiang, Yang Li, Jussi Jokinen and Viet Ba Hirvola, Antti Oulasvirta, Xiangshi Ren

Keywords Abstract Paper

text input, mobile device, eye-hand coordination, eye movement, finger movement

Understanding Human Hands in Contact at Internet Scale

Dandan Shan, Jiaqi Geng, Michelle Shu, David F. Fouhey

Keywords Abstract Paper

hand understanding, human object interaction, interaction detection, hand detection, video dataset, affordance, hand mesh prediction, hand reconstruction

HandVoxNet: Deep Voxel-Based Network for 3D Hand Shape and Pose Estimation From a Single Depth Map

Jameel Malik, Ibrahim Abdelaziz, Ahmed Elhayek and Soshi Shimada, Sk Aziz Ali, Vladislav Golyanik, Christian Theobalt, Didier Stricker

Keywords Abstract Paper

3d computer vision, 3d pose, 3d shape, depth image, convolutional neural network, voxelized hand shape, hand surface, depth map synthesizers, weak supervision, shape registration

Learning 3-D Human Pose Estimation from Catadioptric Videos

Chenchen Liu, Yongzhi Li, Kangqi Ma and Duo Zhang, Peijun Bao, Yadong Mu

Keywords Abstract Paper

Computer Vision, 2D and 3D Computer Vision, Video

Real-Time RGBD-Based Extended Body Pose Estimation

Renat Bashirov, Anastasia Ianina, Karim Iskakov and Yevgeniy Kononenko, Valeriya Strizhkova, Victor Lempitsky, Alexander Vakhitov

Keywords Abstract Paper

Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild

Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos and Michael M. Bronstein, Stefanos Zafeiriou

Keywords Abstract Paper

hand pose estimation, hand reconstruction, mesh reconstruction, geometric deep learning, graph neural networks, weak supervision

Dynamic Normalization and Relay for Video Action Recognition

Dongqi Cai, Anbang Yao, Yurong Chen

Keywords Abstract Paper

deep learning, representation learning

TransMoMo: Invariance-Driven Unsupervised Video Motion Retargeting

Zhuoqian Yang, Wentao Zhu, Wayne Wu and Chen Qian, Qiang Zhou, Bolei Zhou, Chen Change Loy

Keywords Abstract Paper

Shangchen Han, Beibei Liu, Randi Cabezas and
Christopher D. Twigg, Peizhao Zhang, Jeff Petkau, Tsz-Ho Yu, Chun-Jung Tai, Muzaffer Akbay, Zheng Wang, Asaf Nitzan, Gang Dong, Yuting Ye, Lingling Tao, Chengde Wan, Robert Wang

Keywords Paper

Keywords Paper

Liangjian Chen, Shih-Yao Lin, Yusheng Xie and
Yen-Yu Lin, Xiaohui Xie

Keywords Paper

Keywords Paper

Yana Hasson, Bugra Tekin, Federica Bogo and
Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Paper

Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller and
Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt

Keywords Paper

Keywords Paper

Keywords Paper

Liangjian Chen, Shih-Yao Lin, Yusheng Xie and
Yen-Yu Lin, Xiaohui Xie

Keywords Paper

Yi Xu, Fan Bai, Yingxuan Shi and
Qiuyu Chen, Longwen Gao, Kai Tian, Shuigeng Zhou, Huyang Sun

Keywords Paper

Jianbo Liu, Yongcheng Liu, Ying Wang and
Véronique Prinet, Shiming Xiang, Chunhong Pan

Keywords Paper

Enric Corona, Albert Pumarola, Guillem Alenyà and
Francesc Moreno-Noguer, Grégory Rogez

Keywords Paper

Miao Liao, Sibo Zhang, Peng Wang and
Hao Zhu, Xinxin Zuo, Ruigang Yang

Keywords Paper

Xinhui Jiang, Yang Li, Jussi Jokinen and
Viet Ba Hirvola, Antti Oulasvirta, Xiangshi Ren

Keywords Paper

Keywords Paper

Jameel Malik, Ibrahim Abdelaziz, Ahmed Elhayek and
Soshi Shimada, Sk Aziz Ali, Vladislav Golyanik, Christian Theobalt, Didier Stricker

Keywords Paper

Chenchen Liu, Yongzhi Li, Kangqi Ma and
Duo Zhang, Peijun Bao, Yadong Mu

Keywords Paper

Renat Bashirov, Anastasia Ianina, Karim Iskakov and
Yevgeniy Kononenko, Valeriya Strizhkova, Victor Lempitsky, Alexander Vakhitov

Keywords Paper

Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos and
Michael M. Bronstein, Stefanos Zafeiriou

Keywords Paper

Keywords Paper

Zhuoqian Yang, Wentao Zhu, Wayne Wu and
Chen Qian, Qiang Zhou, Bolei Zhou, Chen Change Loy

Keywords Paper

Jun Lv, Wenqiang Xu, Lixin Yang and
Sucheng Qian, Chongzhao Mao, Cewu Lu

Keywords Paper

Valerii Likhosherstov, Krzysztof Choromanski, Jared Quincy Davis and
Xingyou Song, Adrian Weller

Keywords Paper

Keywords Paper

Keywords Paper

Sabarinath Mahadevan, Ali Athar, Aljosa Osep and
Laura Leal-Taixé, Bastian Leibe, Sebastian Hennen

Keywords Paper

Michel Breyer, Jen Jen Chung, Lionel Ott and
Roland Siegwart, Juan Nieto

Keywords Paper

Keywords Paper

Keywords Paper

Keywords Paper

Cenek Albl, Zuzana Kukelova, Viktor Larsson and
Michal Polic, Tomas Pajdla, Konrad Schindler

Keywords Paper

Jiajun Deng, Shaoshuai Shi, Peiwei Li and
Wengang Zhou, Yanyong Zhang, Houqiang Li

Keywords Paper

Keywords Paper

Joan Puigcerver Puigcerver i Perez, Carlos Riquelme, Basil Mustafa and
Cedric Renggli, André Susano Pinto, Sylvain Gelly, Daniel Keysers, Neil Houlsby

Keywords Paper

Antoine Miech, Jean-Baptiste Alayrac, Lucas Smaira and
Ivan Laptev, Josef Sivic, Andrew Zisserman

Keywords Paper

Hongyu Liu, Sam Silvestro, Xiangyu Zhang and
Jian Huang, Tongping Liu

Keywords Paper