COHESIV: Contrastive Object and Hand Embedding Segmentation In Video

06/12/2021

COHESIV: Contrastive Object and Hand Embedding Segmentation In Video

Dandan Shan, Richard Higgins, David Fouhey

Keywords: deep learning, contrastive learning

Abstract Paper Similar Papers

Abstract: In this paper we learn to segment hands and hand-held objects from motion. Our system takes a single RGB image and hand location as input to segment the hand and hand-held object. For learning, we generate responsibility maps that show how well a hand's motion explains other pixels' motion in video. We use these responsibility maps as pseudo-labels to train a weakly-supervised neural network using an attention-based similarity loss and contrastive loss. Our system outperforms alternate methods, achieving good performance on the 100DOH, EPIC-KITCHENS, and HO3D datasets.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at NeurIPS 2021 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Hand-Model-Aware Sign Language Recognition

Hezhen Hu, Wengang Zhou, Houqiang Li

Keywords Paper

0

0

0

0

14:38

14/06/2020

Leveraging Photometric Consistency Over Time for Sparsely Supervised Hand-Object Reconstruction

Yana Hasson, Bugra Tekin, Federica Bogo and
Ivan Laptev, Marc Pollefeys, Cordelia Schmid

Keywords Paper

hand-object reconstruction, pose estimation, object manipulation, photometric consistency, self-supervised learning, hands, objects, 3d reconstruction, manipulation

0

0

0

0

1:01

14/06/2020

GanHand: Predicting Human Grasp Affordances in Multi-Object Scenes

Enric Corona, Albert Pumarola, Guillem Alenyà and
Francesc Moreno-Noguer, Grégory Rogez

Keywords Paper

generative model, grasp prediction, generative model, large-scale dataset, hand pose estimation, hand shape estimation, affordances, augmented reality, robotics, human-object interaction

0

0

0

0

4:56

14/06/2020

Disentangled and Controllable Face Image Generation via 3D Imitative-Contrastive Learning

Yu Deng, Jiaolong Yang, Dong Chen and
Fang Wen, Xin Tong

Keywords Paper

face image synthesis, disentangled representation learning, controllable generation, gan, 3d

0

0

0

0

5:01

05/01/2021

Towards Contextual Learning in Few-Shot Object Classification

Mathieu Page Fortin, Brahim Chaib-draa

Keywords Paper

0

0

0

0

4:57

18/07/2021

Learning Intra-Batch Connections for Deep Metric Learning

Jenny Seidenschwarz, Ismail Elezi, Laura Leal-Taixé

Keywords Paper

Algorithms, Metric Learning

0

0

0

0

5:17

05/01/2021

Where to Look?: Mining Complementary Image Regions for Weakly Supervised Object Localization

Sadbhavana Babar, Sukhendu Das

Keywords Paper

0

0

0

0

5:01

14/06/2020

Learn to Augment: Joint Data Augmentation and Network Optimization for Text Recognition

Canjie Luo, Yuanzhi Zhu, Lianwen Jin, Yongpan Wang

Keywords Paper

data augmentation, text recognition, joint training

0

0

0

0

0:59

30/11/2020

Horizontal Flipping Assisted Disentangled Feature Learning for Semi-Supervised Person Re-Identification

Gehan Hao, Yang Yang, Xue Zhou and
Guanan Wang, Zhen Lei

Keywords Paper

0

0

0

0

5:09

06/12/2020

FLAMBE: Structural Complexity and Representation Learning of Low Rank MDPs

Alekh Agarwal, Sham Kakade, Akshay Krishnamurthy, Wen Sun

Keywords Paper

0

0

0

0

3:34

03/05/2021

Learning Task-General Representations with Generative Neuro-Symbolic Modeling

Reuben Feinman, Brenden Lake

Keywords Paper

probabilistic programs, neuro-symbolic models, few-shot concept learning, generative models

0

0

0

0

6:13

07/09/2020

BiHand: Recovering Hand Mesh with Multi-stage Bisected Hourglass Networks

Lixin Yang, Jiasen Li, Wenqiang Xu and
Yiqun Diao, Cewu Lu

Keywords Paper

hand pose estimation, 3d hand, hand reconstruction, hand mesh

0

0

0

0

5:30

02/02/2021

Deep Metric Learning with Self-Supervised Ranking

Zheren Fu, Yan Li, Zhendong Mao and
Quan Wang, Yongdong Zhang

Keywords Paper

0

0

0

0

12:36

14/06/2020

Weakly-Supervised Mesh-Convolutional Hand Reconstruction in the Wild

Dominik Kulon, Riza Alp Güler, Iasonas Kokkinos and
Michael M. Bronstein, Stefanos Zafeiriou

Keywords Paper

hand pose estimation, hand reconstruction, mesh reconstruction, geometric deep learning, graph neural networks, weak supervision

0

0

0

0

5:01

03/05/2021

Few-Shot Learning via Learning the Representation, Provably

Simon Du, Wei Hu, Sham M Kakade and
Jason Lee, Qi Lei

Keywords Paper

statistical learning theory, representation learning

0

0

0

0

6:29

05/01/2021

3D Human Pose and Shape Estimation Through Collaborative Learning and Multi-View Model-Fitting

Zhongguo Li, Magnus Oskarsson, Anders Heyden

Keywords Paper

0

0

0

0

5:13

22/11/2021

FacialGAN: Style Transfer and Attribute Manipulation on Synthetic Faces

Ricard Durall Lopez, Jireh Jam, Dominik Strassel and
Moi Hoon Yap, Janis Keuper

Keywords Paper

GAN, attribute manipulation, style transfer, face editing

0

0

0

0

2:55

07/09/2020

Explicit Knowledge Distillation for 3D Hand Pose Estimation from Monocular RGB

Yumeng Zhang, Li Chen, Yufeng Liu and
Wen Zheng, JunHai Yong

Keywords Paper

3D hand pose estimation, knowledge distillation

0

0

0

0

9:15

14/06/2020

Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning

Yuan Yao, Chang Liu, Dezhao Luo and
Yu Zhou, Qixiang Ye

Keywords Paper

self-supervised spatio-temporal representation learning, multi-temporal resolution characteristic, playback rate perception, motion attention mechanism

0

0

0

0

1:01

03/05/2021

Self-supervised Learning from a Multi-view Perspective

Yao-Hung Hubert Tsai, Yue Wu, Ruslan Salakhutdinov, LP Morency

Keywords Paper

Self-supervised Learning, Unsupervised Learning, Multi-view Representation Learning

0

0

0

0

5:36

22/11/2021

Hand-Object Contact Prediction via Motion-Based Pseudo-Labeling and Guided Progressive Label Correction

Takuma Yagi, Md Tasnimul Hasan, Yoichi Sato

Keywords Paper

hand-object interaction, contact prediction, learning from noisy labels, label correction, pseudo-labeling, first-person vision, egocentric vision

0

0

0

0

3:03

14/06/2020

Understanding Human Hands in Contact at Internet Scale

Dandan Shan, Jiaqi Geng, Michelle Shu, David F. Fouhey

Keywords Paper

hand understanding, human object interaction, interaction detection, hand detection, video dataset, affordance, hand mesh prediction, hand reconstruction

0

0

0

0

5:01

07/09/2020

Attention Distillation for Learning Video Representations

Miao Liu, Xin Chen, Yun Zhang and
Yin Li, James Rehg

Keywords Paper

Action Recognition, Deep Learning, Representation Learning

0

0

0

0

9:50

03/05/2021

Prototypical Contrastive Learning of Unsupervised Representations

Junnan Li, Pan Zhou, Caiming Xiong, Steven Hoi

Keywords Paper

self-supervised learning, unsupervised learning, representation learning, contrastive learning

0

0

0

0

4:51

03/05/2021

Learning and Evaluating Representations for Deep One-Class Classification

Kihyuk Sohn, Chun-Liang Li, Jinsung Yoon and
Minho Jin, Tomas Pfister

Keywords Paper

self-supervised learning, deep one-class classification

0

0

0

1

5:13

06/12/2021

Look at What I’m Doing: Self-Supervised Spatial Grounding of Narrations in Instructional Videos

Reuben Tan, Bryan Plummer, Kate Saenko and
Hailin Jin, Bryan Russell

Keywords Paper

optimization

0

0

0

0

12:28

06/12/2021

Unsupervised Part Discovery from Contrastive Reconstruction

Subhabrata Choudhury, Iro Laina, Christian Rupprecht, Andrea Vedaldi

Keywords Paper

machine learning, self-supervised learning, clustering, representation learning

0

0

0

0

6:46

02/02/2021

Attributes-Guided and Pure-Visual Attention Alignment for Few-Shot Recognition

Siteng Huang, Min Zhang, Yachen Kang, Donglin Wang

Keywords Paper

0

0

0

0

17:04

06/12/2021

CorticalFlow: A Diffeomorphic Mesh Transformer Network for Cortical Surface Reconstruction

Leo Lebrat, Rodrigo Santa Cruz, Frederic de Gournay and
Darren Fu, Pierrick Bourgeat, Jurgen Fripp, Clinton Fookes, Olivier Salvado

Keywords Paper

deep learning, transformers

0

0

0

0

12:47

19/08/2021

TCIC: Theme Concepts Learning Cross Language and Vision for Image Captioning

Zhihao Fan, Zhongyu Wei, Siyuan Wang and
Ruize Wang, Zejun Li, Haijun Shan, Xuanjing Huang

Keywords Paper

Computer Vision, Language and Vision, Natural Language Generation

0

0

0

0

10:46

05/01/2021

Active Learning for Bayesian 3D Hand Pose Estimation

Razvan Caramalau, Binod Bhattarai, Tae-Kyun Kim

Keywords Paper

0

0

0

0

5:18

14/06/2020

Weakly-Supervised Domain Adaptation via GAN and Mesh Model for Estimating 3D Hand Poses Interacting Objects

Seungryul Baek, Kwang In Kim, Tae-Kyun Kim

Keywords Paper

hand pose estimation, 3d pose estimation, hand-object interaction, weak supervision, domain adaptation, generative adversarial network, 3d mesh model, mano, hand gesture recognition, data synthesis

0

0

0

0

5:01

06/12/2020

Latent Template Induction with Gumbel-CRFs

Yao Fu, Chuanqi Tan, Bin Bi and
Mosha Chen, Yansong Feng, Alexander Rush

Keywords Paper

0

0

0

0

3:14

30/11/2020

Learning 3D Face Reconstruction with a Pose Guidance Network

Pengpeng Liu, Xintong Han, Michael Lyu and
Irwin King, Jia Xu

Keywords Paper

0

0

0

0

9:32

14/06/2020

Novel Object Viewpoint Estimation Through Reconstruction Alignment

Mohamed El Banani, Jason J. Corso, David F. Fouhey

Keywords Paper

viewpoint estimation, geometry-aware, alignment, reconstruction, 3d, cross-dataset, generalization

0

0

0

0

1:01

18/07/2021

Efficient Iterative Amortized Inference for Learning Symmetric and Disentangled Multi-Object Representations

Patrick Emami, Pan He, Sanjay Ranka, Anand Rangarajan

Keywords Paper

Deep Learning, Embedding and Representation learning

0

0

0

0

5:10

22/11/2021

Hierarchical Contrastive Motion Learning for Video Action Recognition

Xitong Yang, Xiaodong Yang, Sifei Liu and
Deqing Sun, Larry Davis, Jan Kautz

Keywords Paper

action recognition, motion hierarchy, motion representation, contrastive learning

0

0

0

0

8:29

07/09/2020

SketchHealer: A Graph-to-Sequence Network for Recreating Partial Human Sketches

Guoyao Su, Yonggang Qi, Kaiyue Pang and
Jie Yang, Yi-Zhe Song

Keywords Paper

sketch healing, sketch synthesis, graph-to-sequence network, GCN

0

0

0

0

8:34

14/06/2020

PatchVAE: Learning Local Latent Codes for Recognition

Kamal Gupta, Saurabh Singh, Abhinav Shrivastava

Keywords Paper

unsupervised, vae, self-supervised, generative, mid-level patches, variational auto-encoder, representation learning, autoencoder, part mining, patchvae

0

1

1

1

1:01

05/01/2021

Set Augmented Triplet Loss for Video Person Re-Identification

Pengfei Fang, Pan Ji, Lars Petersson, Mehrtash Harandi

Keywords Paper

0

0

0

0

4:56