Unsupervised Part Representation by Flow Capsules

Abstract: Capsule networks aim to parse images into a hierarchy of objects, parts and relations. While promising, they remain limited by an inability to learn effective low level part descriptions. To address this issue we propose a way to learn primary capsule encoders that detect atomic parts from a single image. During training we exploit motion as a powerful perceptual cue for part definition, with an expressive decoder for part generation within a layered image model with occlusion. Experiments demonstrate robust part discovery in the presence of multiple objects, cluttered backgrounds, and occlusion. The learned part decoder is shown to infer the underlying shape masks, effectively filling in occluded regions of the detected shapes. We evaluate FlowCapsules on unsupervised part segmentation and unsupervised image classification.

06/12/2021

transformer, image captioning, vision and language, fully-attentive models, mesh connectivity, memory vectors, self-attention

1:00

14/06/2020

Weakly supervised segmentation, semi supervised segmentation, Pseudo-label generation, Class Activation Maps, Objectness, Saliency

3:02

16/11/2020

Mel Vecerik, Jean-Baptiste Regli, Oleg Sushkov and
David Barker, Rugile Pevceviciute, Thomas Roth ̈orl, Raia Hadsell, Lourdes Agapito, Jonathan Scholz

Keywords Paper

5:06

14/06/2020

DOPS: Learning to Detect 3D Objects and Predict Their 3D Shapes

Mahyar Najibi, Guangda Lai, Abhijit Kundu and
Zhichao Lu, Vivek Rathod, Thomas Funkhouser, Caroline Pantofaru, David Ross, Larry S. Davis, Alireza Fathi

Di Hu, Rui Qian, Minyue Jiang and
Xiao Tan, Shilei Wen, Errui Ding, Weiyao Lin, Dejing Dou

Keywords Paper

3:07

14/06/2020

Multi-Path Learning for Object Pose Estimation Across Domains

Martin Sundermeyer, Maximilian Durner, En Yen Puang and
Zoltan-Csaba Marton, Narunas Vaskevicius, Kai O. Arras, Rudolph Triebel

Keywords Paper

object pose estimation, encodings, multi object, synthetic data, symmetries, autoencoder, embedding, 6d object detection, t-less, relative pose estimation

1:01

14/06/2020

interpretability, novelty-detection, deep-anomaly-detection, one-class-classification, xai, explanations, anomaly-detection, deep-learning, outlier-detection

5:26

14/06/2020

image classification, partial occlusion, compositional model, out of distribution, analysis by synthesis, robustness, deep learning

1:01

14/06/2020

saliency detection, salient object detection, feature interaction strategy, scale-insensitive loss, multi-scale features, multi-level features, fully convolutional network, deep learning

1:01

26/04/2020