Utilizing Structure-rich Features to improve Clustering

14/09/2020

Utilizing Structure-rich Features to improve Clustering

Benjamin Schelling, Lena Greta Marie Bauer, Sahar Behzadi, Claudia Plant

Keywords:

Abstract Paper Similar Papers

Abstract: For successful clustering, an algorithm needs to find the boundaries between clusters. While this is comparatively easy if the clusters are compact and non-overlapping and thus the boundaries clearly defined, features where the clusters blend into each other hinder clustering methods to correctly estimate these boundaries. Therefore, we aim to extract features showing clear cluster boundaries and thus enhance the cluster structure in the data. Our novel technique creates a condensed version of the data set containing the structure important for clustering, but without the noise-information. We demonstrate that this transformation of the data set is much easier to cluster for k-means, but also various other algorithms. Furthermore, we introduce a deterministic initialisation strategy for k-means based on these structure-rich features.

0

0

0

0

Share

This is an embedded video. Talk and the respective paper are published at ECML PKDD 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment

no comments yet

Similar Papers

02/02/2021

Automated Clustering of High-dimensional Data with a Feature Weighted Mean Shift Algorithm

Saptarshi Chakraborty, Debolina Paul, Swagatam Das

Keywords Paper

0

0

0

0

20:09

14/09/2020

Model-based Clustering with HDBSCAN*

Michael Strobl, Joerg Sander, Ricardo Campello, Osmar Zaiane

Keywords Paper

hierarchical clustering, expectation maximization, model selection

0

0

0

0

15:31

03/08/2020

Brief announcement: Deterministic lower bound for dynamic balanced graph partitioning

Maciej Pacut, Mahmoud Parham, Stefan Schmid

Keywords Paper

online algorithms, graph partitioning, self-adjusting networks

0

0

0

0

10:22

14/09/2020

An efficient K-means clustering algorithm for tall data

Marco Capó, Aritz Pérez, Jose A. Lozan

Keywords Paper

0

0

0

0

14:46

06/12/2021

Label consistency in overfitted generalized $k$-means

Linfan Zhang, Arash Amini

Keywords Paper

clustering

0

0

0

0

9:26

06/12/2020

Deep Transformation-Invariant Clustering

Tom Monnier, Thibault Groueix, Mathieu Aubry

Keywords Paper

0

0

0

0

3:22

02/02/2021

Variational Fair Clustering

Imtiaz Masud Ziko, Jing Yuan, Eric Granger, Ismail Ben Ayed

Keywords Paper

0

0

0

0

19:28

06/12/2021

Overlapping Spaces for Compact Graph Representations

Kirill Shevkunov, Liudmila Prokhorenkova

Keywords Paper

optimization, graph learning

0

0

0

0

7:39

06/12/2020

BanditPAM: Almost Linear Time k-Medoids Clustering via Multi-Armed Bandits

Mo Tiwari, Martin Zhang, James J Mayclin and
Sebastian Thrun, Chris Piech, Ilan Shomorony

Keywords Paper

0

0

0

0

3:16

06/12/2021

Better Algorithms for Individually Fair $k$-Clustering

Maryam Negahbani, Deeparnab Chakrabarty

Keywords Paper

theory, self-supervised learning, clustering, fairness

0

0

0

0

14:02

12/07/2020

Sparse Subspace Clustering with Entropy-Norm

Liang Bai, Jiye Liang

Keywords Paper

Unsupervised and Semi-Supervised Learning

0

0

0

0

11:51

19/08/2021

Details (Don't) Matter: Isolating Cluster Information in Deep Embedded Spaces

Lukas Miklautz, Lena G. M. Bauer, Dominik Mautz and
Sebastian Tschiatschek, Christian Böhm, Claudia Plant

Keywords Paper

Machine Learning, Deep Learning, Explainable/Interpretable Machine Learning, Clustering

0

0

0

0

14:37

18/07/2021

Differentially-Private Clustering of Easy Instances

Edith Cohen, Haim Kaplan, Yishay Mansour and
Uri Stemmer, Eliad Tsfadia

Keywords Paper

Social Aspects of Machine Learning, Privacy, Anonymity, and Security

0

0

0

0

4:58

06/12/2020

Near-Optimal Comparison Based Clustering

Michaël Perrot, Pascal Esser, Debarghya Ghoshdastidar

Keywords Paper

0

0

0

0

3:08

08/07/2020

Proportionally Fair Clustering Revisited

Evi Micha, Nisarg Shah

Keywords Paper

Fairness, Clustering, Facility location

0

0

0

0

24:22

06/12/2020

Unsupervised Learning of Visual Features by Contrasting Cluster Assignments

Mathilde Caron, Ishan Misra, Julien Mairal and
Priya Goyal, Piotr Bojanowski, Armand Joulin

Keywords Paper

0

1

0

0

3:22

26/08/2020

Entropy Weighted Power k-Means Clustering

Saptarshi Chakraborty, Debolina Paul, Swagatam Das, Jason Xu

Keywords Paper

0

0

0

0

15:20

14/06/2020

Learning to Cluster Faces via Confidence and Connectivity Estimation

Lei Yang, Dapeng Chen, Xiaohang Zhan and
Rui Zhao, Chen Change Loy, Dahua Lin

Keywords Paper

learnable clustering, vertex confidence, edge connectivity

0

0

0

0

1:01

13/04/2021

Large scale k-median clustering for stable clustering instances

Konstantin Voevodski

Keywords Paper

0

0

0

0

2:50

12/07/2020

Neural Clustering Processes

Ari Pakman, Yueqi Wang, Catalin Mitelut and
JinHyung Lee, Department of Statistics Liam Paninski

Keywords Paper

Deep Learning - General

0

0

0

0

7:27

06/12/2020

Improving Local Identifiability in Probabilistic Box Embeddings

Shib Dasgupta, Michael Boratko, Dongxu Zhang and
Luke Vilnis, Xiang Li, Andrew McCallum

Keywords Paper

0

0

0

0

3:20

03/05/2021

Fast Geometric Projections for Local Robustness Certification

Aymeric Fromherz, Klas Leino, Matt Fredrikson and
Bryan Parno, Corina Pasareanu

Keywords Paper

verification, robustness, safety

0

1

0

0

11:54

14/06/2020

Density-Aware Feature Embedding for Face Clustering

Senhui Guo, Jing Xu, Dapeng Chen and
Chao Zhang, Xiaogang Wang, Rui Zhao

Keywords Paper

representation learning, clustering, face clustering, density, density-aware, gcn

0

0

0

0

0:55

03/08/2020

Scalable and Flexible Clustering of Grouped Data via Parallel and Distributed Sampling in Versatile Hierarchical Dirichlet Processes

Or Dinari, Oren Freifeld

Keywords Paper

0

0

0

0

7:43

06/12/2020

Adversarial Learning for Robust Deep Clustering

Xu Yang, Cheng Deng, Kun Wei and
Junchi Yan, Wei Liu

Keywords Paper

0

0

0

0

3:23

06/12/2020

Faster DBSCAN via subsampled similarity queries

Heinrich Jiang, Jennifer Jang, Jakub Lacki

Keywords Paper

0

0

0

0

3:13

12/07/2020

How to Solve Fair k-Center in Massive Data Models

Ashish Chiplunkar, Sagar Kale, Sivaramakrishnan Natarajan Ramamoorthy

Keywords Paper

Fairness, Equity, Justice, and Safety

0

0

0

0

13:45

19/08/2021

Discrete Multiple Kernel k-means

Rong Wang, Jitao Lu, Yihang Lu and
Feiping Nie, Xuelong Li

Keywords Paper

Machine Learning, Clustering, Kernel Methods, Multi-instance; Multi-label; Multi-view learning

0

0

0

0

15:04

06/12/2021

Improving Contrastive Learning on Imbalanced Data via Open-World Sampling

Ziyu Jiang, Tianlong Chen, Ting Chen, Zhangyang Wang

Keywords Paper

contrastive learning

0

0

0

0

10:12

30/11/2020

HPGCNN: Hierarchical Parallel Group Convolutional Neural Networks for Point Clouds Processing

Jisheng Dang, Jun Yang

Keywords Paper

0

0

0

0

9:33

26/04/2020

Minimizing FLOPs to Learn Efficient Sparse Representations

Biswajit Paria, Chih-Kuan Yeh, Ian E.H. Yen and
Ning Xu, Pradeep Ravikumar, Barnabás Póczos

Keywords Paper

sparse embeddings, deep representations, metric learning, regularization

0

0

0

0

4:41

06/12/2020

Learning outside the Black-Box: The pursuit of interpretable models

Jonathan Crabbe, Yao Zhang, William Zame, Mihaela van der Schaar

Keywords Paper

0

0

0

0

3:16

03/05/2021

Distance-Based Regularisation of Deep Networks for Fine-Tuning

Henry Gouk, Timothy Hospedales, massimiliano pontil

Keywords Paper

Statistical Learning Theory, Transfer Learning, Deep Learning

0

0

0

0

4:57

06/12/2020

Rethinking Learnable Tree Filter for Generic Feature Transform

Lin Song, Yanwei Li, Zhengkai Jiang and
Zeming Li, Xiangyu Zhang, Hongbin Sun, Jian Sun, Nanning Zheng

Keywords Paper

Neuroscience and Cognitive Science -> Memory; Optimization -> Combinatorial Optimization; Optimization -> Submodular Optimizati, Neuroscience and Cognitive Science -> Human or Animal Learning

0

0

0

0

3:11

14/06/2020

A Graduated Filter Method for Large Scale Robust Estimation

Huu Le, Christopher Zach

Keywords Paper

robust fitting, bundle adjustment, non-convex, poor local minima, non-linear least squares, graduated non-convexity.

0

0

0

0

1:01

06/12/2021

Dual Parameterization of Sparse Variational Gaussian Processes

Vincent ADAM, Paul Chang, Mohammad Emtiyaz Khan, Arno Solin

Keywords Paper

optimization, generative model, kernel methods

0

0

0

0

13:29

02/02/2021

Discriminative Region Suppression for Weakly-Supervised Semantic Segmentation

Beomyoung Kim, Sangeun Han, Junmo Kim

Keywords Paper

0

0

0

0

14:27

02/02/2021

Deep Mutual Information Maximin for Cross-Modal Clustering

Yiqiao Mao, Xiaoqiang Yan, Qiang Guo, Yangdong Ye

Keywords Paper

0

0

0

0

16:31

13/04/2021

Federated learning with compression: Unified analysis and sharp guarantees

Farzin Haddadpour, Mohammad Mahdi Kamani, Aryan Mokhtari, Mehrdad Mahdavi

Keywords Paper

0

0

0

0

3:03

06/12/2020

Exact Recovery of Mangled Clusters with Same-Cluster Queries

Marco Bressan, Nicolò Cesa-Bianchi, Silvio Lattanzi, Andrea Paudice

Keywords Paper

Algorithms -> Image Segmentation; Applications -> Computer Vision; Applications -> Image Segmentation; Applications -> Visual S, Deep Learning -> Visualization or Exposition Techniques for Deep Networks

0

0

0

0

3:13