Deep Clustering Using the Soft Silhouette Score: Towards Compact and Well-Separated Clusters
Georgios Vardakas, Ioannis Papakostas, Aristidis Likas
TL;DR
This work introduces the soft silhouette score, a differentiable probabilistic generalization of the classic silhouette, as a clustering objective for deep learning. It integrates soft silhouette into an autoencoder based architecture (DCSS) with a dedicated clustering head to produce cluster probabilities, jointly optimizing reconstruction and a clustering loss that promotes both compactness and inter-cluster separation. Empirical results across diverse real and synthetic datasets show that DCSS yields superior internal clustering quality (NMI, ARI) compared to established deep clustering methods, highlighting the benefit of optimizing cluster separability alongside compactness. The approach advances unsupervised deep clustering by providing a differentiable, probability-aware objective that improves representation learning for more distinct and cohesive clusters, with future work on data augmentation and model enhancements.
Abstract
Unsupervised learning has gained prominence in the big data era, offering a means to extract valuable insights from unlabeled datasets. Deep clustering has emerged as an important unsupervised category, aiming to exploit the non-linear mapping capabilities of neural networks in order to enhance clustering performance. The majority of deep clustering literature focuses on minimizing the inner-cluster variability in some embedded space while keeping the learned representation consistent with the original high-dimensional dataset. In this work, we propose soft silhoutte, a probabilistic formulation of the silhouette coefficient. Soft silhouette rewards compact and distinctly separated clustering solutions like the conventional silhouette coefficient. When optimized within a deep clustering framework, soft silhouette guides the learned representations towards forming compact and well-separated clusters. In addition, we introduce an autoencoder-based deep learning architecture that is suitable for optimizing the soft silhouette objective function. The proposed deep clustering method has been tested and compared with several well-studied deep clustering methods on various benchmark datasets, yielding very satisfactory clustering results.
