Towards Calibrated Deep Clustering Network
Yuheng Jia, Jianhong Cheng, Hui Liu, Junhui Hou
TL;DR
This work tackles the problem of overconfidence in deep clustering by introducing Calibrated Deep Clustering (CDC), a dual-head network with a Calibration Head and a Clustering Head. The Calibration Head produces calibrated confidences using a region-aware loss with a mini-cluster target and a negative-entropy term, while the Clustering Head uses these confidences to dynamically select high-confidence pseudo-labels for self-training. A feature-prototype-based initialization, built on MoCo-v2 pretraining, ensures stable and fast convergence, and theoretical guarantees underpin the calibration strategy. Empirical results across six benchmarks show CDC achieving about a 5x reduction in expected calibration error ($ECE$) and substantial gains in clustering accuracy, with strong failure-rejection and OOD-detection capabilities. The approach enables trustworthy unsupervised clustering with reliable confidence estimates and practical impact on downstream decision-making tasks.
Abstract
Deep clustering has exhibited remarkable performance; however, the over confidence problem, i.e., the estimated confidence for a sample belonging to a particular cluster greatly exceeds its actual prediction accuracy, has been over looked in prior research. To tackle this critical issue, we pioneer the development of a calibrated deep clustering framework. Specifically, we propose a novel dual head (calibration head and clustering head) deep clustering model that can effectively calibrate the estimated confidence and the actual accuracy. The calibration head adjusts the overconfident predictions of the clustering head, generating prediction confidence that matches the model learning status. Then, the clustering head dynamically selects reliable high-confidence samples estimated by the calibration head for pseudo-label self-training. Additionally, we introduce an effective network initialization strategy that enhances both training speed and network robustness. The effectiveness of the proposed calibration approach and initialization strategy are both endorsed with solid theoretical guarantees. Extensive experiments demonstrate the proposed calibrated deep clustering model not only surpasses the state-of-the-art deep clustering methods by 5x on average in terms of expected calibration error, but also significantly outperforms them in terms of clustering accuracy. The code is available at https://github.com/ChengJianH/CDC.
