Simplex Clustering via sBeta with Applications to Online Adjustment of Black-Box Predictions
Florent Chiaroni, Malik Boudiaf, Amar Mitiche, Ismail Ben Ayed
TL;DR
The paper tackles domain shift in deep-network predictions by proposing a model-agnostic, privacy-preserving approach that clusters softmax outputs on the probability simplex. It introduces sBeta, a unimodal generalization of the Beta density on each simplex coordinate, and the k-sBetas clustering model that uses block-coordinate descent to estimate $\alpha_k,\beta_k$ and assignments $\mathbf U$. Key contributions include a tractable, moment-based parameter estimation (MoM) and unimodality-constrained optimization to avoid bimodality and degeneracy, together with an optimal transport-based cluster-to-class mapping when $K=D$. Empirical results across unsupervised domain adaptation, transductive zero/one-shot CLIP tasks, and real-time road segmentation demonstrate competitive performance and practicality, with public code for reproducibility.
Abstract
We explore clustering the softmax predictions of deep neural networks and introduce a novel probabilistic clustering method, referred to as k-sBetas. In the general context of clustering discrete distributions, the existing methods focused on exploring distortion measures tailored to simplex data, such as the KL divergence, as alternatives to the standard Euclidean distance. We provide a general maximum a posteriori (MAP) perspective of clustering distributions, emphasizing that the statistical models underlying the existing distortion-based methods may not be descriptive enough. Instead, we optimize a mixed-variable objective measuring data conformity within each cluster to the introduced sBeta density function, whose parameters are constrained and estimated jointly with binary assignment variables. Our versatile formulation approximates various parametric densities for modeling simplex data and enables the control of the cluster-balance bias. This yields highly competitive performances for the unsupervised adjustment of black-box model predictions in various scenarios. Our code and comparisons with the existing simplex-clustering approaches and our introduced softmax-prediction benchmarks are publicly available: https://github.com/fchiaroni/Clustering_Softmax_Predictions.
