Scalable Uncertainty Quantification for Black-Box Density-Based Clustering
Nicola Bariletto, Stephen G. Walker
TL;DR
A novel framework for uncertainty quantification in clustering is introduced by combining the martingale posterior paradigm with density-based clustering, where uncertainty in the estimated density is naturally propagated to the clustering structure.
Abstract
We introduce a novel framework for uncertainty quantification in clustering. By combining the martingale posterior paradigm with density-based clustering, uncertainty in the estimated density is naturally propagated to the clustering structure. The approach scales effectively to high-dimensional and irregularly shaped data by leveraging modern neural density estimators and GPU-friendly parallel computation. We establish frequentist consistency guarantees and validate the methodology on synthetic and real data.
