Improving the Utility of Differentially Private Clustering through Dynamical Processing
Junyoung Byun, Yujin Choi, Jaewook Lee
TL;DR
This work tackles the utility-privacy trade-off in differentially private clustering by introducing Morse-theory–driven dynamical processing that links Gaussian sub-clusters into complex, nonconvex structures. It builds on DP MoG and DP k-means through a gradient-flow framework that identifies transition equilibrium vectors (TEVs) to connect centers into a hierarchical, DP-preserving graph, enabling any target number of clusters. Theoretical results prove the dynamical processing preserves DP, while experiments across six real datasets show consistent ARI improvements over baseline DP clustering methods, especially when baseline performance is moderate. The approach is versatile, allowing integration with different DP clustering baselines and scalable to various density estimators, with potential extensions to kernel methods and broader hierarchical clustering tasks.
Abstract
This study aims to alleviate the trade-off between utility and privacy of differentially private clustering. Existing works focus on simple methods, which show poor performance for non-convex clusters. To fit complex cluster distributions, we propose sophisticated dynamical processing inspired by Morse theory, with which we hierarchically connect the Gaussian sub-clusters obtained through existing methods. Our theoretical results imply that the proposed dynamical processing introduces little to no additional privacy loss. Experiments show that our framework can improve the clustering performance of existing methods at the same privacy level.
