The Correlated Gaussian Sparse Histogram Mechanism
Christian Janos Lebeda, Lukas Retschmeier
TL;DR
This work addresses releasing high-dimensional sparse histograms under $(\varepsilon,\delta)$-DP by extending the Gaussian Sparse Histogram Mechanism with correlated noise, yielding the Correlated Stability Histogram (CSH). By exploiting $k$-sparsity and monotonicity, CSH reduces the total noise and lowers the threshold, achieving up to a $2\times$ improvement in utility while preserving privacy. The authors provide both an add-the-deltas analysis and a tighter, case-based bound, and extend the framework to top-$k$ queries and discrete Gaussian noise. They validate the approach with experiments showing substantial utility gains and discuss practical extensions to sparsity thresholds and aggregators. The work thus advances private release of sparse, high-dimensional histograms with practical impact for large-scale data analytics.
Abstract
We consider the problem of releasing a sparse histogram under $(\varepsilon, δ)$-differential privacy. The stability histogram independently adds noise from a Laplace or Gaussian distribution to the non-zero entries and removes those noisy counts below a threshold. Thereby, the introduction of new non-zero values between neighboring histograms is only revealed with probability at most $δ$, and typically, the value of the threshold dominates the error of the mechanism. We consider the variant of the stability histogram with Gaussian noise. Recent works ([Joseph and Yu, COLT '24] and [Lebeda, SOSA '25]) reduced the error for private histograms using correlated Gaussian noise. However, these techniques can not be directly applied in the very sparse setting. Instead, we adopt Lebeda's technique and show that adding correlated noise to the non-zero counts only allows us to reduce the magnitude of noise when we have a sparsity bound. This, in turn, allows us to use a lower threshold by up to a factor of $1/2$ compared to the non-correlated noise mechanism. We then extend our mechanism to a setting without a known bound on sparsity. Additionally, we show that correlated noise can give a similar improvement for the more practical discrete Gaussian mechanism.
