Optimal Pure Differentially Private Sparse Histograms in Deterministic Linear Time
Florian Kerschbaum, Steven Lee, Hao Wu
TL;DR
This work tackles the problem of releasing a pure differentially private sparse histogram when the domain size $d$ far exceeds the number of participants $n$. It introduces a deterministic linear-time algorithm in the Word-RAM model, achieving the optimal $\ell_\infty$ error and supporting efficient circuit/MPC implementations via a novel private item blanket with target-length padding. The approach hinges on a time-oblivious, purified noise sampler for the discrete Laplace distribution and a careful padding scheme that preserves privacy while keeping the output sparse. By bridging central and distributed DP utilities, the paper closes the utility gap for histograms under pure DP and provides a practical path to secure MPC deployments with near-linear computation and communication costs.
Abstract
We present an algorithm that releases a pure differentially private (under the replacement neighboring relation) sparse histogram for $n$ participants over a domain of size $d \gg n$. Our method achieves the optimal $\ell_\infty$-estimation error and runs in strictly $O(n)$ time in the Word-RAM model, improving upon the previous best deterministic-time bound of $\tilde{O}(n^2)$ and resolving the open problem of breaking this quadratic barrier (Balcer and Vadhan, 2019). Moreover, the algorithm admits an efficient circuit implementation, enabling the first near-linear communication and computation cost pure DP histogram MPC protocol with optimal $\ell_\infty$-estimation error. Central to our algorithm is a novel **private item blanket** technique with target-length padding, which hides differences in the supports of neighboring histograms while remaining efficiently implementable.
