Perturb-and-Project: Differentially Private Similarities and Marginals
Vincent Cohen-Addad, Tommaso d'Orsi, Alessandro Epasto, Vahab Mirrokni, Peilin Zhong
TL;DR
Perturb-and-Project designs differentially private outputs by adding noise to inputs and projecting onto admissible sets, with error governed by the Gaussian complexity of the projection space and enhanced by sum-of-squares certificates. The authors provide new DP algorithms for privately releasing pairwise cosine similarities and for computing $k$-way marginals, including strong gains for $t$-sparse datasets, and they show how alternating projections yield practical, scalable implementations. The work grounds utility guarantees in tight SOS-based analyses of injective tensor norms and Gaussian complexity, linking privacy with the intrinsic richness of the target set. This framework enables efficient, high-signal private computation of similarities and marginals applicable to nearest-neighbor search, contingency-table analysis, and synthetic-data tasks, while offering theoretical foundations for when fast input-perturbation methods perform well in practice.
Abstract
We revisit the input perturbations framework for differential privacy where noise is added to the input $A\in \mathcal{S}$ and the result is then projected back to the space of admissible datasets $\mathcal{S}$. Through this framework, we first design novel efficient algorithms to privately release pair-wise cosine similarities. Second, we derive a novel algorithm to compute $k$-way marginal queries over $n$ features. Prior work could achieve comparable guarantees only for $k$ even. Furthermore, we extend our results to $t$-sparse datasets, where our efficient algorithms yields novel, stronger guarantees whenever $t\le n^{5/6}/\log n\,.$ Finally, we provide a theoretical perspective on why \textit{fast} input perturbation algorithms works well in practice. The key technical ingredients behind our results are tight sum-of-squares certificates upper bounding the Gaussian complexity of sets of solutions.
