Efficient and Private Marginal Reconstruction with Local Non-Negativity
Brett Mullins, Miguel Fuentes, Yingtai Xiao, Daniel Kifer, Cameron Musco, Daniel Sheldon
TL;DR
The paper addresses private reconstruction of marginal queries under differential privacy by introducing ReM, a residuals-to-marginals post-processing method that reconstructs workload marginals from residual measurements using a Kronecker-structured representation. It extends to Gaussian noise with GReM-LNN, which enforces local non-negativity to reduce reconstruction error, and proves efficiency through residual decomposition and pseudoinverse mappings. Empirical results show substantial accuracy gains and scalability improvements when integrating ReM and GReM-LNN into private query mechanisms such as ResidualPlanner and Scalable MWEM. The work provides practical algorithms and complexity analyses, enabling accurate, scalable private marginal reconstruction without exponential blow-up in high-dimensional domains.
Abstract
Differential privacy is the dominant standard for formal and quantifiable privacy and has been used in major deployments that impact millions of people. Many differentially private algorithms for query release and synthetic data contain steps that reconstruct answers to queries from answers to other queries that have been measured privately. Reconstruction is an important subproblem for such mechanisms to economize the privacy budget, minimize error on reconstructed answers, and allow for scalability to high-dimensional datasets. In this paper, we introduce a principled and efficient postprocessing method ReM (Residuals-to-Marginals) for reconstructing answers to marginal queries. Our method builds on recent work on efficient mechanisms for marginal query release, based on making measurements using a residual query basis that admits efficient pseudoinversion, which is an important primitive used in reconstruction. An extension GReM-LNN (Gaussian Residuals-to-Marginals with Local Non-negativity) reconstructs marginals under Gaussian noise satisfying consistency and non-negativity, which often reduces error on reconstructed answers. We demonstrate the utility of ReM and GReM-LNN by applying them to improve existing private query answering mechanisms.
