Locally Optimal Private Sampling: Beyond the Global Minimax
Hrad Ghoukasian, Bonwoo Lee, Shahab Asoodeh
TL;DR
The paper advances private sampling under local DP by introducing a local minimax framework around a public prior $P_0$, with neighborhood defined via $E_\gamma$-divergence. It extends the global functional-LDP minimax analysis to a local setting, proving that the local minimax risk in $N_\gamma(P_0)$ matches the restricted global minimax risk and deriving closed-form samplers that are independent of the specific $f$-divergence. It also provides a pure-LDP, nonlinear sampler that is instance-optimal on the local neighborhood, surpassing the linear functional-LDP mechanisms. Numerical experiments on finite and continuous spaces demonstrate that locally minimax samplers consistently outperform global minimax samplers under both pure LDP and GLDP, highlighting the practical benefits of incorporating public data into privacy-preserving data generation. The work lays groundwork for personalized private sampling and suggests several avenues for future work, including scalable implementations and generalized neighborhood definitions.
Abstract
We study the problem of sampling from a distribution under local differential privacy (LDP). Given a private distribution $P \in \mathcal{P}$, the goal is to generate a single sample from a distribution that remains close to $P$ in $f$-divergence while satisfying the constraints of LDP. This task captures the fundamental challenge of producing realistic-looking data under strong privacy guarantees. While prior work by Park et al. (NeurIPS'24) focuses on global minimax-optimality across a class of distributions, we take a local perspective. Specifically, we examine the minimax risk in a neighborhood around a fixed distribution $P_0$, and characterize its exact value, which depends on both $P_0$ and the privacy level. Our main result shows that the local minimax risk is determined by the global minimax risk when the distribution class $\mathcal{P}$ is restricted to a neighborhood around $P_0$. To establish this, we (1) extend previous work from pure LDP to the more general functional LDP framework, and (2) prove that the globally optimal functional LDP sampler yields the optimal local sampler when constrained to distributions near $P_0$. Building on this, we also derive a simple closed-form expression for the locally minimax-optimal samplers which does not depend on the choice of $f$-divergence. We further argue that this local framework naturally models private sampling with public data, where the public data distribution is represented by $P_0$. In this setting, we empirically compare our locally optimal sampler to existing global methods, and demonstrate that it consistently outperforms global minimax samplers.
