Multi-user Pufferfish Privacy
Ni Ding, Songpei Lu, Wenjing Yang, Zijian Zhang
TL;DR
The paper tackles achieving individual indistinguishability in multi-user aggregated queries under the pufferfish privacy framework by calibrating Laplace noise using the Kantorovich (Wasserstein-1) mechanism. It derives explicit sufficient conditions for four secret-pair sets—covering value changes, presence/absence, and distribution shifts—showing that privacy guarantees can depend primarily on the statistics of the individual user involved, with attendance probability largely inconsequential. For binary data, the authors provide relaxations that reduce noise and improve data utility, and they employ numerical methods (Brent’s method) to approximate optimal noise scales under relaxed conditions. The framework is validated conceptually with experiments indicating practical privacy protection when adding, removing, or modifying a class of users, offering a tractable approach to privacy-utility trade-offs in tabular data releases.
Abstract
This paper studies how to achieve individual indistinguishability by pufferfish privacy in aggregated query to a multi-user system. It is assumed that each user reports realization of a random variable. We study how to calibrate Laplace noise, added to the query answer, to attain pufferfish privacy when user changes his/her reported data value, leaves the system and is replaced by another use with different randomness. Sufficient conditions are derived for all scenarios for attaining statistical indistinguishability on four sets of secret pairs. They are derived using the existing Kantorovich method (Wasserstain metric of order $1$). These results can be applied to attain indistinguishability when a certain class of users is added or removed from a tabular data. It is revealed that attaining indifference in individual's data is conditioned on the statistics of this user only. For binary (Bernoulli distributed) random variables, the derived sufficient conditions can be further relaxed to reduce the noise and improve data utility.
