Pufferfish Privacy: An Information-Theoretic Study
Theshani Nuradha, Ziv Goldfeld
TL;DR
This work generalizes differential privacy through Pufferfish privacy and introduces an information-theoretic formulation, ε-MI PP, to quantify privacy with domain knowledge via conditional mutual information constraints. It develops a structured PP framework using private/public function pairs connected by a bipartite graph, proving that ε-MI PP lies between ε-PP and (ε,δ)-PP and establishing properties such as convexity, post-processing, and composability. The paper then designs noise mechanisms (Laplace, Gaussian) with variance- or covariance-based bounds that guarantee ε-MI PP, and proposes projection-based methods to handle high-dimensional queries, along with explicit dependence on private functions. Auditing tools based on sliced mutual information (SMI) enable practical privacy verification and PP auditing in high dimensions, complemented by applications to private mean estimation and algorithmic stability. Together, these results offer flexible privacy-utility tradeoffs that exploit distributional knowledge, along with scalable auditing and applicability to modern private inference and learning tasks.
Abstract
Pufferfish privacy (PP) is a generalization of differential privacy (DP), that offers flexibility in specifying sensitive information and integrates domain knowledge into the privacy definition. Inspired by the illuminating formulation of DP in terms of mutual information due to Cuff and Yu, this work explores PP through the lens of information theory. We provide an information-theoretic formulation of PP, termed mutual information PP (MI PP), in terms of the conditional mutual information between the mechanism and the secret, given the public information. We show that MI PP is implied by the regular PP and characterize conditions under which the reverse implication is also true, recovering the relationship between DP and its information-theoretic variant as a special case. We establish convexity, composability, and post-processing properties for MI PP mechanisms and derive noise levels for the Gaussian and Laplace mechanisms. The obtained mechanisms are applicable under relaxed assumptions and provide improved noise levels in some regimes. Lastly, applications to auditing privacy frameworks, statistical inference tasks, and algorithm stability are explored.
