Enhanced Privacy Bound for Shuffle Model with Personalized Privacy
Yixuan Liu, Yuhan Liu, Li Xiong, Yujie Gu, Hong Chen
TL;DR
This work tightens privacy guarantees for the shuffle model under personalized local DP by introducing a hypothesis-testing-based method to compute the confounding probability $p$ and applying $f$-DP to bound the central privacy amplification. By precisely modeling clone-generation and leveraging the convexity of the resulting distributions, the authors derive a general, tighter bound for arbitrary DP mechanisms, with worst-case analysis incorporating heterogeneous budgets. Numerical results show the bound outperforms state-of-the-art results in both pure- and approximate-PLDP, demonstrating stronger amplification as the number of users grows and across diverse privacy settings. The approach provides a practical, general framework for quantifying privacy amplification in heterogeneous privacy regimes, with significant implications for real-world distributed data collection systems.
Abstract
The shuffle model of Differential Privacy (DP) is an enhanced privacy protocol which introduces an intermediate trusted server between local users and a central data curator. It significantly amplifies the central DP guarantee by anonymizing and shuffling the local randomized data. Yet, deriving a tight privacy bound is challenging due to its complicated randomization protocol. While most existing work are focused on unified local privacy settings, this work focuses on deriving the central privacy bound for a more practical setting where personalized local privacy is required by each user. To bound the privacy after shuffling, we first need to capture the probability of each user generating clones of the neighboring data points. Second, we need to quantify the indistinguishability between two distributions of the number of clones on neighboring datasets. Existing works either inaccurately capture the probability, or underestimate the indistinguishability between neighboring datasets. Motivated by this, we develop a more precise analysis, which yields a general and tighter bound for arbitrary DP mechanisms. Firstly, we derive the clone-generating probability by hypothesis testing %from a randomizer-specific perspective, which leads to a more accurate characterization of the probability. Secondly, we analyze the indistinguishability in the context of $f$-DP, where the convexity of the distributions is leveraged to achieve a tighter privacy bound. Theoretical and numerical results demonstrate that our bound remarkably outperforms the existing results in the literature.
