Breaking the Communication-Privacy-Accuracy Tradeoff with $f$-Differential Privacy
Richeng Jin, Zhonggen Su, Caijun Zhong, Zhaoyang Zhang, Tony Quek, Huaiyu Dai
TL;DR
This work tackles privacy-communication-accuracy tradeoffs in federated data analysis by analyzing discrete-valued local mechanisms under $f$-differential privacy. It derives closed-form $f$-DP guarantees for binomial noise and the binomial mechanism, establishing tight privacy bounds and enabling compression-aware privacy analysis. Building on this, the authors introduce a ternary stochastic compressor that exploits privacy amplification by sparsification, achieving the same mean-squared error (MSE) and communication cost with substantially stronger privacy guarantees in distributed mean estimation. The vector extension leverages GDP via a central limit theorem for $f$-DP and Kashin representations to preserve privacy in high dimensions, and experiments demonstrate improved privacy-accuracy-communication tradeoffs compared to state-of-the-art baselines like SQKR and Gaussian mechanisms. Overall, the work provides a principled framework and practical compressor design for privacy-preserving, communication-efficient federated data analytics, with broad applicability to discrete-valued mechanisms.
Abstract
We consider a federated data analytics problem in which a server coordinates the collaborative data analysis of multiple users with privacy concerns and limited communication capability. The commonly adopted compression schemes introduce information loss into local data while improving communication efficiency, and it remains an open problem whether such discrete-valued mechanisms provide any privacy protection. In this paper, we study the local differential privacy guarantees of discrete-valued mechanisms with finite output space through the lens of $f$-differential privacy (DP). More specifically, we advance the existing literature by deriving tight $f$-DP guarantees for a variety of discrete-valued mechanisms, including the binomial noise and the binomial mechanisms that are proposed for privacy preservation, and the sign-based methods that are proposed for data compression, in closed-form expressions. We further investigate the amplification in privacy by sparsification and propose a ternary stochastic compressor. By leveraging compression for privacy amplification, we improve the existing methods by removing the dependency of accuracy (in terms of mean square error) on communication cost in the popular use case of distributed mean estimation, therefore breaking the three-way tradeoff between privacy, communication, and accuracy. Finally, we discuss the Byzantine resilience of the proposed mechanism and its application in federated learning.
