Federated Learning with Differential Privacy: An Utility-Enhanced Approach
Kanishka Ranaweera, Dinh C. Nguyen, Pubudu N. Pathirana, David Smith, Ming Ding, Thierry Rakotoarivelo, Aruna Seneviratne
TL;DR
This work addresses privacy in federated learning by integrating a Haar wavelet transform into differentially private SGD and federated averaging. The core idea is to perform noise injection in the wavelet domain, with per-level clipping that reduces the effective noise variance and strengthens utility under the same $(\varepsilon,\delta)$ privacy budget. Theoretical results show improved noise variance bounds by a factor of $\sqrt{\dfrac{2+\log_2(m)}{2}}$, along with convergence guarantees, while empirical experiments on MNIST, Fashion-MNIST, and CIFAR-10 demonstrate superior model performance compared to standard DP baselines. The approach offers a practical path to higher-utility private FL, with future work exploring unequal noise across coefficients and adaptive clipping for non-i.i.d. data settings.
Abstract
Federated learning has emerged as an attractive approach to protect data privacy by eliminating the need for sharing clients' data while reducing communication costs compared with centralized machine learning algorithms. However, recent studies have shown that federated learning alone does not guarantee privacy, as private data may still be inferred from the uploaded parameters to the central server. In order to successfully avoid data leakage, adopting differential privacy (DP) in the local optimization process or in the local update aggregation process has emerged as two feasible ways for achieving sample-level or user-level privacy guarantees respectively, in federated learning models. However, compared to their non-private equivalents, these approaches suffer from a poor utility. To improve the privacy-utility trade-off, we present a modification to these vanilla differentially private algorithms based on a Haar wavelet transformation step and a novel noise injection scheme that significantly lowers the asymptotic bound of the noise variance. We also present a holistic convergence analysis of our proposed algorithm, showing that our method yields better convergence performance than the vanilla DP algorithms. Numerical experiments on real-world datasets demonstrate that our method outperforms existing approaches in model utility while maintaining the same privacy guarantees.
