Privacy-preserving Federated Primal-dual Learning for Non-convex and Non-smooth Problems with Model Sparsification
Yiwei Li, Chien-Wei Huang, Shuai Wang, Chong-Yung Chi, Tony Q. S. Quek
TL;DR
The paper tackles privacy-preserving federated learning (FL) for non-convex and non-smooth objectives under communication constraints. It introduces two primal-dual FL algorithms based on differential privacy (DP), DP-FedPDM and BSDP-FedPDM, and uses bidirectional model sparsification to reduce signaling overhead while maintaining privacy guarantees. The authors provide privacy analyses showing how the per-round and total privacy losses scale with client participation and the number of local updates, and a convergence proof establishing an $\mathcal{O}(1/\zeta)$ round complexity to reach a $\zeta$-stationary solution. Empirical results on the MNIST and Adult datasets show better performance and greater robustness to DP noise than state-of-the-art baselines, which makes the approach relevant to efficient and private FL in IoT and edge systems.
Abstract
Federated learning (FL) is a rapidly growing research area in which a model is trained over massively distributed clients under the orchestration of a parameter server (PS), without sharing the clients' data. This paper addresses a class of federated problems with non-convex and non-smooth loss functions, which are prevalent in FL applications but challenging to handle because of their non-convex and non-smooth nature and the conflicting requirements of communication efficiency and privacy protection. We propose a novel federated primal-dual algorithm with bidirectional model sparsification tailored to non-convex and non-smooth FL problems, with differential privacy applied to provide a privacy guarantee. We also present its distinctive properties, together with privacy and convergence analyses, as guidelines for FL algorithm design. Extensive experiments on real-world data demonstrate the effectiveness of the proposed algorithm and its superior performance over several state-of-the-art FL algorithms, and validate all the analytical results and properties.
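To make the two communication-side ingredients concrete, the following is a minimal sketch of how bidirectional sparsification and a Gaussian DP perturbation could be combined in one round. It is not the paper's algorithm: the primal-dual updates are omitted, and the function names (top_k_sparsify, clip_l2, privatize_and_sparsify, server_round), the top-k rule, the L2 clipping step, the ordering of noise before sparsification, and all parameters (k_up, k_down, clip_norm, noise_std) are assumptions chosen for illustration.

```python
import numpy as np


def top_k_sparsify(vec, k):
    """Keep the k largest-magnitude entries of vec and zero the rest."""
    if k >= vec.size:
        return vec.copy()
    out = np.zeros_like(vec)
    if k <= 0:
        return out
    idx = np.argpartition(np.abs(vec), -k)[-k:]
    out[idx] = vec[idx]
    return out


def clip_l2(vec, clip_norm):
    """Scale vec down so that its L2 norm is at most clip_norm."""
    norm = np.linalg.norm(vec)
    return vec * min(1.0, clip_norm / (norm + 1e-12))


def privatize_and_sparsify(update, k, clip_norm, noise_std, rng):
    """Uplink step (assumed): clip, add Gaussian noise, then sparsify.

    Sparsifying after the noise is added is post-processing of the
    noisy vector, so it does not weaken the Gaussian mechanism.
    """
    clipped = clip_l2(update, clip_norm)
    noisy = clipped + rng.normal(0.0, noise_std, size=update.shape)
    return top_k_sparsify(noisy, k)


def server_round(client_updates, k_up, k_down, clip_norm, noise_std, rng):
    """One hypothetical round: sparse noisy uplink, sparse downlink."""
    uplink = [
        privatize_and_sparsify(u, k_up, clip_norm, noise_std, rng)
        for u in client_updates
    ]
    aggregate = np.mean(uplink, axis=0)
    # Downlink sparsification gives the "bidirectional" part.
    return top_k_sparsify(aggregate, k_down)
```

In this sketch each client transmits at most k_up nonzero entries and the PS broadcasts at most k_down, so the per-round payload in both directions is bounded by the chosen sparsity levels rather than by the model dimension.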
