Federated Learning with Enhanced Privacy via Model Splitting and Random Client Participation
Yiwei Li, Shuai Wang, Zhuojun Tian, Xiuhua Wang, Shijian Su
TL;DR
MS-PAFL targets a central difficulty of differentially private Federated Learning: preserving model utility despite the noise that privacy guarantees require. It combines model splitting with privacy amplification from random client participation and local data subsampling. Each client's model is partitioned into a private submodel that stays on the device and a public submodel that is shared for aggregation. Noise is injected only into the public submodel, so the framework achieves stronger privacy with less utility loss. The authors provide formal single-round and total privacy guarantees, showing that joint amplification tightens the privacy bounds beyond what either sampling method achieves alone. Experiments on the Adult dataset show an improved privacy-utility trade-off relative to standard DP-FedAvg. The result is a practical, theoretically grounded approach to privacy-preserving FL that supports flexible participation and scalable deployment while maintaining high model performance under stringent privacy budgets.
Abstract
Federated Learning (FL) often adopts differential privacy (DP) to protect client data, but the noise required for privacy guarantees can substantially degrade model accuracy. To address this challenge, we propose model-splitting privacy-amplified federated learning (MS-PAFL), a novel framework that combines structural model splitting with statistical privacy amplification. In this framework, each client's model is partitioned into a private submodel, retained locally, and a public submodel, shared for global aggregation. Calibrated Gaussian noise is injected only into the public submodel, which confines its adverse impact and preserves the utility of the local model. We further present a rigorous theoretical analysis that characterizes the joint privacy amplification achieved through random client participation and local data subsampling under this architecture. The analysis provides tight bounds on both single-round and total privacy loss, demonstrating that MS-PAFL significantly reduces the noise needed to meet a target privacy level. Extensive experiments validate our theoretical findings, showing that MS-PAFL consistently attains a superior privacy-utility trade-off and enables the training of highly accurate models under strong privacy guarantees.
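To make the mechanism described in the abstract concrete, the following is a minimal sketch of one communication round, not the authors' implementation. It assumes a linear model whose first `k` weights form the public submodel and whose remaining weights form the private submodel, independent (Poisson-style) sampling of clients and of local records, and DP-SGD-style per-sample clipping for the public gradient. The names `local_step`, `run_round`, `participation_prob`, `sample_rate`, `clip_norm`, and `noise_multiplier` are hypothetical, and the noise scale shown here is a placeholder rather than the calibration derived in the paper.

```python
import numpy as np


def local_step(w_pub, w_priv, X, y, *, clip_norm, noise_multiplier, lr, rng):
    """One local step on a linear model split into public and private weights.

    Assumption: the first k features feed the public weights and the rest
    feed the private weights; the loss is mean squared error.
    """
    k = w_pub.size
    residual = X[:, :k] @ w_pub + X[:, k:] @ w_priv - y  # shape (n,)

    # Private submodel: plain gradient step. It never leaves the client,
    # so no noise is added to it.
    grad_priv = X[:, k:].T @ residual / len(y)
    new_priv = w_priv - lr * grad_priv

    # Public submodel: clip each per-sample gradient, sum, add Gaussian noise.
    per_sample = residual[:, None] * X[:, :k]  # shape (n, k)
    norms = np.linalg.norm(per_sample, axis=1, keepdims=True)
    per_sample = per_sample * np.minimum(1.0, clip_norm / (norms + 1e-12))
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=k)
    grad_pub = (per_sample.sum(axis=0) + noise) / len(y)
    new_pub = w_pub - lr * grad_pub
    return new_pub, new_priv


def run_round(global_pub, private_parts, datasets, *, participation_prob,
              sample_rate, clip_norm, noise_multiplier, lr, rng):
    """Run one round; returns the new public weights and the upload count.

    `private_parts` is updated in place for clients that participate.
    """
    uploads = []
    for cid, (X, y) in enumerate(datasets):
        # Random client participation: each client joins independently.
        if rng.random() >= participation_prob:
            continue
        # Local data subsampling: each record is kept independently.
        mask = rng.random(len(y)) < sample_rate
        if not mask.any():
            continue
        new_pub, private_parts[cid] = local_step(
            global_pub, private_parts[cid], X[mask], y[mask],
            clip_norm=clip_norm, noise_multiplier=noise_multiplier,
            lr=lr, rng=rng,
        )
        uploads.append(new_pub)  # only the noisy public part is shared
    if uploads:
        global_pub = np.mean(uploads, axis=0)
    return global_pub, len(uploads)
```

A caller would create `rng = np.random.default_rng(seed)`, initialize one private weight vector per client, and call `run_round` repeatedly. The design point the sketch illustrates is that the server only ever sees the averaged noisy public weights, while each client's private weights are trained without noise and stay local.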
