Navigating Heterogeneity and Privacy in One-Shot Federated Learning with Diffusion Models
Matias Mendieta, Guangyu Sun, Chen Chen
TL;DR
This paper tackles data heterogeneity and privacy in one-shot federated learning by introducing FedDiff, a diffusion-model-based approach that trains class-conditioned generators on clients and synthesizes a global dataset at the server for discriminative model training. It demonstrates that diffusion models offer high sample quality and diversity, yielding substantial gains over state-of-the-art one-shot FL methods under both standard and differential privacy settings. To counteract DP-induced quality drops, the authors propose Fourier Magnitude Filtering (FMF), which improves the usefulness of generated samples for global training. The results show FedDiff achieves notable accuracy improvements (often 5%–20%) across heterogeneous data regimes and datasets, with FMF providing further boosts in challenging DP scenarios, illustrating practical impact for privacy-aware, communication-efficient FL systems.
Abstract
Federated learning (FL) enables multiple clients to train models collectively while preserving data privacy. However, FL faces challenges in terms of communication cost and data heterogeneity. One-shot federated learning has emerged as a solution by reducing communication rounds, improving efficiency, and providing better security against eavesdropping attacks. Nevertheless, data heterogeneity remains a significant challenge, impacting performance. This work explores the effectiveness of diffusion models in one-shot FL, demonstrating their applicability in addressing data heterogeneity and improving FL performance. Additionally, we investigate the utility of our diffusion model approach, FedDiff, compared to other one-shot FL methods under differential privacy (DP). Furthermore, to improve generated sample quality under DP settings, we propose a pragmatic Fourier Magnitude Filtering (FMF) method, enhancing the effectiveness of generated data for global model training.
