Phoenix: A Federated Generative Diffusion Model
Fiona Victoria Stanley Jothiraj, Afra Mashhadi
TL;DR
Phoenix addresses privacy and data-ownership concerns in generative AI by federatedly training an unconditional diffusion model (DDPM) across distributed data sources. It introduces two core strategies—Data Sharing Strategy with parameters $\beta$ and $\alpha$, and Personalization layers with Threshold Filtering—to mitigate non-IID data effects and improve sample fidelity and mode coverage under FL. Across CIFAR-10 experiments, these methods yield meaningful improvements over GAN baselines and the default diffusion-FL setup, particularly in non-IID settings, while maintaining privacy and reducing inter-site communication. This work demonstrates the feasibility of high-quality, diverse synthetic data generation in a privacy-preserving, distributed regime and points to practical directions for speedups, real-world deployments, and further privacy enhancements.
Abstract
Generative AI has made impressive strides in enabling users to create diverse and realistic visual content such as images, videos, and audio. However, training generative models on large centralized datasets can pose challenges in terms of data privacy, security, and accessibility. Federated learning (FL) is an approach that uses decentralized techniques to collaboratively train a shared deep learning model while retaining the training data on individual edge devices to preserve data privacy. This paper proposes a novel method for training a Denoising Diffusion Probabilistic Model (DDPM) across multiple data sources using FL techniques. Diffusion models, a newly emerging generative model, show promising results in achieving superior quality images than Generative Adversarial Networks (GANs). Our proposed method Phoenix is an unconditional diffusion model that leverages strategies to improve the data diversity of generated samples even when trained on data with statistical heterogeneity or Non-IID (Non-Independent and Identically Distributed) data. We demonstrate how our approach outperforms the default diffusion model in an FL setting. These results indicate that high-quality samples can be generated by maintaining data diversity, preserving privacy, and reducing communication between data sources, offering exciting new possibilities in the field of generative AI.
