Training Diffusion Models with Federated Learning
Matthijs de Goede, Bart Cox, Jérémie Decouchant
TL;DR
The paper addresses privacy, copyright, and data authority concerns in diffusion-model training by introducing FedDiffuse, a federated learning framework that adapts FedAvg to train a DDPM on a shared UNet backbone. It introduces three communication-efficient strategies—USplit, ULatDec, and UDec—that leverage the UNet structure to reduce exchanged parameters while preserving image quality, achieving up to $74\%$ reduction in communication with IID data. Experimental results on Fashion-MNIST (and CelebA) show that FedDiffuse can approach centralized performance in FID with a smaller number of clients, though robustness to non-IID skew varies by method. Overall, the approach demonstrates that diffusion models can be trained in a privacy-preserving, decentralized manner, potentially broadening participation beyond major tech entities.
Abstract
The training of diffusion-based models for image generation is predominantly controlled by a select few Big Tech companies, raising concerns about privacy, copyright, and data authority due to their lack of transparency regarding training data. To ad-dress this issue, we propose a federated diffusion model scheme that enables the independent and collaborative training of diffusion models without exposing local data. Our approach adapts the Federated Averaging (FedAvg) algorithm to train a Denoising Diffusion Model (DDPM). Through a novel utilization of the underlying UNet backbone, we achieve a significant reduction of up to 74% in the number of parameters exchanged during training,compared to the naive FedAvg approach, whilst simultaneously maintaining image quality comparable to the centralized setting, as evaluated by the FID score.
