Gradient Inversion of Federated Diffusion Models
Jiyue Huang, Chi Hong, Lydia Y. Chen, Stefanie Roos
TL;DR
This work investigates privacy risks in federated diffusion-model training by demonstrating gradient inversion attacks that recover training data from client gradients. It introduces GIDM, a two-phase attack that leverages the diffusion model as a prior to constrain the inversion search and a pixel-wise fine-tuning step to achieve high-fidelity reconstructions, and GIDM+ for scenarios where training noise $\boldsymbol{\epsilon}$ and timestep $t$ are privately held. The authors show that, when $\{\boldsymbol{\epsilon},t\}$ are known, 128×128 reconstructions can nearly reproduce ground-truth images, and even with private parameters, the triple-optimization strategy enables substantial reconstruction on smaller images like CIFAR-100 (32×32). The results underscore a strong privacy risk in gradient sharing for diffusion models and motivate the development of defense strategies for federated diffusion training.
Abstract
Diffusion models are becoming defector generative models, which generate exceptionally high-resolution image data. Training effective diffusion models require massive real data, which is privately owned by distributed parties. Each data party can collaboratively train diffusion models in a federated learning manner by sharing gradients instead of the raw data. In this paper, we study the privacy leakage risk of gradient inversion attacks. First, we design a two-phase fusion optimization, GIDM, to leverage the well-trained generative model itself as prior knowledge to constrain the inversion search (latent) space, followed by pixel-wise fine-tuning. GIDM is shown to be able to reconstruct images almost identical to the original ones. Considering a more privacy-preserving training scenario, we then argue that locally initialized private training noise $ε$ and sampling step t may raise additional challenges for the inversion attack. To solve this, we propose a triple-optimization GIDM+ that coordinates the optimization of the unknown data, $ε$ and $t$. Our extensive evaluation results demonstrate the vulnerability of sharing gradient for data protection of diffusion models, even high-resolution images can be reconstructed with high quality.
