Enhanced Privacy Leakage from Noise-Perturbed Gradients via Gradient-Guided Conditional Diffusion Models
Jiayang Meng, Tao Huang, Hong Chen, Chen Hou, Guolong Zheng
TL;DR
This paper addresses the privacy leakage risk in Federated Learning where leaked gradients can reveal private data. It introduces Gradient-guided Conditional Diffusion Models (GG-CDMs), which leverage the denoising capabilities of conditional diffusion models to reconstruct high-fidelity images from Gaussian-noised gradients without requiring target-data priors. The authors provide rigorous reconstruction-error bounds, convergence guarantees for the attack loss, and a new Reconstruction Vulnerability (RV) metric to quantify model susceptibility. Extensive experiments across multiple datasets and attacked architectures show that GG-CDMs achieve superior reconstructions under noise defenses, underscoring a significant privacy risk and motivating stronger defenses in gradient leakage settings.
Abstract
Federated learning synchronizes models through gradient transmission and aggregation. However, these gradients pose significant privacy risks, as sensitive training data is embedded within them. Existing gradient inversion attacks suffer from significantly degraded reconstruction performance when gradients are perturbed by noise-a common defense mechanism. In this paper, we introduce gradient-guided conditional diffusion models for reconstructing private images from leaked gradients, without prior knowledge of the target data distribution. Our approach leverages the inherent denoising capability of diffusion models to circumvent the partial protection offered by noise perturbation, thereby improving attack performance under such defenses. We further provide a theoretical analysis of the reconstruction error bounds and the convergence properties of the attack loss, characterizing the impact of key factors-such as noise magnitude and attacked model architecture-on reconstruction quality. Extensive experiments demonstrate our attack's superior reconstruction performance with Gaussian noise-perturbed gradients, and confirm our theoretical findings.
