Mjolnir: Breaking the Shield of Perturbation-Protected Gradients via Adaptive Diffusion
Xuan Liu, Siqi Cai, Qihua Zhou, Song Guo, Ruibin Li, Kaiwei Lin
TL;DR
This paper addresses the vulnerability of perturbation-based gradient protections in Federated Learning by introducing Mjölnir, a diffusion-based gradient leakage attack. It leverages a surrogate gradient data supply model and a Gradient Diffusion Model, with an adaptive diffusion parameter $M$, to denoise perturbed gradients and recover original gradients without access to the original model or external data. The key contributions are: (1) revealing the diffusion properties of gradient perturbations, (2) proposing Mjölnir as the first general gradient diffusion attack, and (3) empirically demonstrating strong gradient denoising and private data recovery across DP and non-DP perturbations for DNN/CNN models, along with ablation studies on variant configurations. The findings highlight a substantive privacy risk in gradient perturbation protections and motivate the development of defense strategies beyond perturbation-based approaches for FL privacy.
Abstract
Perturbation-based mechanisms, such as differential privacy, mitigate gradient leakage attacks by introducing noise into the gradients, thereby preventing attackers from reconstructing clients' private data from the leaked gradients. However, can gradient perturbation protection mechanisms truly defend against all gradient leakage attacks? In this paper, we present the first attempt to break the shield of gradient perturbation protection in Federated Learning for the extraction of private information. We focus on common noise distributions, specifically Gaussian and Laplace, and apply our approach to DNN and CNN models. We introduce Mjolnir, a perturbation-resilient gradient leakage attack that is capable of removing perturbations from gradients without requiring additional access to the original model structure or external data. Specifically, we leverage the inherent diffusion properties of gradient perturbation protection to develop a novel diffusion-based gradient denoising model for Mjolnir. By constructing a surrogate client model that captures the structure of perturbed gradients, we obtain crucial gradient data for training the diffusion model. We further utilize the insight that monitoring disturbance levels during the reverse diffusion process can enhance gradient denoising capabilities, allowing Mjolnir to generate gradients that closely approximate the original, unperturbed versions through adaptive sampling steps. Extensive experiments demonstrate that Mjolnir effectively recovers the protected gradients and exposes the Federated Learning process to the threat of gradient leakage, achieving superior performance in gradient denoising and private data recovery.
