The Gradient Puppeteer: Adversarial Domination in Gradient Leakage Attacks through Model Poisoning
Kunlan Xiang, Haomiao Yang, Meng Hao, Shaofeng Li, Haoxin Wang, Zikang Ding, Wenbo Jiang, Tianwei Zhang
TL;DR
This paper tackles privacy risks in Federated Learning by reframing Active Gradient Leakage Attacks (AGLAs) through a backdoor-theoretic lens centered on per-sample gradient contributions, encapsulated by the parameter $\lambda$. It identifies fundamental limitations of prior AGLAs—partial batch coverage and detectability—and proposes Enhanced Gradient Global Vulnerability (EGGV), which poisons the global model and uses a gradient projector and discriminator to uniformly amplify leakage across all samples. The authors prove the existence of optimal poisoned parameters and derive stealth guarantees, while extensive experiments show EGGV achieves complete batch reconstruction and outperforms state-of-the-art methods by about 43% in PSNR and 45% in D-SNR. These results highlight a new class of privacy risks in FL and motivate the development of robust defenses against balanced, stealthy gradient-poisoning attacks.
Abstract
In Federated Learning (FL), clients share gradients with a central server while keeping their data local. However, malicious servers could deliberately manipulate the models to reconstruct clients' data from shared gradients, posing significant privacy risks. Although such active gradient leakage attacks (AGLAs) have been widely studied, they suffer from two severe limitations: (i) coverage: no existing AGLAs can reconstruct all samples in a batch from the shared gradients; (ii) stealthiness: no existing AGLAs can evade principled checks of clients. In this paper, we address these limitations with two core contributions. First, we introduce a new theoretical analysis approach, which uniformly models AGLAs as backdoor poisoning. This analysis approach reveals that the core principle of AGLAs is to bias the gradient space to prioritize the reconstruction of a small subset of samples while sacrificing the majority, which theoretically explains the above limitations of existing AGLAs. Second, we propose Enhanced Gradient Global Vulnerability (EGGV), the first AGLA that achieves complete attack coverage while evading client-side detection. In particular, EGGV employs a gradient projector and a jointly optimized discriminator to assess gradient vulnerability, steering the gradient space toward the point most prone to data leakage. Extensive experiments show that EGGV achieves complete attack coverage and surpasses state-of-the-art (SOTA) with at least a 43% increase in reconstruction quality (PSNR) and a 45% improvement in stealthiness (D-SNR).
