Exploring the Vulnerabilities of Federated Learning: A Deep Dive into Gradient Inversion Attacks
Pengxin Guo, Runxi Wang, Shuang Zeng, Jinjing Zhu, Haoning Jiang, Yanran Wang, Yuyin Zhou, Feifei Wang, Hui Xiong, Liangqiong Qu
TL;DR
This work comprehensively analyzes gradient inversion attacks in federated learning by classifying GIAs into optimization-based (OP-GIA), generation-based (GEN-GIA), and analytics-based (ANA-GIA) categories, and by introducing theoretical bounds that relate reconstruction error to batch size and image resolution. It provides proofs of key results, including an error bound (Theorem 1) and a gradient-similarity proposition, and validates them through extensive experiments on CIFAR-10/100, ImageNet, and CelebA using multiple backbones and attack variants. The empirical results reveal OP-GIA as the most practical yet limited threat, GEN-GIA as highly dependent on external factors (e.g., pre-trained generators, activation functions), and ANA-GIA as effective but easily detectable, with additional findings on privacy leakage under PEFT. To mitigate these risks, the authors propose a three-stage defense pipeline—avoid Sigmoid activations and adopt more complex architectures, increase local batch steps, and implement client-side validation—along with strategic guidance for attack designers and a public repository for ongoing tracking. Overall, the paper delivers a nuanced taxonomy, supporting theory, and pragmatic defense recommendations to reduce gradient leakage in FL while highlighting areas for future investigation.
Abstract
Federated Learning (FL) has emerged as a promising privacy-preserving collaborative model training paradigm without sharing raw data. However, recent studies have revealed that private information can still be leaked through shared gradient information and attacked by Gradient Inversion Attacks (GIA). While many GIA methods have been proposed, a detailed analysis, evaluation, and summary of these methods are still lacking. Although various survey papers summarize existing privacy attacks in FL, few studies have conducted extensive experiments to unveil the effectiveness of GIA and their associated limiting factors in this context. To fill this gap, we first undertake a systematic review of GIA and categorize existing methods into three types, i.e., \textit{optimization-based} GIA (OP-GIA), \textit{generation-based} GIA (GEN-GIA), and \textit{analytics-based} GIA (ANA-GIA). Then, we comprehensively analyze and evaluate the three types of GIA in FL, providing insights into the factors that influence their performance, practicality, and potential threats. Our findings indicate that OP-GIA is the most practical attack setting despite its unsatisfactory performance, while GEN-GIA has many dependencies and ANA-GIA is easily detectable, making them both impractical. Finally, we offer a three-stage defense pipeline to users when designing FL frameworks and protocols for better privacy protection and share some future research directions from the perspectives of attackers and defenders that we believe should be pursued. We hope that our study can help researchers design more robust FL frameworks to defend against these attacks.
