SoK: On Gradient Leakage in Federated Learning
Jiacheng Du, Jiahui Hu, Zhibo Wang, Peng Sun, Neil Zhenqiang Gong, Kui Ren, Chun Chen
TL;DR
This work reframes gradient leakage threats in federated learning by evaluating GIAs under practical FL conditions rather than idealized settings. It develops a systematization of GIAs across system model, threat model, attack, and defense, and identifies training setup, model, and post-processing as the three pivotal factors shaping leakage risk. Through theoretical analysis and extensive experiments, the authors show that GIAs are constrained, fragile, and easily defensible in real-world FL, with leakage most pronounced in early training and highly sensitive to architectural choices. They also demonstrate that simple post-processing techniques can defend effectively with minimal utility loss, suggesting that practical FL systems can be made robust against GIAs without heavy cryptographic overhead. The study culminates in actionable guidance and metrics (e.g., IGSA, BRR, WSRR) and calls for realistic, cross-domain defense and threat analyses as FL usage expands into language, multimodal, and large-scale settings.
Abstract
Federated learning (FL) facilitates collaborative model training among multiple clients without raw data exposure. However, recent studies have shown that clients' private training data can be reconstructed from shared gradients in FL, a vulnerability known as gradient inversion attacks (GIAs). While GIAs have demonstrated effectiveness under \emph{ideal settings and auxiliary assumptions}, their actual efficacy against \emph{practical FL systems} remains under-explored. To address this gap, we conduct a comprehensive study on GIAs in this work. We start with a survey of GIAs that establishes a timeline to trace their evolution and develops a systematization to uncover their inherent threats. By rethinking GIA in practical FL systems, three fundamental aspects influencing GIA's effectiveness are identified: \textit{training setup}, \textit{model}, and \textit{post-processing}. Guided by these aspects, we perform extensive theoretical and empirical evaluations of SOTA GIAs across diverse settings. Our findings highlight that GIA is notably \textit{constrained}, \textit{fragile}, and \textit{easily defensible}. Specifically, GIAs exhibit inherent limitations against practical local training settings. Additionally, their effectiveness is highly sensitive to the trained model, and even simple post-processing techniques applied to gradients can serve as effective defenses. Our work provides crucial insights into the limited threats of GIAs in practical FL systems. By rectifying prior misconceptions, we hope to inspire more accurate and realistic investigations on this topic.
