Practical Feasibility of Gradient Inversion Attacks in Federated Learning
Viktor Valadi, Mattias Åkesson, Johan Östman, Fazeleh Hoseini, Salman Toor, Andreas Hellander
TL;DR
This paper investigates the practical privacy risk of gradient inversion attacks in federated learning for image-based tasks, questioning the prevalence of high-fidelity reconstructions in production-like systems. It adopts a realism-driven methodology, evaluating modern architectures across ImageNet-scale data and object-detection tasks, while analyzing how architectural choices, training dynamics, and attacker assumptions constrain leakage. The key contributions include large-scale cross-architecture evaluation showing that canonical modern vision models largely resist meaningful gradient inversion under realistic training, a principled framework distinguishing feasible leakage from upper-bound demonstrations, and controlled initialization studies that separate gradient information from priors. The findings have practical implications for privacy risk assessment in FL, suggesting that gradient inversion is not a critical threat in production-grade vision systems, though they also highlight the need to explore subtler leakage channels and distributional information beyond full data reconstruction.
Abstract
Gradient inversion attacks are often presented as a serious privacy threat in federated learning, with recent work reporting increasingly strong reconstructions under favorable experimental settings. However, it remains unclear whether such attacks are feasible in modern, performance-optimized systems deployed in practice. In this work, we evaluate the practical feasibility of gradient inversion for image-based federated learning. We conduct a systematic study across multiple datasets and tasks, including image classification and object detection, using canonical vision architectures at contemporary resolutions. Our results show that while gradient inversion remains possible for certain legacy or transitional designs under highly restrictive assumptions, modern, performance-optimized models consistently resist meaningful reconstruction visually. We further demonstrate that many reported successes rely on upper-bound settings, such as inference mode operation or architectural simplifications which do not reflect realistic training pipelines. Taken together, our findings indicate that, under an honest-but-curious server assumption, high-fidelity image reconstruction via gradient inversion does not constitute a critical privacy risk in production-optimized federated learning systems, and that practical risk assessments must carefully distinguish diagnostic attack settings from real-world deployments.
