Hiding in Plain Sight: Disguising Data Stealing Attacks in Federated Learning
Kostadin Garov, Dimitar I. Dimitrov, Nikola Jovanović, Martin Vechev
TL;DR
The paper addresses privacy vulnerabilities in federated learning posed by malicious servers, focusing on client-side detectability. It demonstrates that existing malicious-server attacks are detectable under principled checks and introduces SEER, a secret-decoder-based framework that embeds data disaggregation in a hidden gradient space to reconstruct client data, even with large batches and secure aggregation. SEER is trained end-to-end with a shared model and a secret decoder/reconstructor, and its efficacy is validated through extensive experiments on CIFAR-10/100 and ImageNet, showing high reconstruction rates and robustness to distribution shifts. The work argues for principled, client-side defenses and provides a foundation for evaluating and improving FL privacy protections in real-world deployments.
Abstract
Malicious server (MS) attacks have enabled the scaling of data stealing in federated learning to large batch sizes and secure aggregation, settings previously considered private. However, many concerns regarding the client-side detectability of MS attacks were raised, questioning their practicality. In this work, for the first time, we thoroughly study client-side detectability. We first demonstrate that all prior MS attacks are detectable by principled checks, and formulate a necessary set of requirements that a practical MS attack must satisfy. Next, we propose SEER, a novel attack framework that satisfies these requirements. The key insight of SEER is the use of a secret decoder, jointly trained with the shared model. We show that SEER can steal user data from gradients of realistic networks, even for large batch sizes of up to 512 and under secure aggregation. Our work is a promising step towards assessing the true vulnerability of federated learning in real-world settings.
