OASIS: Offsetting Active Reconstruction Attacks in Federated Learning
Tre' R. Jeter, Truc Nguyen, Raed Alharbi, My T. Thai
TL;DR
This work tackles the risk that dishonest federated servers can actively reconstruct private user data via gradient inversion. It introduces OASIS, an image-augmentation defense that ensures gradients encode a linear combination of original and augmented samples, preventing exact reconstruction while preserving FL performance. The authors analyze the attack principle, generalize attacks like CAH and RTF, and show that augmenting data to create common activation patterns disrupts single-sample gradient extraction. Experimental results on ImageNet and CIFAR100 demonstrate substantial reductions in reconstruction quality with only marginal accuracy loss, offering a scalable defense for privacy-preserving federated learning.
Abstract
Federated Learning (FL) has garnered significant attention for its potential to protect user privacy while enhancing model training efficiency. For that reason, FL has found its use in various domains, from healthcare to industrial engineering, especially where data cannot be easily exchanged due to sensitive information or privacy laws. However, recent research has demonstrated that FL protocols can be easily compromised by active reconstruction attacks executed by dishonest servers. These attacks involve the malicious modification of global model parameters, allowing the server to obtain a verbatim copy of users' private data by inverting their gradient updates. Tackling this class of attack remains a crucial challenge due to the strong threat model. In this paper, we propose a defense mechanism, namely OASIS, based on image augmentation that effectively counteracts active reconstruction attacks while preserving model performance. We first uncover the core principle of gradient inversion that enables these attacks and theoretically identify the main conditions by which the defense can be robust regardless of the attack strategies. We then construct our defense with image augmentation showing that it can undermine the attack principle. Comprehensive evaluations demonstrate the efficacy of the defense mechanism highlighting its feasibility as a solution.
