DROP: Poison Dilution via Knowledge Distillation for Federated Learning
Georgios Syros, Anshuman Suri, Farinaz Koushanfar, Cristina Nita-Rotaru, Alina Oprea
TL;DR
DROP tackles the stealthy backdoor threat in federated learning by combining agglomerative clustering, activity monitoring, and logit-driven knowledge distillation to cleanse the global model. The method further extends with DROPlet as a lightweight variant and demonstrates robust defense across diverse learning configurations and data distributions, with near-zero ASR in many setups. While achieving strong security, DROP trades off some main task accuracy, particularly under non-IID heavy settings, and the authors release their code to enable broader evaluation and benchmarking. Overall, DROP offers a practical, configuration-robust defense for backdoor attacks in realistic FL deployments, addressing both aggressive and stealthy adversaries.
Abstract
Federated Learning is vulnerable to adversarial manipulation, where malicious clients can inject poisoned updates to influence the global model's behavior. While existing defense mechanisms have made notable progress, they fail to protect against adversaries that aim to induce targeted backdoors under different learning and attack configurations. To address this limitation, we introduce DROP (Distillation-based Reduction Of Poisoning), a novel defense mechanism that combines clustering and activity-tracking techniques with extraction of benign behavior from clients via knowledge distillation to tackle stealthy adversaries that manipulate low data poisoning rates and diverse malicious client ratios within the federation. Through extensive experimentation, our approach demonstrates superior robustness compared to existing defenses across a wide range of learning configurations. Finally, we evaluate existing defenses and our method under the challenging setting of non-IID client data distribution and highlight the challenges of designing a resilient FL defense in this setting.
