FedDefender: Backdoor Attack Defense in Federated Learning

Waris Gill; Ali Anwar; Muhammad Ali Gulzar

FedDefender: Backdoor Attack Defense in Federated Learning

Waris Gill, Ali Anwar, Muhammad Ali Gulzar

TL;DR

This paper addresses backdoor attacks in Federated Learning by introducing FedDefender, a defense that uses differential testing to fingerprint neuron activations of client models on randomly generated inputs. At the central server, FedDefender computes a malicious-confidence score for each client and reduces a suspect client's contribution when the score surpasses a threshold, effectively altering the aggregation from FedAvg to exclude or diminish malicious updates. Empirically, FedDefender reduces the Attack Success Rate (ASR) to roughly 10% on MNIST and FashionMNIST with 20 or 30 clients while maintaining global accuracy, outperforming both FedAvg and NormClipping. The authors provide a public artifact and argue that the approach is protocol-agnostic and requires no access to raw client data, illustrating a practical defense inspired by software testing principles for FL.

Abstract

Federated Learning (FL) is a privacy-preserving distributed machine learning technique that enables individual clients (e.g., user participants, edge devices, or organizations) to train a model on their local data in a secure environment and then share the trained model with an aggregator to build a global model collaboratively. In this work, we propose FedDefender, a defense mechanism against targeted poisoning attacks in FL by leveraging differential testing. Our proposed method fingerprints the neuron activations of clients' models on the same input and uses differential testing to identify a potentially malicious client containing a backdoor. We evaluate FedDefender using MNIST and FashionMNIST datasets with 20 and 30 clients, and our results demonstrate that FedDefender effectively mitigates such attacks, reducing the attack success rate (ASR) to 10\% without deteriorating the global model performance.

FedDefender: Backdoor Attack Defense in Federated Learning

TL;DR

Abstract

Paper Structure (6 sections, 1 equation, 4 figures, 1 algorithm)

This paper contains 6 sections, 1 equation, 4 figures, 1 algorithm.

Introduction
Background and Related Work
Threat Model
FedDefender Design
Evaluation
Future Work and Conclusion

Figures (4)

Figure 1: The scaling factor increases the strength of the malicious client by increasing the number of training examples, $n_k$, by a factor of $X$. This enhances the chances of successfully injecting a backdoor in the global model $W^{t+1}$.
Figure 2: Comparison of FedDefender, with the baseline FedAvg and NormClipping defense mechanisms. Figures indicate that FedDefender successfully mitigates the attack and lowers the ASR close to 10% .
Figure 3: Evaluation of the impact of the malicious confidence threshold. FedDefender is unable to mitigate the attack if the potential malicious client is not aggressively penalized.
Figure 4: FedDefender performs similarly to FedAvg and NormClipping without penalizing benign clients.

FedDefender: Backdoor Attack Defense in Federated Learning

TL;DR

Abstract

FedDefender: Backdoor Attack Defense in Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (4)