Partner in Crime: Boosting Targeted Poisoning Attacks against Federated Learning

Shihua Sun; Shridatt Sugrim; Angelos Stavrou; Haining Wang

Partner in Crime: Boosting Targeted Poisoning Attacks against Federated Learning

Shihua Sun, Shridatt Sugrim, Angelos Stavrou, Haining Wang

TL;DR

Federated Learning systems are vulnerable to targeted poisoning that aims to misclassify samples from a specific source class to a designated target class. The authors propose BoTPA, a general pre-training boosting framework that selects intermediate classes via Input Similarity from Contribution Degrees and crafts soft labels for an Amplifier set to amplify the attack effect, without adding new malicious data. Theoretical analysis shows boosted updates align with vanilla attack directions, and experiments across FMNIST, CIFAR-10, and CH-MNIST demonstrate substantial RI-ASR gains under both data and model poisoning, even in the presence of Byzantine defenses like Krum, Median, and Flame, and under non-IID data. Visualizations of latent spaces corroborate the boundary-shaping effect, underscoring BoTPA’s impact on decision boundaries. These results highlight the need for density-based defenses and point to new directions for FL security research and defense design.

Abstract

Federated Learning (FL) exposes vulnerabilities to targeted poisoning attacks that aim to cause misclassification specifically from the source class to the target class. However, using well-established defense frameworks, the poisoning impact of these attacks can be greatly mitigated. We introduce a generalized pre-training stage approach to Boost Targeted Poisoning Attacks against FL, called BoTPA. Its design rationale is to leverage the model update contributions of all data points, including ones outside of the source and target classes, to construct an Amplifier set, in which we falsify the data labels before the FL training process, as a means to boost attacks. We comprehensively evaluate the effectiveness and compatibility of BoTPA on various targeted poisoning attacks. Under data poisoning attacks, our evaluations reveal that BoTPA can achieve a median Relative Increase in Attack Success Rate (RI-ASR) between 15.3% and 36.9% across all possible source-target class combinations, with varying percentages of malicious clients, compared to its baseline. In the context of model poisoning, BoTPA attains RI-ASRs ranging from 13.3% to 94.7% in the presence of the Krum and Multi-Krum defenses, from 2.6% to 49.2% under the Median defense, and from 2.9% to 63.5% under the Flame defense.

Partner in Crime: Boosting Targeted Poisoning Attacks against Federated Learning

TL;DR

Abstract

Paper Structure (33 sections, 2 theorems, 18 equations, 13 figures, 1 table, 2 algorithms)

This paper contains 33 sections, 2 theorems, 18 equations, 13 figures, 1 table, 2 algorithms.

Introduction
Background and Related Work
FL System
Untargeted Poisoning Attacks in FL systems
Targeted Poisoning Attacks in FL systems
Targeted data poisoning attack
Targeted model poisoning attacks
Defenses
Design and Application of Soft Labels
Threat Model
BoTPA Framework
System Design Overview
Construction of Amplifier Set
Soft Label Design for Amplifier Set
Theoretical Analysis
...and 18 more sections

Key Result

Proposition 1

Suppose the FL model is updated using the gradient descent algorithm by minimizing the cross-entropy loss function. Let $m$ denote the number of local training iterations in each communication round. If a targeted data poisoning attack, attempting to classify source class $s$ as target class $r$, oc where $\delta_{rs}(\boldsymbol{x}, \boldsymbol{w}) = \nabla_{\boldsymbol{w}} \log f_{r}(\boldsymbo

Figures (13)

Figure 1: The FL system under targeted data poisoning and model poisoning attacks: (A) procedure for maliciously changing labels, (B) model manipulation process. Attacker (1) is performing model poisoning, and Attacker (2) is performing data poisoning.
Figure 2: Overview of BoTPA. The right box represents the poisoned FL system, and the left box displays the BoTPA procedures.
Figure 3: Illustration of local model updates under no attack, vanilla poisoning attack, and boosted poisoning attack.
Figure 4: Global model accuracy under vanilla and boosted data poisoning attacks. $s \rightarrow r$ denotes an attack aiming to misclassify data samples from class $s$ to class $r$.
Figure 5: Performance comparison of stealthy model poisoning attacks with varying ratios of malicious clients under Krum and Multi-Krum.
...and 8 more figures

Theorems & Definitions (4)

Proposition 1
proof
Proposition 2
proof

Partner in Crime: Boosting Targeted Poisoning Attacks against Federated Learning

TL;DR

Abstract

Partner in Crime: Boosting Targeted Poisoning Attacks against Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (13)

Theorems & Definitions (4)