Partner in Crime: Boosting Targeted Poisoning Attacks against Federated Learning
Shihua Sun, Shridatt Sugrim, Angelos Stavrou, Haining Wang
TL;DR
Federated Learning systems are vulnerable to targeted poisoning that aims to misclassify samples from a specific source class to a designated target class. The authors propose BoTPA, a general pre-training boosting framework that selects intermediate classes via Input Similarity from Contribution Degrees and crafts soft labels for an Amplifier set to amplify the attack effect, without adding new malicious data. Theoretical analysis shows boosted updates align with vanilla attack directions, and experiments across FMNIST, CIFAR-10, and CH-MNIST demonstrate substantial RI-ASR gains under both data and model poisoning, even in the presence of Byzantine defenses like Krum, Median, and Flame, and under non-IID data. Visualizations of latent spaces corroborate the boundary-shaping effect, underscoring BoTPA’s impact on decision boundaries. These results highlight the need for density-based defenses and point to new directions for FL security research and defense design.
Abstract
Federated Learning (FL) exposes vulnerabilities to targeted poisoning attacks that aim to cause misclassification specifically from the source class to the target class. However, using well-established defense frameworks, the poisoning impact of these attacks can be greatly mitigated. We introduce a generalized pre-training stage approach to Boost Targeted Poisoning Attacks against FL, called BoTPA. Its design rationale is to leverage the model update contributions of all data points, including ones outside of the source and target classes, to construct an Amplifier set, in which we falsify the data labels before the FL training process, as a means to boost attacks. We comprehensively evaluate the effectiveness and compatibility of BoTPA on various targeted poisoning attacks. Under data poisoning attacks, our evaluations reveal that BoTPA can achieve a median Relative Increase in Attack Success Rate (RI-ASR) between 15.3% and 36.9% across all possible source-target class combinations, with varying percentages of malicious clients, compared to its baseline. In the context of model poisoning, BoTPA attains RI-ASRs ranging from 13.3% to 94.7% in the presence of the Krum and Multi-Krum defenses, from 2.6% to 49.2% under the Median defense, and from 2.9% to 63.5% under the Flame defense.
