BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

Yi Liu; Cong Wang; Xingliang Yuan

BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

Yi Liu, Cong Wang, Xingliang Yuan

TL;DR

The paper investigates poisoning in Byzantine-robust Federated Learning by exploiting catastrophic forgetting through a clean-label data poisoning attack. It introduces BadSampler, which uses two adaptive sampling strategies—Top-$\kappa$ sampling and meta-sampling—optimized via Soft Actor-Critic to maximize generalization error while keeping training error low, all under a realistic threat model with $M \le 10\%$ compromised clients. The authors provide a theoretical upper bound on attack-induced gradient shifts and show favorable complexity, and they validate the approach on Fashion-MNIST and CIFAR-10 across multiple defenses, demonstrating significant reductions in accuracy. The work highlights a practical vulnerability in production FL and motivates the development of defenses that monitor training dynamics and generalization drift beyond traditional anomaly detection.

Abstract

Federated Learning (FL) is susceptible to poisoning attacks, wherein compromised clients manipulate the global model by modifying local datasets or sending manipulated model updates. Experienced defenders can readily detect and mitigate the poisoning effects of malicious behaviors using Byzantine-robust aggregation rules. However, the exploration of poisoning attacks in scenarios where such behaviors are absent remains largely unexplored for Byzantine-robust FL. This paper addresses the challenging problem of poisoning Byzantine-robust FL by introducing catastrophic forgetting. To fill this gap, we first formally define generalization error and establish its connection to catastrophic forgetting, paving the way for the development of a clean-label data poisoning attack named BadSampler. This attack leverages only clean-label data (i.e., without poisoned data) to poison Byzantine-robust FL and requires the adversary to selectively sample training data with high loss to feed model training and maximize the model's generalization error. We formulate the attack as an optimization problem and present two elegant adversarial sampling strategies, Top-$κ$ sampling, and meta-sampling, to approximately solve it. Additionally, our formal error upper bound and time complexity analysis demonstrate that our design can preserve attack utility with high efficiency. Extensive evaluations on two real-world datasets illustrate the effectiveness and performance of our proposed attacks.

BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

TL;DR

sampling and meta-sampling—optimized via Soft Actor-Critic to maximize generalization error while keeping training error low, all under a realistic threat model with

compromised clients. The authors provide a theoretical upper bound on attack-induced gradient shifts and show favorable complexity, and they validate the approach on Fashion-MNIST and CIFAR-10 across multiple defenses, demonstrating significant reductions in accuracy. The work highlights a practical vulnerability in production FL and motivates the development of defenses that monitor training dynamics and generalization drift beyond traditional anomaly detection.

Abstract

sampling, and meta-sampling, to approximately solve it. Additionally, our formal error upper bound and time complexity analysis demonstrate that our design can preserve attack utility with high efficiency. Extensive evaluations on two real-world datasets illustrate the effectiveness and performance of our proposed attacks.

Paper Structure (31 sections, 3 theorems, 15 equations, 2 figures, 10 tables, 3 algorithms)

This paper contains 31 sections, 3 theorems, 15 equations, 2 figures, 10 tables, 3 algorithms.

Introduction
Related Work
Background and Threat Model
Definition of the Generalization Error
Threat Model
BadSampler Attack
Primer on BadSampler Attack
Formulating Optimization Problems
Attack Implementation
Theoretical Analysis
Error Upper Bound Analysis for BadSampler
Complexity Analysis
Experiments
Experiment Setup
Evaluation
...and 16 more sections

Key Result

Lemma 1

If Assumptions assum-1 and assum-2 hold, the expectation of the stochastic second-order correction term, i.e., the expectation of the bias, is formally expressed as follows:

Figures (2)

Figure 1: Workflow and taxonomy of our BadSampler attack.
Figure 2: Visual overview of Hessian eigenvalue distributions for benign versus poisoned models.

Theorems & Definitions (5)

Definition 1
Definition 2
Lemma 1
Lemma 2
Theorem 1

BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

TL;DR

Abstract

BadSampler: Harnessing the Power of Catastrophic Forgetting to Poison Byzantine-robust Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (2)

Theorems & Definitions (5)