A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure

Wei Sun; Bo Gao; Ke Xiong; Yuwei Wang

A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure

Wei Sun, Bo Gao, Ke Xiong, Yuwei Wang

TL;DR

This work addresses the security risk that non-visible local data in federated learning can be poisoned to degrade global performance. It introduces VagueGAN, a GAN variant that generates vague but poisonous data with a tunable suppression factor $\kappa$ to balance attack effectiveness and stealth, and extends to an unsupervised version inspired by InfoGAN. To defend against such GAN-based attacks, the authors propose Model Consistency-Based Defense (MCD), which reuses existing local models and analyzes PCA-reduced features to detect malicious clients via centroid and footprint metrics. Extensive experiments on MNIST, Fashion-MNIST, and CIFAR-10 show that VagueGAN achieves strong degradation with low detectability, while MCD reliably identifies GAN-poisoned data or models and restores FL accuracy. The findings highlight a practical arms race between stealthy data-poisoning attacks and lightweight, model-based defenses for real-world FL deployments, and point to directions for more robust adversarial defenses and adaptive GAN designs.

Abstract

As a distributed machine learning paradigm, federated learning (FL) is collaboratively carried out on privately owned datasets but without direct data access. Although the original intention is to allay data privacy concerns, "available but not visible" data in FL potentially brings new security threats, particularly poisoning attacks that target such "not visible" local data. Initial attempts have been made to conduct data poisoning attacks against FL systems, but cannot be fully successful due to their high chance of causing statistical anomalies. To unleash the potential for truly "invisible" attacks and build a more deterrent threat model, in this paper, a new data poisoning attack model named VagueGAN is proposed, which can generate seemingly legitimate but noisy poisoned data by untraditionally taking advantage of generative adversarial network (GAN) variants. Capable of manipulating the quality of poisoned data on demand, VagueGAN enables to trade-off attack effectiveness and stealthiness. Furthermore, a cost-effective countermeasure named Model Consistency-Based Defense (MCD) is proposed to identify GAN-poisoned data or models after finding out the consistency of GAN outputs. Extensive experiments on multiple datasets indicate that our attack method is generally much more stealthy as well as more effective in degrading FL performance with low complexity. Our defense method is also shown to be more competent in identifying GAN-poisoned data or models. The source codes are publicly available at \href{https://github.com/SSssWEIssSS/VagueGAN-Data-Poisoning-Attack-and-Its-Countermeasure}{https://github.com/SSssWEIssSS/VagueGAN-Data-Poisoning-Attack-and-Its-Countermeasure}.

A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure

TL;DR

to balance attack effectiveness and stealth, and extends to an unsupervised version inspired by InfoGAN. To defend against such GAN-based attacks, the authors propose Model Consistency-Based Defense (MCD), which reuses existing local models and analyzes PCA-reduced features to detect malicious clients via centroid and footprint metrics. Extensive experiments on MNIST, Fashion-MNIST, and CIFAR-10 show that VagueGAN achieves strong degradation with low detectability, while MCD reliably identifies GAN-poisoned data or models and restores FL accuracy. The findings highlight a practical arms race between stealthy data-poisoning attacks and lightweight, model-based defenses for real-world FL deployments, and point to directions for more robust adversarial defenses and adaptive GAN designs.

Abstract

Paper Structure (39 sections, 22 equations, 17 figures, 11 tables, 3 algorithms)

This paper contains 39 sections, 22 equations, 17 figures, 11 tables, 3 algorithms.

Introduction
Related Work
Objectives and Assumptions
System Model
Attack Model
Attacker's goal
Attacker's capabilities
Attacker's approach
GAN-Based Data Poisoning Attacks
VagueGAN Poisoning Attack
VagueGAN Model
Theoretical Analysis
Attack Effectiveness-Stealthiness Trade-off
Unsupervised Variant of VagueGAN
Deployment Considerations
...and 24 more sections

Figures (17)

Figure 1: Use of VagueGAN for a data poisoning attack
Figure 2: Comparison of noise-superimposed samples and samples generated by different GANs
Figure 3: Different levels of attack stealthiness
Figure 4: Trade-off between attack effectiveness and stealthiness
Figure 5: The architecture of unsupervised VagueGAN
...and 12 more figures

A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure

TL;DR

Abstract

A GAN-Based Data Poisoning Attack Against Federated Learning Systems and Its Countermeasure

Authors

TL;DR

Abstract

Table of Contents

Figures (17)