Exploiting Defenses against GAN-Based Feature Inference Attacks in Federated Learning

Xinjian Luo; Xianglong Zhang

Exploiting Defenses against GAN-Based Feature Inference Attacks in Federated Learning

Xinjian Luo, Xianglong Zhang

TL;DR

This work addresses GAN-based feature inference threats in federated learning by introducing Anti-GAN, a defense that obfuscates the visual content of generated images while preserving classification features. It combines a CGAN-driven obfuscation, a low-level feature preservation mechanism via a pre-trained network, and a Mixup augmentation to form a safe training set for the federated model. Empirical results across MNIST, Fashion-MNIST, CelebA, and CIFAR-10 show substantial reductions in attacker reconstruction quality (SSIM) with limited accuracy loss (ADR typically under 5%), outperforming DP-SGD and dropout baselines. The study demonstrates that protecting group-level data distributions is feasible with practical utility, offering a new direction for privacy in FL, albeit without formal guarantees for group-level privacy.

Abstract

Federated learning (FL) is a decentralized model training framework that aims to merge isolated data islands while maintaining data privacy. However, recent studies have revealed that Generative Adversarial Network (GAN) based attacks can be employed in FL to learn the distribution of private datasets and reconstruct recognizable images. In this paper, we exploit defenses against GAN-based attacks in FL and propose a framework, Anti-GAN, to prevent attackers from learning the real distribution of the victim's data. The core idea of Anti-GAN is to manipulate the visual features of private training images to make them indistinguishable to human eyes even restored by attackers. Specifically, Anti-GAN projects the private dataset onto a GAN's generator and combines the generated fake images with the actual images to create the training dataset, which is then used for federated model training. The experimental results demonstrate that Anti-GAN is effective in preventing attackers from learning the distribution of private images while causing minimal harm to the accuracy of the federated model.

Exploiting Defenses against GAN-Based Feature Inference Attacks in Federated Learning

TL;DR

Abstract

Exploiting Defenses against GAN-Based Feature Inference Attacks in Federated Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)