Leverage Variational Graph Representation For Model Poisoning on Federated Learning
Kai Li, Xin Yuan, Jingjing Zheng, Wei Ni, Falko Dressler, Abbas Jamalipour
TL;DR
This work tackles model-poisoning in federated learning when attackers can only overhear benign local updates. It proposes VGAE-MP, an adversarial variational graph autoencoder that learns the graph-structured correlations among benign updates and regenerates a malicious update $oldsymbol{w}_j'(t)$ that degrades the global model while remaining close to the aggregated update for stealth. Key contributions include (i) a data-untethered poisoning framework that exploits feature correlations, (ii) a dual-variable optimization scheme with a knapsack-based bandwidth selection to control attack scope, (iii) a VGAE with a two-layer GCN encoder and inner-product decoder to maximize a reconstruction loss, and (iv) extensive experiments on MNIST, FashionMNIST, and CIFAR-10 showing gradual FL accuracy loss and detection evasion. The results highlight a practical threat in wireless FL settings, where eavesdropping enables potent, hard-to-detect poisoning that challenges current defenses and motivates new graph-aware protections.
Abstract
This paper puts forth a new training data-untethered model poisoning (MP) attack on federated learning (FL). The new MP attack extends an adversarial variational graph autoencoder (VGAE) to create malicious local models based solely on the benign local models overheard without any access to the training data of FL. Such an advancement leads to the VGAE-MP attack that is not only efficacious but also remains elusive to detection. VGAE-MP attack extracts graph structural correlations among the benign local models and the training data features, adversarially regenerates the graph structure, and generates malicious local models using the adversarial graph structure and benign models' features. Moreover, a new attacking algorithm is presented to train the malicious local models using VGAE and sub-gradient descent, while enabling an optimal selection of the benign local models for training the VGAE. Experiments demonstrate a gradual drop in FL accuracy under the proposed VGAE-MP attack and the ineffectiveness of existing defense mechanisms in detecting the attack, posing a severe threat to FL.
