Peak-Controlled Logits Poisoning Attack in Federated Distillation
Yuhan Tang, Aoxu Zhang, Zhiyuan Wu, Bo Gao, Tian Wen, Yuwei Wang, Sheng Sun
TL;DR
This paper addresses the security vulnerability of Federated Distillation (FD) to logits-based poisoning attacks. It introduces Peak-Controlled Federated Distillation Logits Attack (PCFDLA), an enhancement of the prior FDLA method that strategically controls peak logits to mislead learning while preserving the attacker's ability to predict correct answers. A new victim-oriented metric is proposed to quantify the impact on attacked clients, and extensive experiments across CINIC-10, CIFAR-10, and SVHN show that PCFDLA significantly degrades victim model accuracy compared with baselines, with the strength of the attack controllable via a hyperparameter. The results highlight a concrete security risk in FD pipelines and emphasize the need for defenses against logits-level manipulation in distributed distillation systems.
Abstract
Federated Distillation (FD) offers an innovative approach to distributed machine learning, leveraging knowledge distillation for efficient and flexible cross-device knowledge transfer without necessitating the upload of extensive model parameters to a central server. While FD has gained popularity, its vulnerability to poisoning attacks remains underexplored. To address this gap, we previously introduced FDLA (Federated Distillation Logits Attack), a method that manipulates logits communication to mislead and degrade the performance of client models. However, the impact of FDLA on participants with different identities and the effects of malicious modifications at various stages of knowledge transfer remain unexplored. To this end, we present PCFDLA (Peak-Controlled Federated Distillation Logits Attack), an advanced and more stealthy logits poisoning attack method for FD. PCFDLA enhances the effectiveness of FDLA by carefully controlling the peak values of logits to create highly misleading yet inconspicuous modifications. Furthermore, we introduce a novel metric for better evaluating attack efficacy, demonstrating that PCFDLA maintains stealth while being significantly more disruptive to victim models compared to its predecessors. Experimental results across various datasets confirm the superior impact of PCFDLA on model accuracy, solidifying its potential threat in federated distillation systems.
