Aggressive or Imperceptible, or Both: Network Pruning Assisted Hybrid Byzantines in Federated Learning
Emre Ozfatura, Kerem Ozfatura, Alptekin Kupcu, Deniz Gunduz
TL;DR
This paper addresses the Byzantine robustness problem in federated learning by showing that exploiting NN topology can yield stronger, more stealthy attacks. It introduces a two-part hybrid sparse Byzantine attack that splits perturbations into an imperceptible, index-wise component and a stronger, Euclidean-distance–oriented component, guided by a sparsity mask derived from network pruning. The authors demonstrate, through extensive simulations across multiple datasets, models, and defensive aggregators, that their attack can substantially degrade or even derail learning, often outperforming existing attacks especially under non-IID data. The work highlights the need to consider topology-informed vulnerabilities in designing robust aggregators and paves the way for topology-aware defenses in federated settings.
Abstract
Federated learning (FL) has been introduced to enable a large number of clients, possibly mobile devices, to collaborate on generating a generalized machine learning model thanks to utilizing a larger number of local samples without sharing to offer certain privacy to collaborating clients. However, due to the participation of a large number of clients, it is often difficult to profile and verify each client, which leads to a security threat that malicious participants may hamper the accuracy of the trained model by conveying poisoned models during the training. Hence, the aggregation framework at the parameter server also needs to minimize the detrimental effects of these malicious clients. A plethora of attack and defence strategies have been analyzed in the literature. However, often the Byzantine problem is analyzed solely from the outlier detection perspective, being oblivious to the topology of neural networks (NNs). In the scope of this work, we argue that by extracting certain side information specific to the NN topology, one can design stronger attacks. Hence, inspired by the sparse neural networks, we introduce a hybrid sparse Byzantine attack that is composed of two parts: one exhibiting a sparse nature and attacking only certain NN locations with higher sensitivity, and the other being more silent but accumulating over time, where each ideally targets a different type of defence mechanism, and together they form a strong but imperceptible attack. Finally, we show through extensive simulations that the proposed hybrid Byzantine attack is effective against 8 different defence methods.
