Protecting Federated Learning from Extreme Model Poisoning Attacks via Multidimensional Time Series Anomaly Detection
Edoardo Gabrielli, Dimitri Belli, Zoe Matrullo, Vittorio Miori, Gabriele Tolomei
TL;DR
FLANDERS introduces a novel pre-aggregation filter for federated learning that leverages a matrix autoregressive (MAR) time-series model to forecast the next round of client updates and identify outliers. By treating the per-round, per-client updates as a d-by-m matrix, FLANDERS computes an anomaly score for each selected client and filters out the suspected malicious contributions before standard or robust aggregations. The method is designed to be attack-agnostic with respect to the number of adversarial clients and to exploit temporal dependencies between intra- and inter-client updates, offering strong robustness even when malicious participants dominate. Empirical results across non-IID data, multiple datasets, and several attack types show that FLANDERS improves the resilience of existing aggregation rules (e.g., FedAvg, Multi-Krum, Bulyan) and enables them to maintain high accuracy under extreme attack scenarios, albeit with a computational cost that can be mitigated via dimensionality reduction and sampling. The work also discusses practical limitations, including efficiency, privacy considerations, cross-device scalability, and the need for broader benchmarking, while providing a pathway to integrate FLANDERS into existing FL frameworks such as Flower.
Abstract
Current defense mechanisms against model poisoning attacks in federated learning (FL) systems have proven effective up to a certain threshold of malicious clients. In this work, we introduce FLANDERS, a novel pre-aggregation filter for FL resilient to large-scale model poisoning attacks, i.e., when malicious clients far exceed legitimate participants. FLANDERS treats the sequence of local models sent by clients in each FL round as a matrix-valued time series. Then, it identifies malicious client updates as outliers in this time series by comparing actual observations with estimates generated by a matrix autoregressive forecasting model maintained by the server. Experiments conducted in several non-iid FL setups show that FLANDERS significantly improves robustness across a wide spectrum of attacks when paired with standard and robust existing aggregation methods.
