StatAvg: Mitigating Data Heterogeneity in Federated Learning for Intrusion Detection Systems
Pavlos S. Bouzinis, Panagiotis Radoglou-Grammatikis, Ioannis Makris, Thomas Lagkas, Vasileios Argyriou, Georgios Th. Papadopoulos, Panagiotis Sarigiannidis, George K. Karagiannidis
TL;DR
The paper tackles data heterogeneity in federated intrusion detection by introducing StatAvg, a universal normalization mechanism that aggregates per-client feature statistics into global mean and variance, enabling consistent input scaling before FL training. The method, proven mathematically to reflect the global data distribution, is model-agnostic and compatible with any aggregation strategy. Empirical results on TON-IoT and CIC-IoT-2023 show StatAvg delivering substantial accuracy and F1 improvements over FedAvg, FedLN, and FedBN, with more stable convergence. This approach offers a practical, deployment-friendly way to mitigate non-iid feature effects in FL-based IDS and potentially other FL applications with heterogeneous data.
Abstract
Federated learning (FL) is a decentralized learning technique that enables participating devices to collaboratively build a shared Machine Leaning (ML) or Deep Learning (DL) model without revealing their raw data to a third party. Due to its privacy-preserving nature, FL has sparked widespread attention for building Intrusion Detection Systems (IDS) within the realm of cybersecurity. However, the data heterogeneity across participating domains and entities presents significant challenges for the reliable implementation of an FL-based IDS. In this paper, we propose an effective method called Statistical Averaging (StatAvg) to alleviate non-independently and identically (non-iid) distributed features across local clients' data in FL. In particular, StatAvg allows the FL clients to share their individual data statistics with the server, which then aggregates this information to produce global statistics. The latter are shared with the clients and used for universal data normalisation. It is worth mentioning that StatAvg can seamlessly integrate with any FL aggregation strategy, as it occurs before the actual FL training process. The proposed method is evaluated against baseline approaches using datasets for network and host Artificial Intelligence (AI)-powered IDS. The experimental results demonstrate the efficiency of StatAvg in mitigating non-iid feature distributions across the FL clients compared to the baseline methods.
