Table of Contents
Fetching ...

FedMon: Federated eBPF Monitoring for Distributed Anomaly Detection in Multi-Cluster Cloud Environments

Sehar Zehra, Hassan Jamil Syed, Ummay Faseeha

TL;DR

Kubernetes multi-cluster environments face privacy, bandwidth, and heterogeneity challenges for anomaly detection. FedMon merges in-kernel telemetry via eBPF with federated learning and a hybrid detector (VAE plus Isolation Forest) to enable privacy-preserving, cross-cluster anomaly detection without sharing raw data. The approach yields strong detection performance (F1 ≈ 0.92; precision ≈ 0.94; recall ≈ 0.91) and substantial bandwidth savings (>60%) while maintaining low runtime overhead and providing Byzantine-robust aggregation. This work advances practical cloud-native security by demonstrating collaborative, kernel-level anomaly detection across clusters with real-time, risk-aware enforcement and optional differential privacy features.

Abstract

Kubernetes multi-cluster deployments demand scalable and privacy-preserving anomaly detection. Existing eBPF-based monitors provide low-overhead system and network visibility but are limited to single clusters, while centralized approaches incur bandwidth, privacy, and heterogeneity challenges. We propose FedMon, a federated eBPF framework that unifies kernel-level telemetry with federated learning (FL) for cross-cluster anomaly detection. Lightweight eBPF agents capture syscalls and network events, extract local statistical and sequence features, and share only model updates with a global server. A hybrid detection engine combining Variational Autoencoders (VAEs) with Isolation Forests enables both temporal pattern modeling and outlier detection. Deployed across three Kubernetes clusters, FedMon achieves 94% precision, 91% recall, and an F1-score of 0.92, while cutting bandwidth usage by 60% relative to centralized baselines. Results demonstrate that FedMon enhances accuracy, scalability, and privacy, providing an effective defense for large-scale, multi-tenant cloud-native environments.

FedMon: Federated eBPF Monitoring for Distributed Anomaly Detection in Multi-Cluster Cloud Environments

TL;DR

Kubernetes multi-cluster environments face privacy, bandwidth, and heterogeneity challenges for anomaly detection. FedMon merges in-kernel telemetry via eBPF with federated learning and a hybrid detector (VAE plus Isolation Forest) to enable privacy-preserving, cross-cluster anomaly detection without sharing raw data. The approach yields strong detection performance (F1 ≈ 0.92; precision ≈ 0.94; recall ≈ 0.91) and substantial bandwidth savings (>60%) while maintaining low runtime overhead and providing Byzantine-robust aggregation. This work advances practical cloud-native security by demonstrating collaborative, kernel-level anomaly detection across clusters with real-time, risk-aware enforcement and optional differential privacy features.

Abstract

Kubernetes multi-cluster deployments demand scalable and privacy-preserving anomaly detection. Existing eBPF-based monitors provide low-overhead system and network visibility but are limited to single clusters, while centralized approaches incur bandwidth, privacy, and heterogeneity challenges. We propose FedMon, a federated eBPF framework that unifies kernel-level telemetry with federated learning (FL) for cross-cluster anomaly detection. Lightweight eBPF agents capture syscalls and network events, extract local statistical and sequence features, and share only model updates with a global server. A hybrid detection engine combining Variational Autoencoders (VAEs) with Isolation Forests enables both temporal pattern modeling and outlier detection. Deployed across three Kubernetes clusters, FedMon achieves 94% precision, 91% recall, and an F1-score of 0.92, while cutting bandwidth usage by 60% relative to centralized baselines. Results demonstrate that FedMon enhances accuracy, scalability, and privacy, providing an effective defense for large-scale, multi-tenant cloud-native environments.

Paper Structure

This paper contains 28 sections, 6 figures, 1 table.

Figures (6)

  • Figure 1: FedMon architecture: The framework integrates eBPF-based kernel telemetry, local anomaly detection, and federated model coordination across Kubernetes clusters.
  • Figure 2: Federated Monitoring for Kubernetes: eBPF agents collect in-cluster telemetry; local anomaly engines detect deviations; FL clients exchange model updates; the global server aggregates models; and eBPF enforcers apply risk-aware actions.
  • Figure 3: F1-score across clusters over federated rounds. Collaboration via FL improves accuracy under non-IID workloads.
  • Figure 4: Throughput under different monitoring modes. FedMon maintains near-baseline performance versus centralized logging.
  • Figure 5: Bandwidth usage over time. FedMon replaces constant streaming with periodic model updates, cutting bandwidth by $>$60%.
  • ...and 1 more figures