Wavelet Scattering Transform and Fourier Representation for Offline Detection of Malicious Clients in Federated Learning
Alessandro Licciardi, Davide Leo, Davide Carbone
TL;DR
This work tackles malicious or faulty client detection in Federated Learning by introducing WAFFLE, an offline detector that operates before training using low-dimensional spectral embeddings derived from either the Fourier Transform or Wavelet Scattering Transform. WAFFLE trains its detector on a distillated public auxiliary dataset and relies on PCA-derived client summaries along with spectral embeddings to label clients as benign or malicious with minimal communication overhead. The authors provide theoretical guarantees showing that removing malicious clients yields an unbiased and less noisy global estimator, and demonstrate through extensive experiments that WAFFLE, particularly the WST variant, improves detection accuracy and downstream task performance under both Gaussian and non-Gaussian attacks, including an NLP proof-of-concept. The method preserves privacy by keeping raw data on-device and sharing only non-invertible embeddings, and it can be combined with existing robust aggregation methods for a multi-layered defense in scalable FL deployments.
Abstract
Federated Learning (FL) enables the training of machine learning models across decentralized clients while preserving data privacy. However, the presence of anomalous or corrupted clients - such as those with faulty sensors or non representative data distributions - can significantly degrade model performance. Detecting such clients without accessing raw data remains a key challenge. We propose WAFFLE (Wavelet and Fourier representations for Federated Learning) a detection algorithm that labels malicious clients {\it before training}, using locally computed compressed representations derived from either the Wavelet Scattering Transform (WST) or the Fourier Transform. Both approaches provide low-dimensional, task-agnostic embeddings suitable for unsupervised client separation. A lightweight detector, trained on a distillated public dataset, performs the labeling with minimal communication and computational overhead. While both transforms enable effective detection, WST offers theoretical advantages, such as non-invertibility and stability to local deformations, that make it particularly well-suited to federated scenarios. Experiments on benchmark datasets show that our method improves detection accuracy and downstream classification performance compared to existing FL anomaly detection algorithms, validating its effectiveness as a pre-training alternative to online detection strategies.
