Interaction-Aware Gaussian Weighting for Clustered Federated Learning
Alessandro Licciardi, Davide Leo, Eros Fanì, Barbara Caputo, Marco Ciccone
TL;DR
The paper addresses non-IID data and class imbalance in distributed FL by formulating the objective as $ \min_{\theta} \sum_{k=1}^K \frac{n_k}{n} \mathcal{L}_k(\theta)$ and pursuing cluster-specific models. It introduces FedGWC, which uses a Gaussian reward derived from each client’s loss trajectory to build a similarity signal, constructs an interaction matrix $P^t$ and a symmetric affinity matrix $W$ over unbiased perceptions, and applies spectral clustering to form homogeneous client clusters; a Wasserstein Adjusted Score is developed to quantify cluster cohesion under imbalance. Theoretical contributions prove convergence of Gaussian weights to per-client rewards $\mu_k$ with variance reduction, and the method maintains bounded interaction matrices and sampling-rate stability. Empirical results on Leaf, CIFAR-100, FEMNIST, Google Landmarks, and iNaturalist show improved clustering quality and cluster-wise accuracy, while remaining communication- and computation-efficient and compatible with existing FL aggregators.
Abstract
Federated Learning (FL) emerged as a decentralized paradigm to train models while preserving privacy. However, conventional FL struggles with data heterogeneity and class imbalance, which degrade model performance. Clustered FL balances personalization and decentralized training by grouping clients with analogous data distributions, enabling improved accuracy while adhering to privacy constraints. This approach effectively mitigates the adverse impact of heterogeneity in FL. In this work, we propose a novel clustered FL method, FedGWC (Federated Gaussian Weighting Clustering), which groups clients based on their data distribution, allowing training of a more robust and personalized model on the identified clusters. FedGWC identifies homogeneous clusters by transforming individual empirical losses to model client interactions with a Gaussian reward mechanism. Additionally, we introduce the Wasserstein Adjusted Score, a new clustering metric for FL to evaluate cluster cohesion with respect to the individual class distribution. Our experiments on benchmark datasets show that FedGWC outperforms existing FL algorithms in cluster quality and classification accuracy, validating the efficacy of our approach.
