An Aggregation-Free Federated Learning for Tackling Data Heterogeneity
Yuan Wang, Huazhu Fu, Renuga Kanagavelu, Qingsong Wei, Yong Liu, Rick Siow Mong Goh
TL;DR
Federated Learning with non-IID client data suffers from client drift and slow convergence under the traditional aggregation-then-adapt framework. The paper introduces FedAF, an aggregation-free FL algorithm where clients learn condensed data and the server trains the global model using these condensed data plus soft labels, with distribution-alignment via Distribution Matching and cross-client Soft-labeling through Local-Global Knowledge Matching. Key technical ingredients include $L_{DM}$ with per-class feature means $\mu_{k,c}^{real}$ and $\mu_{k,c}^{syn}$, a SWD-based CDC term, and $L_{LGKM}$ based on KL divergence between local and global logits, plus a model re-sampling step $w \leftarrow \gamma w+(1-\gamma)\tilde{w}$. Experiments on FMNIST, CIFAR-10/100, and DomainNet show FedAF achieves up to 25.44% accuracy gains and up to 80% faster convergence compared to state-of-the-art baselines, especially under strong heterogeneity. The work demonstrates a practical, privacy-aware route to robust, fast-converging FL in heterogeneous environments.
Abstract
The performance of Federated Learning (FL) hinges on the effectiveness of utilizing knowledge from distributed datasets. Traditional FL methods adopt an aggregate-then-adapt framework, where clients update local models based on a global model aggregated by the server from the previous training round. This process can cause client drift, especially with significant cross-client data heterogeneity, impacting model performance and convergence of the FL algorithm. To address these challenges, we introduce FedAF, a novel aggregation-free FL algorithm. In this framework, clients collaboratively learn condensed data by leveraging peer knowledge, the server subsequently trains the global model using the condensed data and soft labels received from the clients. FedAF inherently avoids the issue of client drift, enhances the quality of condensed data amid notable data heterogeneity, and improves the global model performance. Extensive numerical studies on several popular benchmark datasets show FedAF surpasses various state-of-the-art FL algorithms in handling label-skew and feature-skew data heterogeneity, leading to superior global model accuracy and faster convergence.
