FedQS: Optimizing Gradient and Model Aggregation for Semi-Asynchronous Federated Learning
Yunbo Li, Jiaping Gui, Zhihang Deng, Fanchao Meng, Yue Wu
TL;DR
FedQS addresses the dual challenge of gradient and model aggregation in semi-asynchronous federated learning (SAFL) by classifying clients into four types and adaptively guiding their local training, while a server-side module reweights and aggregates their updates. The approach combines pseudo-global-gradient estimation, per-type training adaptations, and a dynamic weighting scheme to reconcile stability with convergence speed, and it comes with formal convergence guarantees for both aggregation modes. Empirically, FedQS achieves the highest accuracy and lowest loss and ranks among the fastest in convergence across CV, NLP, and real-world tasks, and it remains robust to varying system settings and hyperparameters. The work provides a principled, scalable framework that bridges gradient and model aggregation in SAFL, with practical potential for real-world federated deployments.
Abstract
Federated learning (FL) enables collaborative model training across multiple parties without sharing raw data, with semi-asynchronous FL (SAFL) emerging as a balanced approach between synchronous and asynchronous FL. However, SAFL faces significant challenges in optimizing both gradient-based (e.g., FedSGD) and model-based (e.g., FedAvg) aggregation strategies, which exhibit distinct trade-offs in accuracy, convergence speed, and stability. While gradient aggregation achieves faster convergence and higher accuracy, it suffers from pronounced fluctuations, whereas model aggregation offers greater stability but slower convergence and suboptimal accuracy. This paper presents FedQS, the first framework to theoretically analyze and address these disparities in SAFL. FedQS introduces a divide-and-conquer strategy to handle client heterogeneity by classifying clients into four distinct types and adaptively optimizing their local training based on data distribution characteristics and available computational resources. Extensive experiments on computer vision, natural language processing, and real-world tasks demonstrate that FedQS achieves the highest accuracy, attains the lowest loss, and ranks among the fastest in convergence speed, outperforming state-of-the-art baselines. Our work bridges the gap between aggregation strategies in SAFL, offering a unified solution for stable, accurate, and efficient federated learning. The code and datasets are available at https://github.com/bkjod/FedQS_.
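The two aggregation modes contrasted in the abstract can be illustrated with a minimal sketch in plain Python. This is not the FedQS algorithm: the function names, the toy quadratic client loss in `local_sgd`, and the use of sample-count-style weights are all illustrative assumptions, and staleness handling for semi-asynchronous updates is omitted.

```python
def aggregate_gradients(global_w, client_grads, weights, lr):
    """Gradient aggregation (FedSGD-style): average the client gradients,
    then apply a single server-side step to the global model."""
    total = sum(weights)
    avg_grad = [
        sum(w * g[i] for w, g in zip(weights, client_grads)) / total
        for i in range(len(global_w))
    ]
    return [p - lr * g for p, g in zip(global_w, avg_grad)]


def aggregate_models(client_models, weights):
    """Model aggregation (FedAvg-style): weighted average of the
    parameters that clients obtained after local training."""
    total = sum(weights)
    dim = len(client_models[0])
    return [
        sum(w * m[i] for w, m in zip(weights, client_models)) / total
        for i in range(dim)
    ]


def local_sgd(w, target, lr, steps):
    """Hypothetical client: minimizes 0.5 * ||w - target||^2 by plain SGD.
    The gradient of this toy loss is simply (w - target)."""
    for _ in range(steps):
        grad = [wi - ti for wi, ti in zip(w, target)]
        w = [wi - lr * gi for wi, gi in zip(w, grad)]
    return w
```

In this sketch, if every client takes exactly one local step from the same global model with the same learning rate, the two functions return the same result. They diverge once clients run several local steps, which is the regime where the trade-offs described above become relevant.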
