FedPSA: Modeling Behavioral Staleness in Asynchronous Federated Learning
Chaoyi Lu, Yiding Sun, Zhichuan Yang, Jinqian Chen, Dongfu Yin, Jihua Zhu
TL;DR
FedPSA targets the staleness problem in asynchronous federated learning by replacing time-based staleness with a behavioral staleness metric derived from parameter sensitivity. It computes a sensitivity vector on a shared calibration batch, sketches it with a random projection, and measures cosine similarity to the global model's sensitivity, $\kappa$, to gauge compatibility with current training dynamics; a training thermometer, using $Temp = \left(\frac{M_{cur}}{M_0}\right)\gamma + \delta$, modulates a softmax aggregation over a fixed buffer of updates. The server aggregates updates with weights that adapt over training stages, allowing exploration early on and tighter convergence later, while maintaining a buffer-based asynchronous workflow. Experiments across MNIST, FMNIST, CIFAR-10, and CIFAR-100 under IID and non-IID settings show FedPSA outperforms multiple baselines, including a robust performance under system heterogeneity, with only marginal additional overhead due to sensitivity sketching and calibration. The proposed approach offers a scalable, privacy-conscious, and effective way to improve AFL performance in real-world heterogeneous environments, with practical guidance on hyperparameters and calibration data design.
Abstract
Asynchronous Federated Learning (AFL) has emerged as a significant research area in recent years. By not waiting for slower clients and executing the training process concurrently, it achieves faster training speed compared to traditional federated learning. However, due to the staleness introduced by the asynchronous process, its performance may degrade in some scenarios. Existing methods often use the round difference between the current model and the global model as the sole measure of staleness, which is coarse-grained and lacks observation of the model itself, thereby limiting the performance ceiling of asynchronous methods. In this paper, we propose FedPSA (Parameter Sensitivity-based Asynchronous Federated Learning), a more fine-grained AFL framework that leverages parameter sensitivity to measure model obsolescence and establishes a dynamic momentum queue to assess the current training phase in real time, thereby adjusting the tolerance for outdated information dynamically. Extensive experiments on multiple datasets and comparisons with various methods demonstrate the superior performance of FedPSA, achieving up to 6.37\% improvement over baseline methods and 1.93\% over the current state-of-the-art method.
