A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning

Jiajun Song; Jiajun Luo; Rongwei Lu; Shuzhao Xie; Bin Chen; Zhi Wang

A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning

Jiajun Song, Jiajun Luo, Rongwei Lu, Shuzhao Xie, Bin Chen, Zhi Wang

TL;DR

This work tackles AFL under device heterogeneity and limited bandwidth by addressing model staleness through a joint optimization of local updating frequency and gradient compression. The authors derive a convergence upper bound that identifies a key factor $\phi$ depending on per-device $k_i$ and $\delta_i$, and they propose FedLuck to adapt these parameters per device by minimizing $\phi$ using locally measured times $\alpha_i$ and $\beta_i$. Empirical evaluation across image and speech tasks demonstrates that FedLuck reduces communication by $56\%$ and training time by $\approx 55\%$ on average while maintaining competitive accuracy, even in Non-IID settings. This approach advances practical AFL by integrating computation-communication trade-offs into a unified adaptive framework, enabling more efficient deployment on heterogeneous devices and networks.

Abstract

Asynchronous Federated Learning (AFL) confronts inherent challenges arising from the heterogeneity of devices (e.g., their computation capacities) and low-bandwidth environments, both potentially causing stale model updates (e.g., local gradients) for global aggregation. Traditional approaches mitigating the staleness of updates typically focus on either adjusting the local updating or gradient compression, but not both. Recognizing this gap, we introduce a novel approach that synergizes local updating with gradient compression. Our research begins by examining the interplay between local updating frequency and gradient compression rate, and their collective impact on convergence speed. The theoretical upper bound shows that the local updating frequency and gradient compression rate of each device are jointly determined by its computing power, communication capabilities and other factors. Building on this foundation, we propose an AFL framework called FedLuck that adaptively optimizes both local update frequency and gradient compression rates. Experiments on image classification and speech recognization show that FedLuck reduces communication consumption by 56% and training time by 55% on average, achieving competitive performance in heterogeneous and low-bandwidth scenarios compared to the baselines.

A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning

TL;DR

depending on per-device

and

, and they propose FedLuck to adapt these parameters per device by minimizing

using locally measured times

and

. Empirical evaluation across image and speech tasks demonstrates that FedLuck reduces communication by

and training time by

on average while maintaining competitive accuracy, even in Non-IID settings. This approach advances practical AFL by integrating computation-communication trade-offs into a unified adaptive framework, enabling more efficient deployment on heterogeneous devices and networks.

Abstract

Paper Structure (12 sections, 15 equations, 3 figures, 2 tables, 1 algorithm)

This paper contains 12 sections, 15 equations, 3 figures, 2 tables, 1 algorithm.

Introduction
Preliminaries and Problem Formulation
Federated Learning
AFL with Periodic Aggregation
The Joint Approach to Local Updating and Gradient Compression
Illustrative Examples of Motivation
Our Proposed AFL Framework: FedLuck
Experiments
Experiment Tasks
Baselines and Metrics
Simulation Experiments
Conclusion

Figures (3)

Figure 1: The minimum global rounds required to reach the target accuracy for the global model under different compression rates and local updating frequencies.
Figure 2: Test Accuracy and Elapsed Time comparison between FedLuck and four baselines on three tasks in IID setting. FedLuck reduces training time by $55$% in average compared with baselines.
Figure 3: Communication consumption between FedLuck and four baselines at target accuracy on three tasks in IID setting. FedLuck reduces communication consumption by $56$% in average compared with baselines.

A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning

TL;DR

Abstract

A Joint Approach to Local Updating and Gradient Compression for Efficient Asynchronous Federated Learning

Authors

TL;DR

Abstract

Table of Contents

Figures (3)