Quantized and Asynchronous Federated Learning
Tomas Ortega, Hamid Jafarkhani
TL;DR
This paper addresses the dual challenges of communication efficiency and asynchrony in federated learning by introducing Quantized Asynchronous Federated Learning (QAFeL). QAFeL employs a hidden-state quantization scheme with buffered, buffered aggregation to prevent error propagation and enable scalable updates without requiring uniform client participation. The authors prove an optimal ergodic convergence rate of $\mathcal{O}\big(1/\sqrt{T}\big)$ for non-convex objectives, under mild assumptions, and show that the cross-term between staleness and quantization is of higher order and thus negligible. Empirically, QAFeL demonstrates strong communication savings and robust convergence on logistic regression and neural network benchmarks (CIFAR-10, CelebA, Shakespeare), outperforming or matching baselines like FedBuff while reducing communication costs.
Abstract
Recent advances in federated learning have shown that asynchronous variants can be faster and more scalable than their synchronous counterparts. However, their design does not include quantization, which is necessary in practice to deal with the communication bottleneck. To bridge this gap, we develop a novel algorithm, Quantized Asynchronous Federated Learning (QAFeL), which introduces a hidden-state quantization scheme to avoid the error propagation caused by direct quantization. QAFeL also includes a buffer to aggregate client updates, ensuring scalability and compatibility with techniques such as secure aggregation. Furthermore, we prove that QAFeL achieves an $\mathcal{O}(1/\sqrt{T})$ ergodic convergence rate for stochastic gradient descent on non-convex objectives, which is the optimal order of complexity, without requiring bounded gradients or uniform client arrivals. We also prove that the cross-term error between staleness and quantization only affects the higher-order error terms. We validate our theoretical findings on standard benchmarks.
