Stochastic Controlled Averaging for Federated Learning with Communication Compression
Xinmeng Huang, Ping Li, Xiaoyun Li
TL;DR
The paper addresses the challenge of preserving performance in federated learning when incorporating communication compression under arbitrary data heterogeneity and partial participation. It introduces a simplified stochastic controlled averaging framework built on SCAFFOLD, enabling halved uplink communication and two algorithms: SCALLION for unbiased compression and SCAFCOM for biased compression with momentum. Theoretical results establish state-of-the-art nonconvex convergence and tight asymptotic communication/computation complexities under minimal assumptions, while experiments on MNIST and Fashion-MNIST show near-full-precision performance with substantial uplink reductions and superiority over existing compressed FL methods. The work significantly advances practical, robust compressed FL, combining momentum, error-feedback-like ideas, and a single-variable uplink scheme. The methods are poised to impact large-scale FL deployments by dramatically reducing communication overhead without compromising convergence under heterogeneous client data and partial participation.
Abstract
Communication compression, a technique aiming to reduce the information volume to be transmitted over the air, has gained great interests in Federated Learning (FL) for the potential of alleviating its communication overhead. However, communication compression brings forth new challenges in FL due to the interplay of compression-incurred information distortion and inherent characteristics of FL such as partial participation and data heterogeneity. Despite the recent development, the performance of compressed FL approaches has not been fully exploited. The existing approaches either cannot accommodate arbitrary data heterogeneity or partial participation, or require stringent conditions on compression. In this paper, we revisit the seminal stochastic controlled averaging method by proposing an equivalent but more efficient/simplified formulation with halved uplink communication costs. Building upon this implementation, we propose two compressed FL algorithms, SCALLION and SCAFCOM, to support unbiased and biased compression, respectively. Both the proposed methods outperform the existing compressed FL methods in terms of communication and computation complexities. Moreover, SCALLION and SCAFCOM accommodates arbitrary data heterogeneity and do not make any additional assumptions on compression errors. Experiments show that SCALLION and SCAFCOM can match the performance of corresponding full-precision FL approaches with substantially reduced uplink communication, and outperform recent compressed FL methods under the same communication budget.
