LoCoDL: Communication-Efficient Distributed Learning with Local Training and Compression
Laurent Condat, Artavazd Maranjyan, Peter Richtárik
TL;DR
This work addresses communication bottlenecks in distributed optimization and Federated Learning by introducing 0.9LoCoDL, a randomized primal–dual method that combines Local Training and unidirectional Compression for uplink efficiency. By lifting the problem to a consensus formulation over $x_i$ and a shared $y$, and compressing only the differences to $y$, the method achieves linear convergence with a doubly-accelerated uplink complexity that scales favorably with the condition number $\kappa$ and model dimension $d$. The compressor class $\mathbb{U}(\omega)$ and the variance parameter $\omega_{\mathrm{av}}$ enable broad practical applicability with independent, unbiased quantizers, while the theoretical rate matches the best prior LT+CC results under mild assumptions. Empirically, 0.9LoCoDL shows superior communication efficiency across multiple datasets and compression schemes, outperforming existing LT+CC approaches and even some state-of-the-art accelerated methods, highlighting the value of combining Local Training with general unbiased compression in Federated/Distributed learning.
Abstract
In Distributed optimization and Learning, and even more in the modern framework of federated learning, communication, which is slow and costly, is critical. We introduce LoCoDL, a communication-efficient algorithm that leverages the two popular and effective techniques of Local training, which reduces the communication frequency, and Compression, in which short bitstreams are sent instead of full-dimensional vectors of floats. LoCoDL works with a large class of unbiased compressors that includes widely-used sparsification and quantization methods. LoCoDL provably benefits from local training and compression and enjoys a doubly-accelerated communication complexity, with respect to the condition number of the functions and the model dimension, in the general heterogenous regime with strongly convex functions. This is confirmed in practice, with LoCoDL outperforming existing algorithms.
