Accelerated Methods with Compressed Communications for Distributed Optimization Problems under Data Similarity
Dmitry Bylinkin, Aleksandr Beznosikov
TL;DR
This work addresses the communication bottleneck in distributed optimization by marrying compression, local steps, and data similarity under a star topology. It introduces OLGA (unbiased compression) and EF-OLGA (biased compression with error feedback), which leverage variance reduction and Hessian similarity to achieve favorable communication complexities. Theoretical results provide CC-1/CC-2/CC-3 guarantees, with an optimal compression parameter roughly $\gamma_{\omega}=\Theta(\sqrt{M})$ that balances efficiency and accuracy, and corollaries for accelerated performance under similarity. Empirical validation on ridge and logistic regression tasks with LibSVM datasets confirms the proposed methods’ advantages and robustness across varying numbers of workers and compressors.
Abstract
In recent years, as data and problem sizes have increased, distributed learning has become an essential tool for training high-performance models. However, the communication bottleneck, especially for high-dimensional data, is a challenge. Several techniques have been developed to overcome this problem. These include communication compression and implementation of local steps, which work particularly well when there is similarity of local data samples. In this paper, we study the synergy of these approaches for efficient distributed optimization. We propose the first theoretically grounded accelerated algorithms utilizing unbiased and biased compression under data similarity, leveraging variance reduction and error feedback frameworks. Our results are of record and confirmed by experiments on different average losses and datasets.
