Efficient Gradient Tracking Algorithms for Distributed Optimization Problems with Inexact Communication
Shengchao Zhao, Yongchao Liu
TL;DR
The paper tackles distributed optimization when communication is inexact due to quantization, privacy, or noise. It introduces two single-timescale algorithms, VRA-DGT for deterministic problems and VRA-DSGT for stochastic problems, by integrating Variance-Reduced Aggregation with gradient tracking and, for the stochastic case, coupling with a hybrid variance reduction. Both methods achieve a mean-square convergence rate of $O\left(\frac{1}{k}\right)$ under strongly convex and smooth objectives, without the two-timescale constraint traditional in prior work. Empirical tests on logistic regression with real data demonstrate robustness to channel noise and quantization and show improved performance over existing robust DGT schemes.
Abstract
Distributed optimization problems usually face inexact communication issues induced by communication quantization, differential privacy protection, or channels noise. Most existing algorithms need two-timescale setting of the stepsize of gradient descent and the parameter of noise suppression to ensure the convergence to the optimal solution. In this paper, we propose two single-timescale algorithms, VRA-DGT and VRA--DSGT, for distributed deterministic and stochastic optimization problems with inexact communication respectively. VRA-DGT integrates the Variance-Reduced Aggregation (VRA) mechanism with the distributed gradient tracking framework, which achieves a convergence rate of $\mathcal{O}\left(k^{-1}\right)$ in the mean-square sense when the objective function is strongly convex and smooth. For distributed stochastic optimization problem,VRA-DSGT, where a hybrid variance reduction technique has been introduced in VRA-DGT, VRA-DGT,, maintains the convergence rate of $\mathcal{O}\left(k^{-1}\right)$ for strongly convex and smooth objective function. Simulated experiments on logistic regression problem with real-world data verify the effectiveness of the proposed algorithms.
