Convergence result for the gradient-push algorithm and its application to boost up the Push-DIging algorithm
Hyogi Choi, Woocheol Choi, Gwangil Kim
TL;DR
The paper addresses distributed optimization over directed graphs using the gradient-push algorithm with a constant stepsize. It introduces a contraction-based analysis via the operator $T_{\alpha}$ and proves that for $\alpha\in(0,\alpha_0]$ the iterates converge linearly to a fixed point $w^{\alpha}$, achieving an $O(\alpha)$-neighborhood of the global minimizer $x_*$. Two function classes are handled: (i) each $f_i$ is $\mu_i$-strongly convex and $L_i$-smooth, and (ii) each $f_i$ is convex quadratic with an $L_i$-smooth aggregate, both yielding an $O(\alpha)$-accurate limit with $\alpha_0$ independent of $L$. A hybrid scheme combining gradient-push with Push-DIGing is proposed to accelerate convergence, and numerical experiments demonstrate substantial performance gains. Overall, the work provides sharp, scalable convergence guarantees for constant-step-size distributed optimization on directed graphs and offers a practical pathway to faster consensus-based learning in networked systems.
Abstract
The gradient-push algorithm is a fundamental algorithm for the distributed optimization problem \begin{equation} \min_{x \in \mathbb{R}^d} f(x) = \sum_{j=1}^n f_j (x), \end{equation} where each local cost $f_j$ is only known to agent $a_i$ for $1 \leq i \leq n$ and the agents are connected by a directed graph. In this paper, we obtain convergence results for the gradient-push algorithm with constant stepsize whose range is sharp in terms the order of the smoothness constant $L>0$. Precisely, under the two settings: 1) Each local cost $f_i$ is strongly convex and $L$-smooth, 2) Each local cost $f_i$ is convex quadratic and $L$-smooth while the aggregate cost $f$ is strongly convex, we show that the gradient-push algorithm with stepsize $α>0$ converges to an $O(α)$-neighborhood of the minimizer of $f$ for a range $α\in (0, c/L]$ with a value $c>0$ independent of $L>0$. As a benefit of the result, we suggest a hybrid algorithm that performs the gradient-push algorithm with a relatively large stepsize $α>0$ for a number of iterations and then go over to perform the Push-DIGing algorithm. It is verified by a numerical test that the hybrid algorithm enhances the performance of the Push-DIGing algorithm significantly. The convergence results of the gradient-push algorithm are also supported by numerical tests.
