Decentralized Finite-Sum Optimization over Time-Varying Networks
Dmitry Metelev, Savelii Chezhegov, Alexander Rogozin, Aleksandr Beznosikov, Alexander Sholokhov, Alexander Gasnikov, Dmitry Kovalev
TL;DR
The paper studies decentralized finite-sum optimization over time-varying networks, addressing both strongly convex and nonconvex objectives. It introduces ADOM+VR for strongly convex problems and GT-PAGE for nonconvex problems, integrating variance reduction with gradient tracking to achieve improved convergence under time-varying topologies. It provides new lower bounds on both communication and stochastic oracle complexity, clarifying fundamental limits and the potential and limits of proposed algorithms in comparison to static-network results. Theoretical guarantees are complemented by numerical experiments on LibSVM datasets, illustrating the practical effectiveness and trade-offs of the proposed methods in decentralized, changing networks.
Abstract
We consider decentralized time-varying stochastic optimization problems where each of the functions held by the nodes has a finite sum structure. Such problems can be efficiently solved using variance reduction techniques. Our aim is to explore the lower complexity bounds (for communication and number of stochastic oracle calls) and find optimal algorithms. The paper studies strongly convex and nonconvex scenarios. To the best of our knowledge, variance reduced schemes and lower bounds for time-varying graphs have not been studied in the literature. For nonconvex objectives, we obtain lower bounds and develop an optimal method GT-PAGE. For strongly convex objectives, we propose the first decentralized time-varying variance-reduction method ADOM+VR and establish lower bound in this scenario, highlighting the open question of matching the algorithms complexity and lower bounds even in static network case.
