Decentralized Conjugate Gradient and Memoryless BFGS Methods
Liping Wang, Hao Wu, Hongchao Zhang
TL;DR
This work addresses decentralized optimization over a connected network by introducing two methods: NDCG, a gradient-tracking enhanced decentralized conjugate gradient designed for nonconvex objectives, and DMBFGS, a memoryless BFGS variant for strongly convex objectives. NDCG achieves global convergence with a constant stepsize by integrating gradient tracking and a PRP-type conjugate parameter, while DMBFGS yields global linear convergence through adaptive self-scaling quasi-Newton updates based solely on gradient information. The authors prove convergence results under mild assumptions, including Lipschitz gradients and strong convexity, and analyze rates in terms of problem and network condition numbers. Numerical experiments on nonconvex logistic regression and strongly convex linear/logistic regression demonstrate that NDCG and DMBFGS outperform state-of-the-art decentralized first-order and quasi-Newton methods in both iteration and communication efficiency, highlighting practical impact for scalable distributed optimization.
Abstract
This paper proposes a new decentralized conjugate gradient (NDCG) method and a decentralized memoryless BFGS (DMBFGS) method for the nonconvex and strongly convex decentralized optimization problem, respectively, of minimizing a finite sum of continuously differentiable functions over a fixed-connected undirected network. Gradient tracking techniques are applied in these two methods to enhance their convergence properties and the numerical stability. In particular, we show global convergence of NDCG with constant stepsize for general nonconvex smooth decentralized optimization. Our new DMBFGS method uses a scaled memoryless BFGS technique and only requires gradient information to approximate second-order information of the component functions in the objective. We also establish global convergence and linear convergence rate of DMBFGS with constant stepsize for strongly convex smooth decentralized optimization. Our numerical results show that NDCG and DMBFGS are very efficient in terms of both iteration and communication cost compared with other state-of-the-art methods for solving smooth decentralized optimization.
