Distributed Inexact Newton Method with Adaptive Step Sizes
Dusan Jakovetic, Natasa Krejic, Greta Malaspina
TL;DR
This work tackles distributed optimization on networks for two common problems: distributed personalized optimization using a penalty formulation Φβ and distributed consensus optimization via a min-sum objective. It introduces DINAS, a Distributed Inexact Newton method with Adaptive Step sizes, which computes Newton directions inexactly through a distributed Jacobi Overrelaxation method and uses a Polyak-inspired adaptive step to guarantee global convergence while maintaining fast local convergence. The authors prove global convergence and establish rates (linear, superlinear, quadratic) in the personalized setting, derive linear convergence with respect to communication rounds, and show that solving a sequence of penalty problems with decreasing β yields convergence to the consensus solution. They also provide a centralized inexact Newton analysis with adaptive steps, and deliver extensive numerical results demonstrating significant improvements in both computation and communication costs over state-of-the-art methods, including for ill-conditioned and large-scale problems. Overall, the method reduces global constants requirements, avoids Hessian inverses at each node, and achieves substantial practical gains in distributed second-order optimization.
Abstract
We consider two formulations for distributed optimization wherein $N$ agents in a generic connected network solve a problem of common interest: distributed personalized optimization and consensus optimization. A new method termed DINAS (Distributed Inexact Newton method with Adaptive Stepsize) is proposed. DINAS employs large adaptively computed step-sizes, requires a reduced global parameters knowledge with respect to existing alternatives, and can operate without any local Hessian inverse calculations nor Hessian communications. When solving personalized distributed learning formulations, DINAS achieves quadratic convergence with respect to computational cost and linear convergence with respect to communication cost, the latter rate being independent of the local functions condition numbers or of the network topology. When solving consensus optimization problems, DINAS is shown to converge to the global solution. Extensive numerical experiments demonstrate significant improvements of DINAS over existing alternatives. As a result of independent interest, we provide for the first time convergence analysis of the Newton method with the adaptive Polyak's step-size when the Newton direction is computed inexactly in centralized environment.
