Incremental Quasi-Newton Methods with Faster Superlinear Convergence Rates
Zhuanghua Liu, Luo Luo, Bryan Kian Hsiang Low
TL;DR
This work tackles large-scale finite-sum convex optimization by introducing Lazy Incremental Symmetric Rank-1 (LISR-1) and its block extension LISR-$k$, which apply symmetric rank-1 updates in an incremental setting to estimate curvature. The algorithms maintain low per-iteration cost through lazy updates and a cyclic update scheme, achieving a condition-number-free local superlinear convergence rate, namely $\mathcal{O}((1-d^{-1})^{\lceil t/n\rceil^2})$ for LISR-1 and $\mathcal{O}((1-k/d)^{\lceil t/n\rceil^2})$ for LISR-$k$, with $O(1)$ gradient/Hessian-vector oracle calls and $O(d^2)$ flops per iteration. Theoretical analysis leverages a Hessian-approximation metric $\nu(\cdot,\cdot)$ under Broyden-family updates and greedy SR1 updates to bound convergence and approximation errors. Empirical results on quadratic programs and regularized logistic regression show that the proposed methods outperform baseline IQN variants, validating both the theoretical rates and practical efficiency, with the block variant offering further acceleration at modest additional cost.
Abstract
We consider the finite-sum optimization problem, where each component function is strongly convex and has Lipschitz continuous gradient and Hessian. The recently proposed incremental quasi-Newton method is based on BFGS update and achieves a local superlinear convergence rate that is dependent on the condition number of the problem. This paper proposes a more efficient quasi-Newton method by incorporating the symmetric rank-1 update into the incremental framework, which results in the condition-number-free local superlinear convergence rate. Furthermore, we can boost our method by applying the block update on the Hessian approximation, which leads to an even faster local convergence rate. The numerical experiments show the proposed methods significantly outperform the baseline methods.
