Table of Contents
Fetching ...

Incremental Gauss--Newton Methods with Superlinear Convergence Rates

Zhiling Zhou, Zhuanghua Liu, Chengchang Liu, Luo Luo

TL;DR

This work addresses solving systems ${\bf f}({\bf x})={\bf 0}$ with a Jacobian that is Hölder continuous of exponent $\nu\in(0,1]$ and a nondegenerate solution ${\bf x}^*$. It introduces Incremental Gauss--Newton (IGN), which updates one component per iteration and maintains aggregated estimates ${\bf H}^t$ and ${\bf G}^t={\bf H}^t^{-1}$ to achieve explicit local superlinear convergence, plus a mini-batch extension MB-IGN for faster rates. The authors prove convergence guarantees: $\|{\bf x}^{t+1}-{f x}^*\|$ decays at rate $r_{t+1} \le c^{(1+\nu)^{\lfloor t/n\rfloor-1}} r_t$ with a problem-dependent constant $c$, and in the Lipschitz-Jacobian case ($\nu=1$) obtain $n$-step quadratic convergence; MB-IGN generalizes this with batch size $k$ and $m=\lceil n/k\rceil$, yielding analogous rates. Numerical experiments on regularized logistic regression, Chandrasekhar's H-equation, and soft maximum minimization demonstrate that IGN/MB-IGN achieve faster convergence and lower computation time than several baselines, validating the practical benefit of incremental Newton-type methods for large-scale nonlinear equations.

Abstract

This paper addresses the challenge of solving large-scale nonlinear equations with Hölder continuous Jacobians. We introduce a novel Incremental Gauss--Newton (IGN) method within explicit superlinear convergence rate, which outperforms existing methods that only achieve linear convergence rate. In particular, we formulate our problem by the nonlinear least squares with finite-sum structure, and our method incrementally iterates with the information of one component in each round. We also provide a mini-batch extension to our IGN method that obtains an even faster superlinear convergence rate. Furthermore, we conduct numerical experiments to show the advantages of the proposed methods.

Incremental Gauss--Newton Methods with Superlinear Convergence Rates

TL;DR

This work addresses solving systems with a Jacobian that is Hölder continuous of exponent and a nondegenerate solution . It introduces Incremental Gauss--Newton (IGN), which updates one component per iteration and maintains aggregated estimates and to achieve explicit local superlinear convergence, plus a mini-batch extension MB-IGN for faster rates. The authors prove convergence guarantees: decays at rate with a problem-dependent constant , and in the Lipschitz-Jacobian case () obtain -step quadratic convergence; MB-IGN generalizes this with batch size and , yielding analogous rates. Numerical experiments on regularized logistic regression, Chandrasekhar's H-equation, and soft maximum minimization demonstrate that IGN/MB-IGN achieve faster convergence and lower computation time than several baselines, validating the practical benefit of incremental Newton-type methods for large-scale nonlinear equations.

Abstract

This paper addresses the challenge of solving large-scale nonlinear equations with Hölder continuous Jacobians. We introduce a novel Incremental Gauss--Newton (IGN) method within explicit superlinear convergence rate, which outperforms existing methods that only achieve linear convergence rate. In particular, we formulate our problem by the nonlinear least squares with finite-sum structure, and our method incrementally iterates with the information of one component in each round. We also provide a mini-batch extension to our IGN method that obtains an even faster superlinear convergence rate. Furthermore, we conduct numerical experiments to show the advantages of the proposed methods.
Paper Structure (26 sections, 26 theorems, 194 equations, 3 figures, 1 table, 2 algorithms)

This paper contains 26 sections, 26 theorems, 194 equations, 3 figures, 1 table, 2 algorithms.

Key Result

Proposition 1

Under Assumption asm:b-J-sing, it holds that

Figures (3)

  • Figure 1: Experimental results of time (s) vs. $\|{\bf{f}}({\bf{x}})\|$ for MB-IGN with different mini-batch size $k$.
  • Figure 2: Experimental results of epochs vs. $\|{\bf{f}}({\bf{x}})\|$ for all methods.
  • Figure 3: Experimental results of time (s) vs. $\|{\bf{f}}({\bf{x}})\|$ for all methods.

Theorems & Definitions (51)

  • Proposition 1
  • Lemma 1
  • Lemma 2
  • Theorem 1
  • Corollary 1
  • Corollary 2
  • Theorem 2
  • Corollary 3
  • Lemma 3
  • proof
  • ...and 41 more