NLTGCR: A class of Nonlinear Acceleration Procedures based on Conjugate Residuals
Huan He, Ziyuan Tang, Shifan Zhao, Yousef Saad, Yuanzhe Xi
TL;DR
nlTGCR is a nonlinear acceleration framework that extends the linear TGCR method to general nonlinear systems by leveraging a local, Jacobian-driven linear model and a residual-minimization projection. It subsumes elements of Anderson acceleration and inexact/quasi-Newton methods, and introduces an adaptive update mechanism that switches between nonlinear-residual and linearized updates to balance robustness and efficiency. Theoretical results connect nlTGCR to multi-secant updates, provide line-search-based convergence guarantees, and extend to stochastic settings; empirical tests across Bratu PDEs, Lennard-Jones geometry optimization, ResNet training, Neural-ODE learning, and GCNs demonstrate superior convergence speed and robustness, particularly when symmetry is present. The work highlights symmetry-exploitation, flexible memory, and global-convergence strategies as practical advantages, with promising potential for deep learning and large-scale nonlinear problems where Jacobian operations are accessible.
Abstract
This paper develops a new class of nonlinear acceleration algorithms based on extending conjugate residual-type procedures from linear to nonlinear equations. The main algorithm has strong similarities with Anderson acceleration as well as with inexact Newton methods - depending on which variant is implemented. We prove theoretically and verify experimentally, on a variety of problems from simulation experiments to deep learning applications, that our method is a powerful accelerated iterative algorithm.
