SVD-Preconditioned Gradient Descent Method for Solving Nonlinear Least Squares Problems
Zhipeng Chang, Wenrui Hao, Nian Liu
TL;DR
This work introduces SPGD, a SVD-based preconditioned gradient method for nonlinear least-squares problems, and integrates it with Adam-style adaptivity to form SPGD-Adam. By leveraging the local spectral information of the Jacobian, SPGD achieves a more favorable convergence factor than classical gradient descent, and the modified Adam variant provides global convergence under AMSGrad-style stabilization and regularized preconditioning. The authors supply a rigorous convergence analysis establishing local linear convergence for SPGD and global convergence for the modified Adam framework, along with detailed bounds on error terms. Empirically, SPGD and SPGD-Adam demonstrate faster convergence and lower residuals across function-approximation tasks, PDE-like problems, and image-classification settings (CIFAR-10) compared with standard Adam, highlighting the practical impact of problem-structure–driven preconditioning.
Abstract
This paper introduces a novel optimization algorithm designed for nonlinear least-squares problems. The method is derived by preconditioning the gradient descent direction using the Singular Value Decomposition (SVD) of the Jacobian. This SVD-based preconditioner is then integrated with the first- and second-moment adaptive learning rate mechanism of the Adam optimizer. We establish the local linear convergence of the proposed method under standard regularity assumptions and prove global convergence for a modified version of the algorithm under suitable conditions. The effectiveness of the approach is demonstrated experimentally across a range of tasks, including function approximation, partial differential equation (PDE) solving, and image classification on the CIFAR-10 dataset. Results show that the proposed method consistently outperforms standard Adam, achieving faster convergence and lower error in both regression and classification settings.
