Gauss Newton method for solving variational problems of PDEs with neural network discretizaitons
Wenrui Hao, Qingguo Hong, Xianlin Jin
TL;DR
The paper addresses solving PDEs using neural-network discretizations in a variational energy framework and introduces a Gauss-Newton method tailored to this variational form. It derives the update via an approximate Hessian H(\\theta) = J(\\theta) + Q(\\theta), argues that Q is small and can be neglected, and establishes a Gauss-Newton iteration that achieves superlinear convergence to semiregular zeros. It also proves that, under suitable quadrature and rank conditions, the variational GN can be identical to the L2-Gauss-Newton method, and it analyzes a randomized GN variant for scalability. Numerical experiments in 1D, 2D, and 5D demonstrate that Gauss-Newton outperforms gradient-based methods and L-BFGS in accuracy and efficiency, validating the approach and its potential for high-dimensional PDE discretizations.
Abstract
The numerical solution of differential equations using machine learning-based approaches has gained significant popularity. Neural network-based discretization has emerged as a powerful tool for solving differential equations by parameterizing a set of functions. Various approaches, such as the deep Ritz method and physics-informed neural networks, have been developed for numerical solutions. Training algorithms, including gradient descent and greedy algorithms, have been proposed to solve the resulting optimization problems. In this paper, we focus on the variational formulation of the problem and propose a Gauss- Newton method for computing the numerical solution. We provide a comprehensive analysis of the superlinear convergence properties of this method, along with a discussion on semi-regular zeros of the vanishing gradient. Numerical examples are presented to demonstrate the efficiency of the proposed Gauss-Newton method.
