Accelerating the convergence of Newton's method for nonlinear elliptic PDEs using Fourier neural operators
Joubine Aghili, Emmanuel Franck, Romain Hild, Victor Michel-Dansac, Vincent Vigon
TL;DR
This work addresses the slow convergence of Newton's method for nonlinear elliptic PDEs discretized by finite differences by learning a mesh-robust initial guess with Fourier neural operators (FNOs). The authors train an FNO to map PDE data $(\Phi,K)$ to an approximate discrete solution, using a discretization-informed loss that couples data fidelity with the PDE residual, enabling mesh-independent predictions. Across 1D and 2D tests of nonlinear and anisotropic diffusion, the FNO-based initialization substantially reduces the number of Newton iterations and CPU time, especially on coarser grids, and remains effective when solving multiple realizations or time-dependent problems. The approach offers a practical offline/online workflow with potential extensions to unstructured geometries and multi-solution PDEs, and it compares favorably to existing line-search and ML-based initializers in both performance and robustness.
Abstract
It is well known that Newton's method can have trouble converging if the initial guess is too far from the solution. Such a problem particularly occurs when this method is used to solve nonlinear elliptic partial differential equations (PDEs) discretized via finite differences. This work focuses on accelerating Newton's method convergence in this context. We seek to construct a mapping from the parameters of the nonlinear PDE to an approximation of its discrete solution, independently of the mesh resolution. This approximation is then used as an initial guess for Newton's method. To achieve these objectives, we elect to use a Fourier neural operator (FNO). The loss function is the sum of a data term (i.e., the comparison between known solutions and outputs of the FNO) and a physical term (i.e., the residual of the PDE discretization). Numerical results, in one and two dimensions, show that the proposed initial guess accelerates the convergence of Newton's method by a large margin compared to a naive initial guess, especially for highly nonlinear and anisotropic problems, with larger gains on coarse grids.
