Efficient Shallow Ritz Method For 1D Diffusion-Reaction Problems
Zhiqiang Cai, Anastassia Doktorova, Robert D. Falgout, César Herrera
TL;DR
This work develops an efficient shallow Ritz discretization for one-dimensional diffusion-reaction problems by coupling a shallow ReLU NN with a damped block Newton (dBN) method, enabling near-optimal approximation orders even for non-smooth cases. Key contributions include a rigorous optimality framework, an $O(n)$-cost inversion strategy for the dense, ill-conditioned mass matrix via algebraic and geometric factorizations, and a reduced, invertible nonlinear system that sustains Newton updates despite Hessian singularities. The authors extend the framework to least-squares problems, demonstrate adaptive breakpoint placement via ANE/AdBN to improve convergence, and validate the approach with numerical experiments showing competitive or superior performance to L-BFGS and adaptive FEM, particularly for singularly perturbed problems. Overall, the method offers a scalable, accurate, and adaptable solver for 1D problems and provides a pathway to higher-dimensional extensions and rigorous convergence analyses in future work.
Abstract
This paper studies the shallow Ritz method for solving one-dimensional diffusion-reaction problems. The method is capable of improving the order of approximation for non-smooth problems. By following a similar approach to the one presented in [9], we present a damped block Newton (dBN) method to achieve nearly optimal order of approximation. The dBN method optimizes the Ritz functional by alternating between the linear and non-linear parameters of the shallow ReLU neural network (NN). For diffusion-reaction problems, new difficulties arise: (1) for the linear parameters, the mass matrix is dense and even more ill-conditioned than the stiffness matrix, and (2) for the non-linear parameters, the Hessian matrix is dense and may be singular. This paper addresses these challenges, resulting in a dBN method with computational cost of ${\cal O}(n)$. The ideas presented for diffusion-reaction problems can also be applied to least-squares approximation problems. For both applications, starting with the non-linear parameters as a uniform partition, numerical experiments show that the dBN method moves the mesh points to nearly optimal locations.
