Adaptive quadratures for nonlinear approximation of low-dimensional PDEs using smooth neural networks
Alexandre Magueresse, Santiago Badia
TL;DR
This paper tackles the accuracy of numerical integration in physics-informed neural networks for low-dimensional PDEs by introducing adaptive quadratures built from CPWL approximations of smooth activations. By decomposing the domain into regions where the network is almost linear, the authors construct Gaussian quadratures on convex cells to evaluate the loss with high precision, and they prove a quadratic convergence rate for CPWL activation approximations. They address the non-smoothness of CPWL networks by smoothing activations and provide a rigorous error analysis that bounds integration error in terms of derivative bounds. Numerical experiments on Poisson problems in 1D and 2D show that the adaptive quadrature reduces the number of integration points needed, improves robustness to initialization, and speeds up convergence compared to Monte Carlo, especially for weak formulations and complex domains. The approach also enables handling polygonal domains without relying on tensor-product meshes, offering a practical pathway toward more reliable NN-based PDE solvers.
Abstract
Physics-informed neural networks (PINNs) and their variants have recently emerged as alternatives to traditional partial differential equation (PDE) solvers, but little literature has focused on devising accurate numerical integration methods for neural networks (NNs), which is essential for getting accurate solutions. In this work, we propose adaptive quadratures for the accurate integration of neural networks and apply them to loss functions appearing in low-dimensional PDE discretisations. We show that at opposite ends of the spectrum, continuous piecewise linear (CPWL) activation functions enable one to bound the integration error, while smooth activations ease the convergence of the optimisation problem. We strike a balance by considering a CPWL approximation of a smooth activation function. The CPWL activation is used to obtain an adaptive decomposition of the domain into regions where the network is almost linear, and we derive an adaptive global quadrature from this mesh. The loss function is then obtained by evaluating the smooth network (together with other quantities, e.g., the forcing term) at the quadrature points. We propose a method to approximate a class of smooth activations by CPWL functions and show that it has a quadratic convergence rate. We then derive an upper bound for the overall integration error of our proposed adaptive quadrature. The benefits of our quadrature are evaluated on a strong and weak formulation of the Poisson equation in dimensions one and two. Our numerical experiments suggest that compared to Monte-Carlo integration, our adaptive quadrature makes the convergence of NNs quicker and more robust to parameter initialisation while needing significantly fewer integration points and keeping similar training times.
