Variational Physics-Informed Neural Networks For Solving Partial Differential Equations
E. Kharazmi, Z. Zhang, G. E. Karniadakis
TL;DR
The paper introduces variational physics-informed neural networks (VPINN), a Petrov-Galerkin framework that replaces the PINN strong-form residual with a variational (weak) residual. By selecting neural networks as the trial space and flexible test spaces (e.g., sine functions or Legendre polynomials), VPINN leverages integration by parts to lower the differential order and reduce training cost while improving accuracy. The authors derive analytic variational residuals for shallow networks and demonstrate substantial performance gains over PINNs on 1D Burgers and Poisson problems, and extend the approach to deep networks using Gauss-type quadrature. Through 1D and 2D numerical experiments, VPINN shows higher accuracy and faster convergence, highlighting its potential for efficient PDE solving with neural networks and offering directions for future quadrature-based analyses and domain-decomposed learning.
Abstract
Physics-informed neural networks (PINNs) [31] use automatic differentiation to solve partial differential equations (PDEs) by penalizing the PDE in the loss function at a random set of points in the domain of interest. Here, we develop a Petrov-Galerkin version of PINNs based on the nonlinear approximation of deep neural networks (DNNs) by selecting the {\em trial space} to be the space of neural networks and the {\em test space} to be the space of Legendre polynomials. We formulate the \textit{variational residual} of the PDE using the DNN approximation by incorporating the variational form of the problem into the loss function of the network and construct a \textit{variational physics-informed neural network} (VPINN). By integrating by parts the integrand in the variational form, we lower the order of the differential operators represented by the neural networks, hence effectively reducing the training cost in VPINNs while increasing their accuracy compared to PINNs that essentially employ delta test functions. For shallow networks with one hidden layer, we analytically obtain explicit forms of the \textit{variational residual}. We demonstrate the performance of the new formulation for several examples that show clear advantages of VPINNs over PINNs in terms of both accuracy and speed.
