Deep learning methods for stochastic Galerkin approximations of elliptic random PDEs
Fabio Musco, Andrea Barth
TL;DR
This work tackles stochastic elliptic PDEs with random forcing and diffusion that may vanish or become unbounded, by replacing traditional stochastic Galerkin solvers with deep learning surrogates. It introduces S-GalerkinNet (strong-form residual) and S-RitzNet (weighted weak-form Ritz energy) to approximate the PC spectral coefficients of the stochastic Galerkin expansion, with the latter offering weaker regularity requirements and applicability to log-Gaussian fields. A theoretical key contribution is a Ritz-energy functional with a unique minimizer, guaranteeing a well-posed stochastic Galerkin approximation in the weighted setting. Empirically, the methods demonstrate enhanced training efficiency and scalability on high-dimensional, fully-coupled SG systems, solving challenging problems (including log-normal-type coefficients) on a single GPU within minutes, thus broadening practical applicability of intrusive uncertainty quantification techniques.
Abstract
This work considers stochastic Galerkin approximations of linear elliptic partial differential equations (PDEs) with stochastic forcing terms and stochastic diffusion coefficients, that cannot be bounded uniformly away from zero and infinity. A traditional numerical method for solving the resulting high-dimensional coupled system of PDEs is replaced by deep learning techniques. In order to achieve this, physics-informed neural networks (PINNs), which typically operate on the strong residual of the PDE and can therefore be applied in a wide range of settings, are considered. As a second approach, the Deep Ritz method, which is a neural network that minimizes the Ritz energy functional to find the weak solution, is employed. While the second approach only works in special cases, it overcomes the necessity of testing in variational problems while maintaining mathematical rigor and ensuring the existence of a unique solution. Furthermore, the residual is of a lower differentiation order, reducing the training cost considerably. The efficiency of the method is demonstrated on several model problems.
