A Mini-Batch Method for Solving Nonlinear PDEs with Gaussian Processes
Xianjin Yang, Houman Owhadi
TL;DR
This work introduces a mini-batch framework for solving nonlinear PDEs with Gaussian processes (GPs). It avoids the cubic $O(N^3)$ cost of inverting the full covariance matrix over $N$ samples by performing each update on a small mini-batch of size $M$, coupled through a slack variable $\boldsymbol{z}$. The formulation yields a finite-dimensional representer expression and an $O(M^3)$ per-iteration cost, with convergence to a near-stationary point at a rate of $O\left(\frac{1}{K}+\frac{1}{M}\right)$ under stability and weak convexity assumptions, where $K$ is the number of iterations. Numerical experiments on a nonlinear elliptic PDE and Burgers' equation show that, with appropriately sized mini-batches, the method achieves accuracy comparable to full GP solvers at significantly lower computational cost. Overall, the approach combines GP regression theory in reproducing kernel Hilbert spaces (RKHS) with stochastic proximal optimization to produce scalable, principled solvers for nonlinear PDEs, with potential for uncertainty quantification.
Abstract
Gaussian process (GP) based methods for solving partial differential equations (PDEs) show great promise by bridging the gap between the theoretical rigor of traditional numerical algorithms and the flexible design of machine learning solvers. The main bottleneck of GP methods is the inversion of a covariance matrix, whose cost grows cubically with the number of samples. Drawing inspiration from neural networks, we propose a mini-batch algorithm combined with GPs to solve nonlinear PDEs. A naive deployment of stochastic gradient descent for solving PDEs with GPs is challenging, because the objective function in the requisite minimization problem cannot be expressed as the expectation of a finite-dimensional random function. To address this issue, we apply a mini-batch method to the corresponding infinite-dimensional minimization problem over function spaces. The algorithm takes a mini-batch of samples at each step to update the GP model, so the computational cost is spread across iterations. Using stability analysis and convexity arguments, we show that the mini-batch method steadily reduces a natural measure of error towards zero at the rate of $O(1/K+1/M)$, where $K$ is the number of iterations and $M$ is the batch size.
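The cost argument above can be illustrated with a minimal sketch. It is not the algorithm proposed in the paper: it only fits a plain GP regression interpolant on one randomly drawn mini-batch, to show why working with an $M \times M$ covariance matrix instead of an $N \times N$ one reduces the cubic factorization cost. The kernel choice, lengthscale, nugget, target function, and all function names are assumptions made for illustration.

```python
import numpy as np

def gaussian_kernel(x, y, lengthscale=0.2):
    """Gaussian (RBF) kernel matrix between 1-D point sets x and y (assumed kernel)."""
    d = x[:, None] - y[None, :]
    return np.exp(-d**2 / (2.0 * lengthscale**2))

def fit_on_batch(x_batch, y_batch, nugget=1e-8):
    """Solve (K + nugget * I) alpha = y on a mini-batch of M points.

    The Cholesky factorization is the O(M^3) step. General solves are used
    for brevity; a dedicated triangular solver would be cheaper.
    """
    K = gaussian_kernel(x_batch, x_batch) + nugget * np.eye(len(x_batch))
    L = np.linalg.cholesky(K)
    return np.linalg.solve(L.T, np.linalg.solve(L, y_batch))

def predict(x_new, x_batch, alpha):
    """Evaluate the kernel interpolant built from the mini-batch."""
    return gaussian_kernel(x_new, x_batch) @ alpha

rng = np.random.default_rng(0)
N, M = 2000, 50                        # full sample size and batch size (illustrative)
x_all = np.linspace(0.0, 1.0, N)
y_all = np.sin(2.0 * np.pi * x_all)    # hypothetical target, not a PDE solution

idx = rng.choice(N, size=M, replace=False)   # draw one mini-batch
alpha = fit_on_batch(x_all[idx], y_all[idx])

# Residual of the interpolant on its own mini-batch.
batch_residual = np.max(np.abs(predict(x_all[idx], x_all[idx], alpha) - y_all[idx]))

# Ratio of cubic costs: one N x N factorization versus one M x M factorization.
cost_ratio = (N / M) ** 3
```

With these illustrative sizes, a single full factorization costs roughly as much as $(N/M)^3$ mini-batch factorizations, which is the budget a mini-batch scheme can spend across its $K$ iterations.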
