Derivative-informed neural operator acceleration of geometric MCMC for infinite-dimensional Bayesian inverse problems
Lianghao Cao, Thomas O'Leary-Roseberry, Omar Ghattas
TL;DR
The paper introduces derivative-informed neural operators (DINO) trained on joint maps of the parameter-to-observable (PtO) relationship and its Jacobian to accelerate geometric MCMC for infinite-dimensional Bayesian inverse problems. By embedding a reduced-basis DINO surrogate within a delayed-acceptance, dimension-independent geometric MCMC (mMALA) framework, the method avoids online forward/adjoint sensitivity solves while preserving posterior geometry and sampling correctness. The authors provide $H^1_{oldsymbol{}}$-level error analysis, cost decompositions for PDE-based training, and two PDE benchmarks (coefficient inversion in nonlinear diffusion–reaction and heterogeneous hyperelastic material property inference) showing 3–9x faster geometric MCMC and 60–97x faster than prior geometry-based MCMC, with training break-even after 10–25 effective samples. This approach substantially lowers the online cost of Bayesian inference on function spaces and offers a scalable, rigorous route to fast uncertainty quantification in PDE-constrained problems.
Abstract
We propose an operator learning approach to accelerate geometric Markov chain Monte Carlo (MCMC) for solving infinite-dimensional Bayesian inverse problems (BIPs). While geometric MCMC employs high-quality proposals that adapt to posterior local geometry, it requires repeated computations of gradients and Hessians of the log-likelihood, which becomes prohibitive when the parameter-to-observable (PtO) map is defined through expensive-to-solve parametric partial differential equations (PDEs). We consider a delayed-acceptance geometric MCMC method driven by a neural operator surrogate of the PtO map, where the proposal exploits fast surrogate predictions of the log-likelihood and, simultaneously, its gradient and Hessian. To achieve a substantial speedup, the surrogate must accurately approximate the PtO map and its Jacobian, which often demands a prohibitively large number of PtO map samples via conventional operator learning methods. In this work, we present an extension of derivative-informed operator learning [O'Leary-Roseberry et al., J. Comput. Phys., 496 (2024)] that uses joint samples of the PtO map and its Jacobian. This leads to derivative-informed neural operator (DINO) surrogates that accurately predict the observables and posterior local geometry at a significantly lower training cost than conventional methods. Cost and error analysis for reduced basis DINO surrogates are provided. Numerical studies demonstrate that DINO-driven MCMC generates effective posterior samples 3--9 times faster than geometric MCMC and 60--97 times faster than prior geometry-based MCMC. Furthermore, the training cost of DINO surrogates breaks even compared to geometric MCMC after just 10--25 effective posterior samples.
