Sequential infinite-dimensional Bayesian optimal experimental design with derivative-informed latent attention neural operator
Jinwoo Go, Peng Chen
TL;DR
This work tackles sequential Bayesian optimal experimental design (SBOED) for PDE-constrained systems with infinite-dimensional parameters by introducing an adaptive terminal formulation and an equivalent conditional KL objective, enabling scalable global decision-making. It couples Laplace and low-rank approximations of the posterior with a novel derivative-informed latent attention neural operator (LANO) that compresses inputs and outputs with a derivative-informed subspace (DIS) and PCA, propagates dynamics through latent attention, and yields accurate parameter-to-observable (PtO) maps and, via automatic differentiation, their Jacobians. Numerical experiments on the design of MRI acquisition times for monitoring tumor growth show that LANO computes MAP points and eigenvalues with high accuracy while delivering substantial speedups (for example, up to 388× faster for PtO evaluations and 1364× faster for eigenpairs), for an amortized speedup of around 180×. The proposed framework offers a practical, scalable pathway to high-dimensional SBOED and suggests extensions to variational inference and spatial sensor placement for broader predictive digital-twin applications.
Abstract
We develop a new computational framework to solve sequential Bayesian optimal experimental design (SBOED) problems constrained by large-scale partial differential equations with infinite-dimensional random parameters. We propose an adaptive terminal formulation of the optimality criteria for SBOED to achieve adaptive global optimality. We also establish an equivalent optimization formulation that is computationally simpler, enabled by Laplace and low-rank approximations of the posterior. To accelerate the solution of the SBOED problem, we develop a derivative-informed latent attention neural operator (LANO), a new neural network surrogate model that leverages (1) derivative-informed dimension reduction for latent encoding, (2) an attention mechanism to capture the dynamics in the latent space, and (3) efficient training in the latent space augmented by projected Jacobians. Together, these yield an efficient, accurate, and scalable surrogate for computing not only the parameter-to-observable (PtO) maps but also their Jacobians. We further formulate the computation of the maximum a posteriori (MAP) points, the eigenpairs, and posterior samples by LANO in the reduced spaces, and use these computations to solve the SBOED problem. We demonstrate that LANO is more accurate than two other neural architectures and closely matches the finite element method (FEM) in computing MAP points and eigenvalues when solving the SBOED problem, with application to designing the times at which MRI images are taken to monitor tumor growth. We show that the proposed computational framework achieves an amortized $180\times$ speedup.
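
To illustrate why the MAP point and the dominant eigenvalues are the key quantities, the sketch below evaluates one standard information-gain measure, the KL divergence from a Gaussian prior to a Laplace (Gaussian) approximation of the posterior, using a low-rank truncation. It is not the authors' implementation, and the exact objective in the paper may differ. The function name `laplace_kl_low_rank` and both arguments are hypothetical; the eigenvalues are assumed to be those of the prior-preconditioned data-misfit Hessian, and the formula is the standard closed-form KL divergence between two Gaussians.

```python
import math


def laplace_kl_low_rank(eigenvalues, mean_shift_sq):
    """Approximate KL(posterior || prior) for a Laplace-approximated posterior.

    eigenvalues: dominant eigenvalues of the prior-preconditioned
        data-misfit Hessian (assumed nonnegative); the discarded
        eigenvalues are treated as zero, which is the low-rank truncation.
    mean_shift_sq: squared prior-covariance-weighted distance between
        the MAP point and the prior mean.
    """
    if any(lam < 0.0 for lam in eigenvalues):
        raise ValueError("eigenvalues are assumed to be nonnegative")
    # log det(I + H) restricted to the retained eigenvalues
    log_det = sum(math.log1p(lam) for lam in eigenvalues)
    # tr(H (I + H)^{-1}) restricted to the retained eigenvalues
    trace = sum(lam / (1.0 + lam) for lam in eigenvalues)
    return 0.5 * (log_det - trace + mean_shift_sq)


# Hypothetical, rapidly decaying spectrum; values are illustrative only.
example_eigenvalues = [50.0, 10.0, 1.0, 0.1, 0.01]
example_gain = laplace_kl_low_rank(example_eigenvalues, mean_shift_sq=2.0)
```

Each retained eigenvalue contributes independently, and eigenvalues near zero contribute almost nothing, so a rapidly decaying spectrum can be truncated at low rank with little loss. Under these assumptions, a surrogate that supplies accurate MAP points and eigenvalues is enough to evaluate such a criterion cheaply for many candidate designs.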
