Bayesian D-Optimal Experimental Designs via Column Subset Selection
Srinivas Eswar, Vishwas Rao, Arvind K. Saibaba
TL;DR
The paper reframes Bayesian D-optimal experimental design for linear inverse problems as a Column Subset Selection Problem and develops efficient, provably reliable algorithms based on the Golub–Klema–Stewart framework. It introduces deterministic CSSP methods and multiple randomized, adjoint-free variants (notably RAF-OED) along with a data completion technique (bdeim), providing rigorous bounds and cost analyses dominated by truncated SVDs. The approach yields scalable sensor placement with strong performance guarantees, combining matrix-free computations and parallelizability, and it is validated on 2D heat-equation and seismic travel-time tomography examples. Overall, the work offers a practical, tunable toolkit for OED in large-scale Bayesian inverse problems, with potential extensions to other criteria and non-linear settings.
Abstract
This paper tackles optimal sensor placement for Bayesian linear inverse problems, a popular version of the more general Optimal Experimental Design (OED) problem, using the D-optimality criterion. This is done by establishing connections between sensor placement and Column Subset Selection Problem (CSSP), which is a well-studied problem in Numerical Linear Algebra (NLA). In particular, we use the Golub-Klema-Stewart (GKS) approach which involves computing the truncated Singular Value Decomposition (SVD) followed by a pivoted QR factorization on the right singular vectors. The algorithms are further accelerated by using randomization to compute the low-rank approximation as well as for sampling the indices. The resulting algorithms are robust, computationally efficient, amenable to parallelization, require virtually no parameter tuning, and come with strong theoretical guarantees. One of the proposed algorithms is also adjoint-free which is beneficial in situations, where the adjoint is expensive to evaluate or is not available. Additionally, we develop a method for data completion without solving the inverse problem. Numerical experiments on model inverse problems involving the heat equation and seismic tomography in two spatial dimensions demonstrate the performance of our approaches.
