Thompson Sampling in Function Spaces via Neural Operators
Rafael Oliveira, Xuesong Wang, Kian Ming A. Chai, Edwin V. Bonilla
TL;DR
This work extends Thompson sampling to optimization over function spaces by treating neural operators as surrogates for unknown solution operators ${G}_*:{\mathcal A}\to{\mathcal U}$ and optimizing known functionals $f:{\mathcal U}\to\mathbb{R}$ of their outputs. It provides a sample-then-optimize framework (NOTS) that avoids explicit posterior uncertainty quantification by leveraging the infinite-width GP correspondence of neural operators via the conjugate kernel, yielding sublinear Bayesian regret in the finite-domain setting. The authors establish a theoretical bridge between neural operators and Gaussian processes for operator-valued kernels, derive regret guarantees for NOTS, and validate the approach on PDE benchmarks (Darcy flow and shallow-water) where functionals of the operator output are optimized. The results show significant sample-efficiency improvements over GP-based and neural TS baselines, highlighting NOTS's scalability to high-dimensional, function-valued inputs and outputs with practical PDE applications.
Abstract
We propose an extension of Thompson sampling to optimization problems over function spaces where the objective is a known functional of an unknown operator's output. We assume that queries to the operator (such as running a high-fidelity simulator or physical experiment) are costly, while functional evaluations on the operator's output are inexpensive. Our algorithm employs a sample-then-optimize approach using neural operator surrogates. This strategy avoids explicit uncertainty quantification by treating trained neural operators as approximate samples from a Gaussian process (GP) posterior. We derive regret bounds and theoretical results connecting neural operators with GPs in infinite-dimensional settings. Experiments benchmark our method against other Bayesian optimization baselines on functional optimization tasks involving partial differential equations of physical systems, demonstrating better sample efficiency and significant performance gains.
