Linearization Turns Neural Operators into Function-Valued Gaussian Processes
Emilia Magnani, Marvin Pförtner, Tobias Weber, Philipp Hennig
TL;DR
The paper introduces LUNO, a practical framework for uncertainty quantification in neural operators by linearizing a Gaussian weight belief to produce a function-valued Gaussian process over the operator outputs. It establishes a theoretical link, via probabilistic currying, between Banach-space–valued function processes and augmented-input Gaussian processes, enabling post-hoc, scalable uncertainty without retraining. The method is demonstrated on Fourier neural operators (FNOs) and specializes to a computable last-layer Laplace variant (LUNO-LA), showing improved calibration and predictive performance in low-data and out-of-distribution scenarios. This work enables principled, resolution-agnostic uncertainty for operator learning with potential impact on safe scientific computing and active learning in PDE contexts.
Abstract
Neural operators generalize neural networks to learn mappings between function spaces from data. They are commonly used to learn solution operators of parametric partial differential equations (PDEs) or propagators of time-dependent PDEs. However, to make them useful in high-stakes simulation scenarios, their inherent predictive error must be quantified reliably. We introduce LUNO, a novel framework for approximate Bayesian uncertainty quantification in trained neural operators. Our approach leverages model linearization to push (Gaussian) weight-space uncertainty forward to the neural operator's predictions. We show that this can be interpreted as a probabilistic version of the concept of currying from functional programming, yielding a function-valued (Gaussian) random process belief. Our framework provides a practical yet theoretically sound way to apply existing Bayesian deep learning methods such as the linearized Laplace approximation to neural operators. Just as the underlying neural operator, our approach is resolution-agnostic by design. The method adds minimal prediction overhead, can be applied post-hoc without retraining the network, and scales to large models and datasets. We evaluate these aspects in a case study on Fourier neural operators.
