A Mathematical Guide to Operator Learning
Nicolas Boullé, Alex Townsend
TL;DR
This survey frames neural operator learning as learning the action of a (potentially nonlinear) operator between function spaces, unifying approaches like DeepONet, Fourier neural operators, Green-based learning, and graph-based operators under a common lens linked to numerical linear algebra. It details how discretization turns operators into structured matrices (low-rank, circulant, banded, hierarchical) and how these structures guide architecture choices and efficiency, including multiscale MGNOs and spectral methods. The authors discuss data-generation strategies via Gaussian-process source terms, solver choices (FEM/FDM/spectral), and practical optimization considerations (losses, optimizers, convergence), while highlighting zero-shot super-resolution and data-efficiency findings. They also outline challenges and directions—software, theory, physical properties, and real-world deployments—emphasizing interpretability and the discovery of unknown physics through operator learning.
Abstract
Operator learning aims to discover properties of an underlying dynamical system or partial differential equation (PDE) from data. Here, we present a step-by-step guide to operator learning. We explain the types of problems and PDEs amenable to operator learning, discuss various neural network architectures, and explain how to employ numerical PDE solvers effectively. We also give advice on how to create and manage training data and conduct optimization. We offer intuition behind the various neural network architectures employed in operator learning by motivating them from the point-of-view of numerical linear algebra.
