Operator Learning Using Random Features: A Tool for Scientific Computing
Nicholas H. Nelsen, Andrew M. Stuart
TL;DR
The paper addresses learning operators between infinite-dimensional function spaces to accelerate many-query PDE tasks. It introduces function-valued random features (RFM), a convex, data-driven surrogate that reduces training to a finite-dimensional quadratic problem and corresponds to kernel ridge regression in a low-rank RKHS induced by random features. The authors provide convergence guarantees and error bounds, and demonstrate mesh-invariant, transferable performance on Burgers' equation and Darcy flow operator learning. The approach offers a nonintrusive, scalable alternative to deep neural operators, with practical impact for scientific computing and uncertainty quantification. Overall, RFMs enable reliable, discretization-agnostic operator learning with theoretical support and concrete PDE applications.
Abstract
Supervised operator learning centers on the use of training data, in the form of input-output pairs, to estimate maps between infinite-dimensional spaces. It is emerging as a powerful tool to complement traditional scientific computing, which may often be framed in terms of operators mapping between spaces of functions. Building on the classical random features methodology for scalar regression, this paper introduces the function-valued random features method. This leads to a supervised operator learning architecture that is practical for nonlinear problems yet is structured enough to facilitate efficient training through the optimization of a convex, quadratic cost. Due to the quadratic structure, the trained model is equipped with convergence guarantees and error and complexity bounds, properties that are not readily available for most other operator learning architectures. At its core, the proposed approach builds a linear combination of random operators. This turns out to be a low-rank approximation of an operator-valued kernel ridge regression algorithm, and hence the method also has strong connections to Gaussian process regression. The paper designs function-valued random features that are tailored to the structure of two nonlinear operator learning benchmark problems arising from parametric partial differential equations. Numerical results demonstrate the scalability, discretization invariance, and transferability of the function-valued random features method.
