A Deep Learning Framework for Multi-Operator Learning: Architectures and Approximation Theory
Adrien Weihs, Jingmin Sun, Zecheng Zhang, Hayden Schaeffer
TL;DR
The work addresses the challenge of learning operators between function spaces by introducing two architectures, MONet and MNO, to handle families of operators (multi-operator learning) and by providing a rigorous universal approximation framework for continuous, measurable, and Lipschitz operator classes. It integrates scaling laws that quantify how network capacity must grow to achieve a target accuracy, specifically deriving explicit rates for Lipschitz operators and outlining how approximation order and architectural balance affect efficiency. The theory is complemented by extensive numerical experiments on parametric PDE benchmarks, where MONet and MNO demonstrate strong expressive power and computational efficiency relative to existing neural-operator baselines. Overall, the paper establishes a unified, scalable foundation for neural operator learning across operator collections, bridging expressive guarantees with practical design guidance for complex scientific applications.
Abstract
While many problems in machine learning focus on learning mappings between finite-dimensional spaces, scientific applications require approximating mappings between function spaces, i.e., operators. We study the problem of learning collections of operators and provide both theoretical and empirical advances. We distinguish between two regimes: (i) multiple operator learning, where a single network represents a continuum of operators parameterized by a parametric function, and (ii) learning several distinct single operators, where each operator is learned independently. For the multiple operator case, we introduce two new architectures, $\mathrm{MNO}$ and $\mathrm{MONet}$, and establish universal approximation results in three settings: continuous, integrable, or Lipschitz operators. For the latter, we further derive explicit scaling laws that quantify how the network size must grow to achieve a target approximation accuracy. For learning several single operators, we develop a framework for balancing architectural complexity across subnetworks and show how approximation order determines computational efficiency. Empirical experiments on parametric PDE benchmarks confirm the strong expressive power and efficiency of the proposed architectures. Overall, this work establishes a unified theoretical and practical foundation for scalable neural operator learning across multiple operators.
