Kernel Neural Operators (KNOs) for Scalable, Memory-efficient, Geometrically-flexible Operator Learning
Matthew Lowery, John Turnage, Zachary Morrow, John D. Jakeman, Akil Narayan, Shandian Zhe, Varun Shankar
TL;DR
Kernel Neural Operators (KNOs) introduce a provably convergent operator-learning framework that stacks deep kernel-based integral operators with trainable, closed-form kernels. By decoupling the kernel from the numerical quadrature and allowing domain-specific quadrature rules, KNOs achieve geometric flexibility on irregular domains and use dimension-wise factorization to mitigate the curse of dimensionality on regular grids. The architecture combines a lifting operator, a sequence of integral operators with diagonal matrix-valued kernels (notably NS-GSM), nonlinear activations, and a final projection, enabling learnable operators from functions to functions with far fewer parameters than FFT-based or transformer-based rivals. Universal approximation guarantees are established for both infinite-dimensional and discretized formulations, and empirical results on seven benchmarks show competitive accuracy with substantially reduced parameter counts and memory footprints, including strong performance on irregular geometries and zero-shot super-resolution capabilities. These properties make KNOs attractive as memory-efficient surrogates for PDE operators and other function-to-function mappings in resource-constrained settings.
Abstract
This paper introduces the Kernel Neural Operator (KNO), a provably convergent operator-learning architecture that utilizes compositions of deep kernel-based integral operators for function-space approximation of operators (maps from functions to functions). The KNO decouples the choice of kernel from the numerical integration scheme (quadrature), thereby naturally allowing for operator learning with explicitly-chosen trainable kernels on irregular geometries. On irregular domains, this allows the KNO to utilize domain-specific quadrature rules. To help ameliorate the curse of dimensionality, we also leverage an efficient dimension-wise factorization algorithm on regular domains. More importantly, the ability to explicitly specify kernels also allows the use of highly expressive, non-stationary, neural anisotropic kernels whose parameters are computed by training neural networks. Numerical results demonstrate that on existing benchmarks the training and test accuracy of KNOs is comparable to or higher than popular operator learning techniques while typically using an order of magnitude fewer trainable parameters, with the more expressive kernels proving important to attaining high accuracy. KNOs thus facilitate low-memory, geometrically-flexible, deep operator learning, while retaining the implementation simplicity and transparency of traditional kernel methods from both scientific computing and machine learning.
