Orthogonal greedy algorithm for linear operator learning with shallow neural network
Ye Lin, Jiwei Jia, Young Ju Lee, Ran Zhang
TL;DR
This work addresses learning linear operators associated with PDE solution operators by focusing on kernel (Green's function) estimation. It introduces two orthogonal greedy algorithm (OGA)-based frameworks operating on shallow neural-network dictionaries, including a data-driven semi-inner product and a point-wise kernel estimation variant (PW-OGA) to improve convergence. The authors establish convergence guarantees in the semi-norm, derive rate estimates that align with optimal shallow-network bounds, and demonstrate through extensive 1D–3D PDE experiments that the proposed methods achieve orders-of-magnitude improvements over baselines such as GL, FNO, and DON. The results highlight the practicality and interpretability of kernel-based operator learning with shallow networks, while revealing nuances such as warm-up behavior and overfitting in PW-OGA, guiding future developments in efficient PDE operator learning. Overall, the paper provides a theoretically grounded and empirically validated path to accurate, data-efficient operator learning and Green's-function estimation using shallow models.
Abstract
Greedy algorithms, particularly the orthogonal greedy algorithm (OGA), have proven effective in training shallow neural networks for fitting functions and solving partial differential equations (PDEs). In this paper, we extend the application of OGA to the tasks of linear operator learning, which is equivalent to learning the kernel function through integral transforms. Firstly, a novel greedy algorithm is developed for kernel estimation rate in a new semi-inner product, which can be utilized to approximate the Green's function of linear PDEs from data. Secondly, we introduce the OGA for point-wise kernel estimation to further improve the approximation rate, achieving orders of accuracy improvement across various tasks and baseline models. In addition, we provide a theoretical analysis on the kernel estimation problem and the optimal approximation rates for both algorithms, establishing their efficacy and potential for future applications in PDEs and operator learning tasks.
