Orthogonal greedy algorithm for linear operator learning with shallow neural network

Ye Lin; Jiwei Jia; Young Ju Lee; Ran Zhang

Orthogonal greedy algorithm for linear operator learning with shallow neural network

Ye Lin, Jiwei Jia, Young Ju Lee, Ran Zhang

TL;DR

This work addresses learning linear operators associated with PDE solution operators by focusing on kernel (Green's function) estimation. It introduces two orthogonal greedy algorithm (OGA)-based frameworks operating on shallow neural-network dictionaries, including a data-driven semi-inner product and a point-wise kernel estimation variant (PW-OGA) to improve convergence. The authors establish convergence guarantees in the semi-norm, derive rate estimates that align with optimal shallow-network bounds, and demonstrate through extensive 1D–3D PDE experiments that the proposed methods achieve orders-of-magnitude improvements over baselines such as GL, FNO, and DON. The results highlight the practicality and interpretability of kernel-based operator learning with shallow networks, while revealing nuances such as warm-up behavior and overfitting in PW-OGA, guiding future developments in efficient PDE operator learning. Overall, the paper provides a theoretically grounded and empirically validated path to accurate, data-efficient operator learning and Green's-function estimation using shallow models.

Abstract

Greedy algorithms, particularly the orthogonal greedy algorithm (OGA), have proven effective in training shallow neural networks for fitting functions and solving partial differential equations (PDEs). In this paper, we extend the application of OGA to the tasks of linear operator learning, which is equivalent to learning the kernel function through integral transforms. Firstly, a novel greedy algorithm is developed for kernel estimation rate in a new semi-inner product, which can be utilized to approximate the Green's function of linear PDEs from data. Secondly, we introduce the OGA for point-wise kernel estimation to further improve the approximation rate, achieving orders of accuracy improvement across various tasks and baseline models. In addition, we provide a theoretical analysis on the kernel estimation problem and the optimal approximation rates for both algorithms, establishing their efficacy and potential for future applications in PDEs and operator learning tasks.

Orthogonal greedy algorithm for linear operator learning with shallow neural network

TL;DR

Abstract

Paper Structure (33 sections, 7 theorems, 75 equations, 18 figures, 9 tables, 4 algorithms)

This paper contains 33 sections, 7 theorems, 75 equations, 18 figures, 9 tables, 4 algorithms.

Introduction
Operator learning and kernel estimation
Operator learning
Linear operator learning and kernel estimation
Nonlinear dictionary approximation with shallow neural network
Nonlinear dictionary approximation
Shallow neural network for function approximation in a Hilbert space
Inner product in the linear operator learning setting
Orthogonal greedy algorithm for function approximation
Orthogonal greedy algorithm for kernel estimation
Approximation rate analysis of OGA for kernel estimation
Shallow neural network in practice
Point-wise kernel estimation by OGA
Numerical Results
Numerical results for one-dimensional problems
...and 18 more sections

Key Result

Lemma 1

Assume that the data set $\{f_j\}^{N}_{j=1}$ contains at least $m_f$ linear independent functions, say $\{f_{j_\ell}\}^{m_f}_{\ell=1}$ with $j_\ell \in \{1,\cdots,N\}$. Then, the space $\mathcal{K}$ makes a finite dimensional compact set and the following bilinear form on $\mathcal{G}$ defines an in

Figures (18)

Figure 1: Schematic of Green Learning
Figure 2: exact kernel $G$ and its approximant $\hat{G}$ with $h=0.1$
Figure 3: (a) Kernel function. (b) Slices of kernel function.
Figure 4: Result of 1D Poisson problem: (a) The training dynamics of OGA method. (b) Exact Green's function. (c) Learned Green's function. (d) Absolute error $E_G$.
Figure 5: Result of 1D Helmholtz problem: (a) The training dynamics of OGA method. (b) Exact Green's function. (c) Learned Green's function. (d) Absolute error $E_G$.
...and 13 more figures

Theorems & Definitions (13)

Lemma 1
proof
Lemma 2
proof
Lemma 3
Lemma 4
Remark 1
Theorem 1
proof
Lemma 5
...and 3 more

Orthogonal greedy algorithm for linear operator learning with shallow neural network

TL;DR

Abstract

Orthogonal greedy algorithm for linear operator learning with shallow neural network

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (18)

Theorems & Definitions (13)