Finite Element Representation Network (FERN) for Operator Learning with a Localized Trainable Basis
Zecheng Zhang, Hao Liu, Guosheng Fu, Hayden Schaeffer, Guang Lin
TL;DR
The paper addresses the inefficiency of global learnable bases in neural operators for PDE operator learning. It introduces the Finite Element Representation Network (FERN), which constructs adaptive, locally supported hat-function bases inside a shallow network using two-parameter ReLU representations, enabling exact basis assembly and adaptive refinement during training. Across seven PDE families, the approach achieves accuracy competitive with or superior to strong baselines such as DeepONet and POD, while substantially reducing the number of trainable parameters and showing clear basis adaptivity to localized features such as bumps and shocks. This work bridges finite element concepts with neural operator learning, offering an interpretable and efficient pathway for solving parametrized PDEs with localized phenomena.
Abstract
We propose an operator learning framework based on local finite element bases for solving partial differential equations (PDEs). Operator learning aims to approximate mappings from input functions to output functions, where the latter are typically represented using basis functions. Non-learnable bases reduce training costs, whereas learnable bases offer greater flexibility but often require deep network architectures with a large number of trainable parameters. Existing approaches typically rely on deep global bases; however, many PDE solutions exhibit local behaviors such as shocks and sharp gradients, and in parametrized PDE settings these localized features may appear in different regions of the domain across training and testing samples. Motivated by the use of local bases in finite element methods (FEM) for function approximation, we develop a shallow neural network architecture that constructs adaptive FEM bases. By adopting suitable activation functions, such as ReLU, the FEM bases can be assembled exactly within the network, introducing no additional approximation error in the basis construction process. This design enables the learning procedure to naturally mimic the adaptive refinement mechanism of FEM, allowing the network to discover basis functions tailored to intrinsic solution features such as shocks. The proposed learnable adaptive bases are then employed to represent the solution (output function) of the PDE. This framework reduces the number of trainable parameters while maintaining high approximation accuracy, effectively combining the adaptivity of FEM with the expressive power of operator learning. We validate the proposed method on seven families of PDEs with diverse characteristics, demonstrating its accuracy, efficiency, and robustness.
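The claim that ReLU activations can assemble FEM hat functions exactly can be illustrated with a small sketch. It assumes a symmetric 1D hat function described by two parameters, a center and a half-width; this parametrization and the names `hat` and `interpolate` are illustrative assumptions, not necessarily the construction used in the paper. In a trainable setting, the center and half-width would be the learnable quantities.

```python
def relu(z):
    """Rectified linear unit for a scalar input."""
    return max(z, 0.0)


def hat(x, center, half_width):
    """Symmetric hat function written exactly as a combination of three ReLUs.

    The function rises linearly from 0 at center - half_width to 1 at center,
    then falls back to 0 at center + half_width, and is 0 elsewhere.
    (Assumed parametrization: center and half_width.)
    """
    left = center - half_width
    right = center + half_width
    return (relu(x - left) - 2.0 * relu(x - center) + relu(x - right)) / half_width


def interpolate(x, nodes, values):
    """Piecewise-linear interpolant on uniformly spaced nodes.

    Each node carries one hat function whose half-width equals the node spacing,
    so the expansion sum_k values[k] * hat_k(x) reproduces the nodal values.
    """
    spacing = nodes[1] - nodes[0]
    return sum(v * hat(x, c, spacing) for c, v in zip(nodes, values))
```

For example, `hat(0.5, 0.5, 0.25)` evaluates the hat at its peak, and `interpolate(0.25, [0.0, 0.5, 1.0], [0.0, 1.0, 0.0])` evaluates a three-node expansion halfway up the rising edge of the middle hat. Because the hat is an exact ReLU combination rather than an approximation, the only error in such a representation comes from the choice of basis and coefficients, not from the basis construction itself.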
