Neural Interpretable PDEs: Harmonizing Fourier Insights with Attention for Scalable and Interpretable Physics Discovery
Ning Liu, Yue Yu
TL;DR
The paper addresses the challenge of learning forward PDE solutions and inverse PDE parameters in ill-posed settings while preserving interpretability. It introduces Neural Interpretable PDEs (NIPS), which combine linear attention with Fourier-domain spectral convolution to form a data-dependent kernel map that jointly solves forward and inverse PDE tasks across multiple systems, enabling zero-shot generalization. Key contributions include the data-dependent kernel map $K[\mathbf{u}_{1:d},\mathbf{f}_{1:d}]$, a linear-attention reformulation to reduce computation, and Fourier kernel learning that facilitates scalable, interpretable physics discovery with strong empirical gains over NAO and AFNO baselines in Darcy Flow, MMNIST, and synthetic tissue tasks. The approach enables scalable physics discovery with interpretable kernels and zero-shot generalization, and the authors provide open-source code to support reproducibility and broader adoption across physics domains.
Abstract
Attention mechanisms have emerged as transformative tools in core AI domains such as natural language processing and computer vision. Yet, their largely untapped potential for modeling intricate physical systems presents a compelling frontier. Learning such systems often entails discovering operators that map between functional spaces using limited instances of function pairs -- a task commonly framed as a severely ill-posed inverse PDE problem. In this work, we introduce Neural Interpretable PDEs (NIPS), a novel neural operator architecture that builds upon and enhances Nonlocal Attention Operators (NAO) in both predictive accuracy and computational efficiency. NIPS employs a linear attention mechanism to enable scalable learning and integrates a learnable kernel network that acts as a channel-independent convolution in Fourier space. As a consequence, NIPS eliminates the need to explicitly compute and store large pairwise interactions, effectively amortizing the cost of handling spatial interactions into the Fourier transform. Empirical evaluations demonstrate that NIPS consistently surpasses NAO and other baselines across diverse benchmarks, heralding a substantial leap in scalable, interpretable, and efficient physics learning. Our code and data accompanying this paper are available at https://github.com/fishmoon1234/Nonlocal-Attention-Operator.
