Neural Spectral Methods: Self-supervised learning in the spectral domain
Yiheng Du, Nithin Chalapathi, Aditi Krishnapriyan
TL;DR
Neural Spectral Methods (NSM) address the challenge of solving parametric PDEs by learning solution mappings directly in a spectral basis and optimizing a spectral residual loss. By formulating the PDE residual in the spectral domain and applying Parseval's identity, NSM achieves exact residual computation and a fixed computational cost that does not scale with grid resolution, enabling constant-time inference. The framework introduces a spectral neural operator with activations at collocation points and a diagonalized kernel in the spectral domain, which together reduce aliasing and backpropagation complexity relative to grid-based PINN approaches. Empirically, NSM delivers 100x speedups in training and 500x in inference while surpassing grid-based PINN-based methods in accuracy across Poisson, Reaction-Diffusion, and Navier–Stokes problems, demonstrating substantial practical gains for parametric PDE solving in smooth regimes and moderate dimensions.
Abstract
We present Neural Spectral Methods, a technique to solve parametric Partial Differential Equations (PDEs), grounded in classical spectral methods. Our method uses orthogonal bases to learn PDE solutions as mappings between spectral coefficients. In contrast to current machine learning approaches which enforce PDE constraints by minimizing the numerical quadrature of the residuals in the spatiotemporal domain, we leverage Parseval's identity and introduce a new training strategy through a \textit{spectral loss}. Our spectral loss enables more efficient differentiation through the neural network, and substantially reduces training complexity. At inference time, the computational cost of our method remains constant, regardless of the spatiotemporal resolution of the domain. Our experimental results demonstrate that our method significantly outperforms previous machine learning approaches in terms of speed and accuracy by one to two orders of magnitude on multiple different problems. When compared to numerical solvers of the same accuracy, our method demonstrates a $10\times$ increase in performance speed.
