Parametric Encoding with Attention and Convolution Mitigate Spectral Bias of Neural Partial Differential Equation Solvers
Mehdi Shishehbor, Shirin Hosseinmardi, Ramin Bostanabad
TL;DR
PGCAN addresses spectral bias in neural PDE solvers by introducing a Parametric Grid Convolutional Attention Network that uses a trainable grid-based encoder, local convolution to propagate boundary information, and a transformer-style decoder with attention. It demonstrates lower relative $L_2$ error $L_2^{re}$ than baseline methods across Burgers’, Convection, Helmholtz, and lid-driven cavity problems, with gains growing with problem complexity. The paper also introduces a directional PSD-based metric to quantify spectral bias and reports flatter PSDs for PGCAN errors compared with baselines, indicating improved frequency learning. Limitations include a uniform grid partitioning, with proposed future work on adaptive domain decomposition and extension to irregular/higher-dimensional domains.
Abstract
Deep neural networks (DNNs) are increasingly used to solve partial differential equations (PDEs) that naturally arise while modeling a wide range of systems and physical phenomena. However, the accuracy of such DNNs decreases as the PDE complexity increases and they also suffer from spectral bias as they tend to learn the low-frequency solution characteristics. To address these issues, we introduce Parametric Grid Convolutional Attention Networks (PGCANs) that can solve PDE systems without leveraging any labeled data in the domain. The main idea of PGCAN is to parameterize the input space with a grid-based encoder whose parameters are connected to the output via a DNN decoder that leverages attention to prioritize feature training. Our encoder provides a localized learning ability and uses convolution layers to avoid overfitting and improve information propagation rate from the boundaries to the interior of the domain. We test the performance of PGCAN on a wide range of PDE systems and show that it effectively addresses spectral bias and provides more accurate solutions compared to competing methods.
