Transformer for Partial Differential Equations' Operator Learning
Zijie Li, Kazem Meidani, Amir Barati Farimani
TL;DR
The paper addresses learning solution operators for PDEs with variable discretizations. It introduces OFormer, an attention-based Transformer that uses cross-attention to query outputs at arbitrary locations and a latent-space time-marching scheme to evolve dynamics, enabling discretization-invariant operator learning. The approach achieves competitive results on standard PDE benchmarks and demonstrates robustness to irregular grids, while revealing meaningful latent structures that correlate with system parameters like viscosity. The work advances flexible, data-driven PDE solvers capable of handling diverse sampling patterns without re-training for new discretizations.
Abstract
Data-driven learning of partial differential equations' solution operators has recently emerged as a promising paradigm for approximating the underlying solutions. The solution operators are usually parameterized by deep learning models that are built upon problem-specific inductive biases. An example is a convolutional or a graph neural network that exploits the local grid structure where functions' values are sampled. The attention mechanism, on the other hand, provides a flexible way to implicitly exploit the patterns within inputs, and furthermore, relationship between arbitrary query locations and inputs. In this work, we present an attention-based framework for data-driven operator learning, which we term Operator Transformer (OFormer). Our framework is built upon self-attention, cross-attention, and a set of point-wise multilayer perceptrons (MLPs), and thus it makes few assumptions on the sampling pattern of the input function or query locations. We show that the proposed framework is competitive on standard benchmark problems and can flexibly be adapted to randomly sampled input.
