Towards Universal Neural Operators through Multiphysics Pretraining
Mikhail Masliaev, Dmitry Gusarov, Ilya Markov, Alexander Hvatov
TL;DR
This work addresses the high cost of training neural operators for PDE-based simulations by proposing a multiphysics pretraining framework. It combines transformer- and state-space–based neural operators with adapter-based transfer learning to enable cross-domain generalization across varying inputs and physics. The main contributions include a modular pretraining/fine-tuning scheme, demonstrations on multiple PDEs (e.g., Burgers', Gray-Scott, Navier–Stokes), and evidence that pretraining on heterogeneous multiphysics data improves generalization while reducing fine-tuning costs. This approach pushes toward a foundational PDE dynamics model with practical benefits for rapid deployment in new multiphysics scenarios and lays groundwork for future data augmentation and symmetry-based enhancements.
Abstract
Although neural operators are widely used in data-driven physical simulations, their training remains computationally expensive. Recent advances address this issue via downstream learning, where a model pretrained on simpler problems is fine-tuned on more complex ones. In this research, we investigate transformer-based neural operators, which have previously been applied only to specific problems, in a more general transfer learning setting. We evaluate their performance across diverse PDE problems, including extrapolation to unseen parameters, incorporation of new variables, and transfer from multi-equation datasets. Our results demonstrate that advanced neural operator architectures can effectively transfer knowledge across PDE problems.
