Masked Autoencoders are PDE Learners
Anthony Zhou, Amir Barati Farimani
TL;DR
This work applies masked autoencoder pretraining to a diverse set of 1D and 2D PDEs to learn latent physics representations without labeled data. The MAE encoder captures meaningful structure across coefficients, boundary conditions, and discretizations, enabling downstream PDE feature prediction and conditioning of neural solvers for time-stepping and super-resolution. Results show latent structure aligns with PDE properties and improves downstream performance within the pretraining distribution, though extrapolation to unseen equations remains challenging. The approach offers a scalable path toward unified latent physics representations from heterogeneous, unlabeled PDE data and suggests directions for scaling and latent arithmetic in physics-informed learning.
Abstract
Neural solvers for partial differential equations (PDEs) have great potential to generate fast and accurate physics solutions, yet their practicality is currently limited by their generalizability. PDEs evolve over broad scales and exhibit diverse behaviors; predicting these phenomena will require learning representations across a wide variety of inputs which may encompass different coefficients, boundary conditions, resolutions, or even equations. As a step towards generalizable PDE modeling, we adapt masked pretraining for physics problems. Through self-supervised learning across PDEs, masked autoencoders can consolidate heterogeneous physics to learn rich latent representations. We show that learned representations can generalize to a limited set of unseen equations or parameters and are meaningful enough to regress PDE coefficients or the classify PDE features. Furthermore, conditioning neural solvers on learned latent representations can improve time-stepping and super-resolution performance across a variety of coefficients, discretizations, or boundary conditions, as well as on certain unseen PDEs. We hope that masked pretraining can emerge as a unifying method across large, unlabeled, and heterogeneous datasets to learn latent physics at scale.
