Text2PDE: Latent Diffusion Models for Accessible Physics Simulation
Anthony Zhou, Zijie Li, Michael Schneier, John R Buchanan, Amir Barati Farimani
TL;DR
Text2PDE introduces a latent diffusion framework for physics simulation that compresses irregular PDE data with a mesh aware autoencoder and generates full spatio temporal rollouts conditioned on initial frames or language prompts. By performing diffusion in a latent space, the approach mitigates autoregressive error accumulation and supports discretization invariant decoding onto arbitrary meshes. Conditioning modalities, including text prompts via pretrained transformers, offer interpretable and compact interfaces for engineering design while maintaining physical fidelity. Across cylinder flow, buoyancy driven flow, and 3D turbulence, the method achieves competitive accuracy and scalable performance up to near 3 billion parameters, highlighting the practicality of diffusion surrogates for neural PDE solvers and their potential usability improvements.
Abstract
Recent advances in deep learning have inspired numerous works on data-driven solutions to partial differential equation (PDE) problems. These neural PDE solvers can often be much faster than their numerical counterparts; however, each presents its unique limitations and generally balances training cost, numerical accuracy, and ease of applicability to different problem setups. To address these limitations, we introduce several methods to apply latent diffusion models to physics simulation. Firstly, we introduce a mesh autoencoder to compress arbitrarily discretized PDE data, allowing for efficient diffusion training across various physics. Furthermore, we investigate full spatio-temporal solution generation to mitigate autoregressive error accumulation. Lastly, we investigate conditioning on initial physical quantities, as well as conditioning solely on a text prompt to introduce text2PDE generation. We show that language can be a compact, interpretable, and accurate modality for generating physics simulations, paving the way for more usable and accessible PDE solvers. Through experiments on both uniform and structured grids, we show that the proposed approach is competitive with current neural PDE solvers in both accuracy and efficiency, with promising scaling behavior up to $\sim$3 billion parameters. By introducing a scalable, accurate, and usable physics simulator, we hope to bring neural PDE solvers closer to practical use.
