FLUID-LLM: Learning Computational Fluid Dynamics with Spatiotemporal-aware Large Language Models
Max Zhu, Adrián Bazaga, Pietro Liò
TL;DR
FLUID-LLM tackles the computational burden of CFD by marrying pre-trained large language models with spatiotemporal encodings to forecast unsteady fluid flows. The framework tokenizes and embeds patch-based fluid states, augments them with learned spatiotemporal context, and uses an LLM to produce next-state embeddings that a grid-based GNN decoder converts into state updates via $\hat{s}_{t+1} = s_t + f_{\text{dec}}(e^{\text{out}}_t)$. Across Cylinder and Airfoil benchmarks, FLUID-LLM with larger LLMs achieves superior long-horizon accuracy, and experiments demonstrate in-context learning and few-shot adaptation to novel dynamics. This indicates a practical path toward efficient CFD with language-model priors and flexible spatiotemporal reasoning.
Abstract
Learning computational fluid dynamics (CFD) traditionally relies on computationally intensive simulations of the Navier-Stokes equations. Recently, large language models (LLMs) have shown remarkable pattern recognition and reasoning abilities in natural language processing (NLP) and computer vision (CV). However, these models struggle with the complex geometries inherent in fluid dynamics. We introduce FLUID-LLM, a novel framework combining pre-trained LLMs with spatiotemporal-aware encoding to predict unsteady fluid dynamics. Our approach leverages the temporal autoregressive abilities of LLMs alongside spatial-aware layers, bridging the gap between previous CFD prediction methods. Evaluations on standard benchmarks reveal significant performance improvements across various fluid datasets. Our results demonstrate that FLUID-LLM effectively integrates spatiotemporal information into pre-trained LLMs, enhancing CFD task performance.
