CFDagent: A Language-Guided, Zero-Shot Multi-Agent System for Complex Flow Simulation
Zhaoyue Xu, Long Wang, Chunyu Wang, Yixin Chen, Qingyong Luo, Hua-Dong Yao, Shizhao Wang, Guowei He
TL;DR
CFDagent introduces a zero-shot, language-guided, end-to-end CFD framework that couples three GPT-4o–driven agents to generate geometry, configure and run an immersed boundary flow solver, and postprocess results from natural-language prompts. The system leverages Point-E for text/image–to–geometry generation, a marching cubes–based mesh pipeline, and an IB solver on parallel grids to simulate flows around arbitrary geometries, validated against canonical sphere benchmarks at $Re=100$ and $Re=300$ with $C_d$ and $C_l$ in agreement with literature. It demonstrates robust generalization to real-world objects described textually or via images, producing multimodal visualizations, vortex structures via the $Q$-criterion, and even GPT-4o–generated synthesis of visualizations. The work highlights the potential to lower barriers to expert CFD by integrating generative AI with high-fidelity simulations, enabling education, research, and engineering workflows, while acknowledging limitations in geometry quality and computational efficiency and outlining future extensions to moving/deforming objects and smarter visualizations.
Abstract
We introduce CFDagent, a zero-shot, multi-agent system that enables fully autonomous computational fluid dynamics (CFD) simulations from natural language prompts. CFDagent integrates three specialized LLM-driven agents: (i) the Preprocessing Agent that generates 3D geometries from textual or visual inputs using a hybrid text-to-3D diffusion model (Point-E) and automatically meshes the geometries; (ii) the Solver Agent that configures and executes an immersed boundary flow solver; and (iii) the Postprocessing Agent that analyzes and visualizes the results, including multimodal renderings. These agents are interactively guided by GPT-4o via conversational prompts, enabling intuitive and user-friendly interaction. We validate CFDagent by reproducing canonical sphere flows at Reynolds numbers of 100 and 300 using three distinct inputs: a simple text prompt (i.e., "sphere"), an image-based input, and a standard sphere model. The computed drag and lift coefficients from meshes produced by each input approach closely match available data. The proposed system enables synthesization of flow simulations and photorealistic visualizations for complex geometries. Through extensive tests on canonical and realistic scenarios, we demonstrate the robustness, versatility, and practical applicability of CFDagent. By bridging generative AI with high-fidelity simulations, CFDagent significantly lowers barriers to expert-level CFD, unlocking broad opportunities in education, scientific research, and practical engineering applications.
