Fine-tuning a Large Language Model for Automating Computational Fluid Dynamics Simulations
Zhehao Dong, Zhen Lu, Yue Yang
TL;DR
This work tackles the barrier to CFD adoption by domain-specifically adapting a compact LLM to automate OpenFOAM setup from natural language. By fine-tuning Qwen2.5-7B-Instruct with LoRA on NL2FOAM, and deploying a four-agent workflow, the approach translates problem descriptions into executable CFD configurations with high reliability. On a 21-case incompressible-flow benchmark, it achieves 88.7% accuracy and 82.6% pass@1 while requiring fewer correction iterations than larger general-purpose models, illustrating the value of domain-specific adaptation for engineering automation. The results suggest a practical path toward democratizing CFD tooling, while pointing to extensions toward more complex flow regimes and larger-scale models in future work.
Abstract
Configuring computational fluid dynamics (CFD) simulations typically demands extensive domain expertise, limiting broader access. Although large language models (LLMs) have advanced scientific computing, their use in automating CFD workflows is underdeveloped. We introduce a novel approach centered on domain-specific LLM adaptation. By fine-tuning Qwen2.5-7B-Instruct on NL2FOAM, our custom dataset of 28716 natural language-to-OpenFOAM configuration pairs with chain-of-thought (CoT) annotations, we enable direct translation from natural language descriptions to executable CFD setups. A multi-agent framework orchestrates the process, autonomously verifying inputs, generating configurations, running simulations, and correcting errors. Evaluation on a benchmark of 21 diverse flow cases demonstrates state-of-the-art performance, achieving 88.7% solution accuracy and 82.6% first-attempt success rate. This significantly outperforms larger general-purpose models like Qwen2.5-72B-Instruct, DeepSeek-R1, and Llama3.3-70B-Instruct, while also requiring fewer correction iterations and maintaining high computational efficiency. The results highlight the critical role of domain-specific adaptation in deploying LLM assistants for complex engineering workflows. Our code and fine-tuned model have been deposited at https://github.com/YYgroup/AutoCFD.
