Geometry Matters: Benchmarking Scientific ML Approaches for Flow Prediction around Complex Geometries
Ali Rabeh, Ethan Herron, Aditya Balu, Soumik Sarkar, Chinmay Hegde, Adarsh Krishnamurthy, Baskar Ganapathysubramanian
TL;DR
This work systematically benchmarks SciML approaches for predicting fluid flow around complex geometries using FlowBench's 2D lid-driven cavity data. By comparing neural operators and vision-transformer foundation models across Signed Distance Field (SDF) and binary mask geometry representations, it develops a unified scoring framework over global accuracy, boundary fidelity, and PDE consistency. The study shows that newer foundation models outperform neural operators in data-limited settings, while SDF representations offer advantages with sufficient data; however, out-of-distribution generalization remains a major challenge. The results highlight that model architecture and geometric representation jointly govern performance, with scOT and Poseidon typically delivering the strongest results across tasks, especially when data are scarce, and DeepONet excelling in PDE-consistency metrics. The work also delineates data-efficiency trends, boundary-layer accuracy, and residual behavior, offering actionable insights for deploying SciML solvers around complex geometries and motivating future physics-informed and multiphysics extensions.
Abstract
Rapid and accurate simulations of fluid dynamics around complicated geometric bodies are critical in a variety of engineering and scientific applications, including aerodynamics and biomedical flows. However, while scientific machine learning (SciML) has shown considerable promise, most studies in this field are limited to simple geometries, and complex, real-world scenarios are underexplored. This paper addresses this gap by benchmarking diverse SciML models, including neural operators and vision transformer-based foundation models, for fluid flow prediction over intricate geometries. Using a high-fidelity dataset of steady-state flows across various geometries, we evaluate the impact of geometric representations -- Signed Distance Fields (SDF) and binary masks -- on model accuracy, scalability, and generalization. Central to this effort is the introduction of a novel, unified scoring framework that integrates metrics for global accuracy, boundary layer fidelity, and physical consistency to enable a robust, comparative evaluation of model performance. Our findings demonstrate that newer foundation models significantly outperform neural operators, particularly in data-limited scenarios, and that SDF representations yield superior results with sufficient training data. Despite these promises, all models struggle with out-of-distribution generalization, highlighting a critical challenge for future SciML applications. By advancing both evaluation models and modeling capabilities, our work paves the way for robust and scalable ML solutions for fluid dynamics across complex geometries.
