CarBench: A Comprehensive Benchmark for Neural Surrogates on High-Fidelity 3D Car Aerodynamics
Mohamed Elrefaie, Dule Shu, Matt Klenk, Faez Ahmed
TL;DR
CarBench delivers a unified, open benchmark for neural surrogates of high-fidelity 3D car aerodynamics using the DrivAerNet++ dataset, evaluating 11 architectures across interpolation and cross-category generalization under a standardized pipeline. It introduces bootstrap-based uncertainty quantification, full-mesh evaluation, and component-level tests (e.g., wheel aerodynamics) to ensure physically grounded assessment. The results show transformer-based solvers (notably AB-UPT and Transolver variants) achieving the best accuracy–efficiency trade-offs, with dataset scale and geometric diversity being critical for robust zero-shot generalization. By providing standardized data processing, evaluation protocols, and open-source tooling, CarBench aims to catalyze reproducible progress in data-driven CFD surrogates for automotive and related external-aero applications.
Abstract
Benchmarking has been the cornerstone of progress in computer vision, natural language processing, and the broader deep learning domain, driving algorithmic innovation through standardized datasets and reproducible evaluation protocols. The growing availability of large-scale Computational Fluid Dynamics (CFD) datasets has opened new opportunities for applying machine learning to aerodynamic and engineering design. Yet, despite this progress, there exists no standardized benchmark for large-scale numerical simulations in engineering design. In this work, we introduce CarBench, the first comprehensive benchmark dedicated to large-scale 3D car aerodynamics, performing a large-scale evaluation of state-of-the-art models on DrivAerNet++, the largest public dataset for automotive aerodynamics, containing over 8,000 high-fidelity car simulations. We assess eleven architectures spanning neural operator methods (e.g., Fourier Neural Operator), geometric deep learning (PointNet, RegDGCNN, PointMAE, PointTransformer), transformer-based neural solvers (Transolver, Transolver++, AB-UPT), and implicit field networks (TripNet). Beyond standard interpolation tasks, we perform cross-category experiments in which transformer-based solvers trained on a single car archetype are evaluated on unseen categories. Our analysis covers predictive accuracy, physical consistency, computational efficiency, and statistical uncertainty. To accelerate progress in data-driven engineering, we open-source the benchmark framework, including training pipelines, uncertainty estimation routines based on bootstrap resampling, and pretrained model weights, establishing the first reproducible foundation for large-scale learning from high-fidelity CFD simulations, available at https://github.com/Mohamedelrefaie/CarBench.
