Consistency requirement of data-driven subgrid-scale modeling in large-eddy simulation
Xinyi Huang, Sze Chai Leung, H. Jane Bae
TL;DR
This work investigates why data-driven subgrid-scale closures in large-eddy simulation often perform well in training (a priori) but poorly in actual LES runs (a posteriori). Using a DNS-aided LES framework, the authors quantify the numerical deviation $\delta_i$ arising from discretization and commutation and study two SGS closures: an eddy-viscosity and a complex nonlinear form. Training with $\delta_i$ generally reduces a priori accuracy but improves a posteriori energy spectra fidelity and stability, with the nonlinear closure benefiting most when $\delta_i$ is included; excluding the deviation can lead to severe a posteriori instability. The results emphasize that a physically grounded closure form, appropriate filter width, and explicit accounting for numerical deviation are essential for reliable, data-driven SGS modeling in LES, especially for anisotropic and non-equilibrium flows.
Abstract
Data-driven subgrid-scale (SGS) modeling in the large-eddy simulations (LES) suffers from the inconsistency between the \textit{a priori} tests and the a posteriori tests, which make training accurate SGS models a difficult task. We study the difference in filtered high-fidelity data and LES to identify the numerical deviation between the two cases, which is a combined impact of commutation error, numerical errors, and error coupling. The impact of the numerical deviation is examined through two SGS model formulations: the eddy-viscosity and the complex nonlinear models. By incorporating numerical deviations into model training, we enhance consistency, stabilize simulations, and improve predictions of energy spectra in a posteriori tests. Our findings highlight that data-driven methods introduce significant nonlinearity and equation coupling, exacerbating inconsistencies compared to non-data-driven approaches. Finally, while the impact of the numerical deviation can be generalized, achieving accurate model predictions necessitates a physically grounded model form and an optimal filter width.
