Leveraging Scale Separation and Stochastic Closure for Data-Driven Prediction of Chaotic Dynamics
Ismaël Zighed, Nicolas Thome, Patrick Gallinari, Taraneh Sayadi
TL;DR
This work tackles turbulence forecasting by decoupling scale dynamics into large-scale, stochastic evolution and small-scale closure. It introduces a two-task framework: (i) a probabilistic reduced-order model that learns large-scale dynamics in a latent space via a Variational Autoencoder and Transformer, producing ensembles that capture predictive uncertainty, and (ii) a Gaussian-process closure that maps reduced-space representations to full-resolution fields within a POD basis, yielding statistically consistent reconstructions. Applied to Kolmogorov flow, the approach achieves accurate first- and second-moment statistics, competitive uncertainty calibration (PICP/CRPS), and stable long-horizon PDFs, outperforming VAE and diffusion baselines while remaining computationally efficient. The modular, plug-and-play architecture enables real-time closure and broad applicability to multiscale turbulence problems, offering a principled data-driven path to robust, probabilistic turbulence prediction and control.
Abstract
Simulating turbulent fluid flows is a computationally prohibitive task, as it requires the resolution of fine-scale structures and the capture of complex nonlinear interactions across multiple scales. This is particularly the case in direct numerical simulation (DNS) applied to real-world turbulent applications. Consequently, extensive research has focused on analysing turbulent flows from a data-driven perspective. However, due to the complex and chaotic nature of these systems, traditional models often become unstable as they accumulate errors through autoregression, severely degrading even short-term predictions. To overcome these limitations, we propose a purely stochastic approach that separately addresses the evolution of large-scale coherent structures and the closure of high-fidelity statistical data. To this end, the dynamics of the filtered data (i.e. coherent motion) are learnt using an autoregressive model. This combines a VAE and Transformer architecture. The VAE projection is probabilistic, ensuring consistency between the model's stochasticity and the flow's statistical properties. To recover high-fidelity velocity fields from the filtered latent space, Gaussian Process (GP) regression is employed. This strategy has been tested in the context of a Kolmogorov flow exhibiting chaotic behaviour analogous to real-world turbulence. We compare the performance of our model with state-of-the-art probabilistic baselines, including a VAE and a diffusion model. We demonstrate that our Gaussian process-based closure outperforms these baselines in capturing first and second moment statistics in this particular test bed, providing robust and adaptive confidence intervals.
