Synthetic Lagrangian Turbulence by Generative Diffusion Models
Tianyi Li, Luca Biferale, Fabio Bonaccorso, Martino Andrea Scarpolini, Michele Buzzicotti
TL;DR
This work tackles the challenge of generating faithful 3D Lagrangian trajectories at high Reynolds numbers without relying on exhaustive DNS or experiments. It introduces diffusion-model generators DM-1c (single velocity component) and DM-3c (three correlated components), trained on DNS-based HIT data at $R_\lambda\simeq310$, achieving accurate reproduction of multiscale statistics, including fat-tailed velocity increments and accelerations, structure functions up to order $p=8$, and scale-by-scale exponents ${\zeta(p,\tau)}$ across inertial and dissipative ranges. The models also capture enhanced intermittency near the dissipative time ${\tau_\eta}$ and demonstrate strong generalization to extreme events beyond training data, enabling high-quality synthetic datasets for downstream applications. This diffusion-based framework offers a scalable route to abundant, physically consistent Lagrangian data and provides interpretability opportunities through the progressive multiscale denoising process and potential conditioning on flow configurations.
Abstract
Lagrangian turbulence lies at the core of numerous applied and fundamental problems related to the physics of dispersion and mixing in engineering, bio-fluids, atmosphere, oceans, and astrophysics. Despite exceptional theoretical, numerical, and experimental efforts conducted over the past thirty years, no existing models are capable of faithfully reproducing statistical and topological properties exhibited by particle trajectories in turbulence. We propose a machine learning approach, based on a state-of-the-art diffusion model, to generate single-particle trajectories in three-dimensional turbulence at high Reynolds numbers, thereby bypassing the need for direct numerical simulations or experiments to obtain reliable Lagrangian data. Our model demonstrates the ability to reproduce most statistical benchmarks across time scales, including the fat-tail distribution for velocity increments, the anomalous power law, and the increased intermittency around the dissipative scale. Slight deviations are observed below the dissipative scale, particularly in the acceleration and flatness statistics. Surprisingly, the model exhibits strong generalizability for extreme events, producing events of higher intensity and rarity that still match the realistic statistics. This paves the way for producing synthetic high-quality datasets for pre-training various downstream applications of Lagrangian turbulence.
