Comparison of Generative Learning Methods for Turbulence Surrogates
Claudia Drygala, Edmund Ross, Francesca di Mare, Hanno Gottschalk
TL;DR
The paper tackles the challenge of efficiently representing turbulent flows by benchmarking three generative models—$VAE$, $DCGAN$, and $DDPM$—as surrogates for turbulence. It applies these models to 2D wake flows from a circular cylinder, using high-fidelity LES data and experimental PIV data to assess statistical fidelity and spatial structure preservation. The key finding is that both $DCGAN$ and $DDPM$ can reproduce LES-like flow fields, with $DCGAN$ offering superior data efficiency, faster training and inference, and closer alignment with input streams, while $VAE$ underperforms and $DDPM$ incurs higher computational costs. This work demonstrates the practicality of GAN-based turbulence surrogates for rapid data generation and uncertainty quantification, and it suggests diffusion models as a slower but still viable alternative, potentially extendable to broader experimental datasets and higher Reynolds numbers.
Abstract
Numerical simulations of turbulent flows present significant challenges in fluid dynamics due to their complexity and high computational cost. High resolution techniques such as Direct Numerical Simulation (DNS) and Large Eddy Simulation (LES) are generally not computationally affordable, particularly for technologically relevant problems. Recent advances in machine learning, specifically in generative probabilistic models, offer promising alternatives as surrogates for turbulence. This paper investigates the application of three generative models - Variational Autoencoders (VAE), Deep Convolutional Generative Adversarial Networks (DCGAN), and Denoising Diffusion Probabilistic Models (DDPM) - in simulating a von Kármán vortex street around a fixed cylinder projected into 2D, as well as a real-world experimental dataset of the wake flow of a cylinder array. Training data was obtained by means of LES in the simulated case and Particle Image Velocimetry (PIV) in the experimental case. We evaluate each model's ability to capture the statistical properties and spatial structures of the turbulent flow. Our results demonstrate that DDPM and DCGAN effectively replicate all flow distributions, highlighting their potential as efficient and accurate tools for turbulence surrogacy. We find a strong argument for DCGAN, as although they are more difficult to train (due to problems such as mode collapse), they show the fastest inference and training time, require less data to train compared to VAE and DDPM, and provide the results most closely aligned with the input stream. In contrast, VAE train quickly (and can generate samples quickly) but do not produce adequate results, and DDPM, whilst effective, are significantly slower at both, inference and training time.
