tempoGAN: A Temporally Coherent, Volumetric GAN for Super-resolution Fluid Flow
You Xie, Aleksandra Franz, Mengyu Chu, Nils Thuerey
TL;DR
TempoGAN tackles extracting high-fidelity, temporally coherent 4D flow fields from a single low-resolution snapshot by integrating a conditional GAN with a novel temporal discriminator $D_t$. By incorporating physics-aware inputs such as velocity $\mathbf{v}$ and vorticity $\boldsymbol{\omega}$ and employing physics-aware data augmentation, it learns advection-driven details without requiring temporal sequences in the input, producing consistent $3D$ and $4D$ outputs efficiently via tiling. The method introduces a feature-space loss $\mathcal{L}_f$ and jointly optimizes a spatial discriminator $D_s$ and the temporal discriminator $D_t$ to promote realism and temporal coherence, achieving high-quality results in both 2D and 3D flows and offering artistic control through input conditioning. Overall, tempoGAN provides a general, scalable framework for physics-informed super-resolution of turbulent flows with potential extensions to other physical domains and to non-physical sequence data.
Abstract
We propose a temporally coherent generative model addressing the super-resolution problem for fluid flows. Our work represents a first approach to synthesize four-dimensional physics fields with neural networks. Based on a conditional generative adversarial network that is designed for the inference of three-dimensional volumetric data, our model generates consistent and detailed results by using a novel temporal discriminator, in addition to the commonly used spatial one. Our experiments show that the generator is able to infer more realistic high-resolution details by using additional physical quantities, such as low-resolution velocities or vorticities. Besides improvements in the training process and in the generated outputs, these inputs offer means for artistic control as well. We additionally employ a physics-aware data augmentation step, which is crucial to avoid overfitting and to reduce memory requirements. In this way, our network learns to generate advected quantities with highly detailed, realistic, and temporally coherent features. Our method works instantaneously, using only a single time-step of low-resolution fluid data. We demonstrate the abilities of our method using a variety of complex inputs and applications in two and three dimensions.
