Physically Interpretable Representation Learning with Gaussian Mixture Variational AutoEncoder (GM-VAE)
Tiffany Fan, Murray Cutforth, Marta D'Elia, Alexandre Cortiella, Alireza Doostan, Eric Darve
TL;DR
The paper tackles the challenge of extracting physically meaningful, low-dimensional representations from high-dimensional turbulent and combustion data. It introduces a Gaussian Mixture Variational Autoencoder trained with an EM-inspired block-coordinate descent, paired with a graph-Laplacian based interpretability metric to ensure latent coordinates vary smoothly with key physical quantities. The approach yields latent spaces where regimes align with distinct physical states, demonstrated across surface-reaction bifurcations, wake flows, and Schlieren ignition images, and outperforms standard nonlinear DR methods in physical interpretability. This framework enables both regime discovery and generative sampling within physically coherent states, offering a robust tool for data-driven discovery in turbulent and reactive systems. Limitations include computational complexity, sensitivity to initialization, and the Gaussian-regime assumption, with future work targeting multi-modal integration and product-of-experts fusion to broaden applicability.
Abstract
Extracting compact, physically interpretable representations from high-dimensional scientific data is a persistent challenge due to the complex, nonlinear structures inherent in physical systems. We propose a Gaussian Mixture Variational Autoencoder (GM-VAE) framework designed to address this by integrating an Expectation-Maximization (EM)-inspired training scheme with a novel spectral interpretability metric. Unlike conventional VAEs that jointly optimize reconstruction and clustering (often leading to training instability), our method utilizes a block-coordinate descent strategy, alternating between expectation and maximization steps. This approach stabilizes training and naturally aligns latent clusters with distinct physical regimes. To objectively evaluate the learned representations, we introduce a quantitative metric based on graph-Laplacian smoothness, which measures the coherence of physical quantities across the latent manifold. We demonstrate the efficacy of this framework on datasets of increasing complexity: surface reaction ODEs, Navier-Stokes wake flows, and experimental laser-induced combustion Schlieren images. The results show that our GM-VAE yields smooth, physically consistent manifolds and accurate regime clustering, offering a robust data-driven tool for interpreting turbulent and reactive flow systems.
