Table of Contents
Fetching ...

On the Shape of Latent Variables in a Denoising VAE-MoG: A Posterior Sampling-Based Study

Fernanda Zapata Bascuñán

TL;DR

This paper tackles whether a denoising VAE with a mixture-of-Gaussians prior learns a latent space that faithfully reflects the data-generating process when trained on gravitational-wave signals. It combines a CNN-based DVAE with a $D=256$ latent space and posterior sampling via Hamiltonian Monte Carlo conditioned on clean inputs, contrasted against encoder outputs from noisy data. Using a BGMM prior and KS tests across latent dimensions, the authors find a pronounced mismatch in latent-space structure despite effective denoising, highlighting the need for posterior-based validation in evaluating generative models for scientific data. The work underscores that reconstruction quality alone is insufficient to ensure latent interpretability and reliability, with implications for how such models are validated and deployed in gravitational-wave analysis and other noisy scientific domains.

Abstract

In this work, we explore the latent space of a denoising variational autoencoder with a mixture-of-Gaussians prior (VAE-MoG), trained on gravitational wave data from event GW150914. To evaluate how well the model captures the underlying structure, we use Hamiltonian Monte Carlo (HMC) to draw posterior samples conditioned on clean inputs, and compare them to the encoder's outputs from noisy data. Although the model reconstructs signals accurately, statistical comparisons reveal a clear mismatch in the latent space. This shows that strong denoising performance doesn't necessarily mean the latent representations are reliable highlighting the importance of using posterior-based validation when evaluating generative models.

On the Shape of Latent Variables in a Denoising VAE-MoG: A Posterior Sampling-Based Study

TL;DR

This paper tackles whether a denoising VAE with a mixture-of-Gaussians prior learns a latent space that faithfully reflects the data-generating process when trained on gravitational-wave signals. It combines a CNN-based DVAE with a latent space and posterior sampling via Hamiltonian Monte Carlo conditioned on clean inputs, contrasted against encoder outputs from noisy data. Using a BGMM prior and KS tests across latent dimensions, the authors find a pronounced mismatch in latent-space structure despite effective denoising, highlighting the need for posterior-based validation in evaluating generative models for scientific data. The work underscores that reconstruction quality alone is insufficient to ensure latent interpretability and reliability, with implications for how such models are validated and deployed in gravitational-wave analysis and other noisy scientific domains.

Abstract

In this work, we explore the latent space of a denoising variational autoencoder with a mixture-of-Gaussians prior (VAE-MoG), trained on gravitational wave data from event GW150914. To evaluate how well the model captures the underlying structure, we use Hamiltonian Monte Carlo (HMC) to draw posterior samples conditioned on clean inputs, and compare them to the encoder's outputs from noisy data. Although the model reconstructs signals accurately, statistical comparisons reveal a clear mismatch in the latent space. This shows that strong denoising performance doesn't necessarily mean the latent representations are reliable highlighting the importance of using posterior-based validation when evaluating generative models.

Paper Structure

This paper contains 11 sections, 7 figures.

Figures (7)

  • Figure 1: Template bank
  • Figure 2: Loss function, KL loss and total training loss of the denoising Convolutional VAE training.
  • Figure 3: Bayesian Gaussian Mixture Model fitted to $y_{\text{train}}$. The individual Gaussian components are shown, along with the overall estimated density in orange.
  • Figure 4: Histogram-based probability density function of the clean training targets $y_{\text{train}}$, serving as a visual reference of the empirical distribution.
  • Figure 5: Pearson correlation matrix between latent dimensions z extracted by the encoder. Most correlations are close to zero, suggesting relative independence between dimensions.
  • ...and 2 more figures