ED-VAE: Entropy Decomposition of ELBO in Variational Autoencoders
Fotios Lygerakis, Elmar Rueckert
TL;DR
This work targets limitations of the standard ELBO in VAEs when priors are nonanalytic or unknown. It introduces ED-VAE, a reformulation that explicitly decomposes the ELBO into reconstruction, mutual information, and a marginal KL term split into entropy and cross-entropy $H[q(z)]$ and $H[q(z), p(z)]$, enabling flexible priors. Mutual information is bounded via InfoNCE to encourage informative latent codes, and the total loss combines reconstruction, InfoNCE, entropy, and cross-entropy terms. Experiments on synthetic datasets with Gaussian and complex non-Gaussian priors demonstrate that ED-VAE achieves higher ELBO and better prior alignment, improving reconstruction quality and latent space interpretability, highlighting its potential for adaptable generative modeling with domain-specific priors.
Abstract
Traditional Variational Autoencoders (VAEs) are constrained by the limitations of the Evidence Lower Bound (ELBO) formulation, particularly when utilizing simplistic, non-analytic, or unknown prior distributions. These limitations inhibit the VAE's ability to generate high-quality samples and provide clear, interpretable latent representations. This work introduces the Entropy Decomposed Variational Autoencoder (ED-VAE), a novel re-formulation of the ELBO that explicitly includes entropy and cross-entropy components. This reformulation significantly enhances model flexibility, allowing for the integration of complex and non-standard priors. By providing more detailed control over the encoding and regularization of latent spaces, ED-VAE not only improves interpretability but also effectively captures the complex interactions between latent variables and observed data, thus leading to better generative performance.
