ED-VAE: Entropy Decomposition of ELBO in Variational Autoencoders

Fotios Lygerakis; Elmar Rueckert

ED-VAE: Entropy Decomposition of ELBO in Variational Autoencoders

Fotios Lygerakis, Elmar Rueckert

TL;DR

This work targets limitations of the standard ELBO in VAEs when priors are nonanalytic or unknown. It introduces ED-VAE, a reformulation that explicitly decomposes the ELBO into reconstruction, mutual information, and a marginal KL term split into entropy and cross-entropy $H[q(z)]$ and $H[q(z), p(z)]$, enabling flexible priors. Mutual information is bounded via InfoNCE to encourage informative latent codes, and the total loss combines reconstruction, InfoNCE, entropy, and cross-entropy terms. Experiments on synthetic datasets with Gaussian and complex non-Gaussian priors demonstrate that ED-VAE achieves higher ELBO and better prior alignment, improving reconstruction quality and latent space interpretability, highlighting its potential for adaptable generative modeling with domain-specific priors.

Abstract

Traditional Variational Autoencoders (VAEs) are constrained by the limitations of the Evidence Lower Bound (ELBO) formulation, particularly when utilizing simplistic, non-analytic, or unknown prior distributions. These limitations inhibit the VAE's ability to generate high-quality samples and provide clear, interpretable latent representations. This work introduces the Entropy Decomposed Variational Autoencoder (ED-VAE), a novel re-formulation of the ELBO that explicitly includes entropy and cross-entropy components. This reformulation significantly enhances model flexibility, allowing for the integration of complex and non-standard priors. By providing more detailed control over the encoding and regularization of latent spaces, ED-VAE not only improves interpretability but also effectively captures the complex interactions between latent variables and observed data, thus leading to better generative performance.

ED-VAE: Entropy Decomposition of ELBO in Variational Autoencoders

TL;DR

and

, enabling flexible priors. Mutual information is bounded via InfoNCE to encourage informative latent codes, and the total loss combines reconstruction, InfoNCE, entropy, and cross-entropy terms. Experiments on synthetic datasets with Gaussian and complex non-Gaussian priors demonstrate that ED-VAE achieves higher ELBO and better prior alignment, improving reconstruction quality and latent space interpretability, highlighting its potential for adaptable generative modeling with domain-specific priors.

Abstract

Paper Structure (20 sections, 13 equations, 1 figure, 2 tables)

This paper contains 20 sections, 13 equations, 1 figure, 2 tables.

Introduction
Related Work
Method
Original ELBO Formulation
Entropy Decomposed ELBO
ELBO Optimization
Experiments
Synthetic datasets
Dataset 1: Gaussian Prior
Dataset 2: Complex Non-Gaussian Prior
Model Configurations
Experimental Setup
Results
Discussion
Supplementary Material
...and 5 more sections

Figures (1)

Figure 1: Visual description of the Evidence Lower BOund (ELBO) in a ED-VAE. Figure \ref{['fig:ed_vae_graph']} describes the minimization of ELBO, which we show consists of a mutual information, an entropy and a cross-entropy term, as shown in Section \ref{['sec:entropy_dec']}.

ED-VAE: Entropy Decomposition of ELBO in Variational Autoencoders

TL;DR

Abstract

ED-VAE: Entropy Decomposition of ELBO in Variational Autoencoders

Authors

TL;DR

Abstract

Table of Contents

Figures (1)