Table of Contents
Fetching ...

Analyzing Generative Models by Manifold Entropic Metrics

Daniel Galperin, Ullrich Köthe

TL;DR

The paper tackles unsupervised evaluation of disentangled representations in deep generative models. It introduces Manifold Entropic Metrics that operate on the decoder side, leveraging the Jacobian to define quantities like manifold entropy $H(q_S)$, manifold total correlation $\mathcal{I}_{\mathcal{P}}$, and manifold mutual information $\mathcal{I}(q_S,q_T)$. Grounded in the manifold hypothesis and Independent Mechanism Analysis, the approach formalizes latent manifolds $\mathcal{M}_{\mathbb{S}}$ and their densities $q_{\mathbb{S}}$, enabling alignment and disentanglement assessment through information-theoretic lenses. Experiments on toy data and EMNIST show that architectural choices and training biases can steer models toward aligned and disentangled latent representations, with practical implications for bottleneck design and model evaluation.

Abstract

Good generative models should not only synthesize high quality data, but also utilize interpretable representations that aid human understanding of their behavior. However, it is difficult to measure objectively if and to what degree desirable properties of disentangled representations have been achieved. Inspired by the principle of independent mechanisms, we address this difficulty by introducing a novel set of tractable information-theoretic evaluation metrics. We demonstrate the usefulness of our metrics on illustrative toy examples and conduct an in-depth comparison of various normalizing flow architectures and $β$-VAEs on the EMNIST dataset. Our method allows to sort latent features by importance and assess the amount of residual correlations of the resulting concepts. The most interesting finding of our experiments is a ranking of model architectures and training procedures in terms of their inductive bias to converge to aligned and disentangled representations during training.

Analyzing Generative Models by Manifold Entropic Metrics

TL;DR

The paper tackles unsupervised evaluation of disentangled representations in deep generative models. It introduces Manifold Entropic Metrics that operate on the decoder side, leveraging the Jacobian to define quantities like manifold entropy , manifold total correlation , and manifold mutual information . Grounded in the manifold hypothesis and Independent Mechanism Analysis, the approach formalizes latent manifolds and their densities , enabling alignment and disentanglement assessment through information-theoretic lenses. Experiments on toy data and EMNIST show that architectural choices and training biases can steer models toward aligned and disentangled latent representations, with practical implications for bottleneck design and model evaluation.

Abstract

Good generative models should not only synthesize high quality data, but also utilize interpretable representations that aid human understanding of their behavior. However, it is difficult to measure objectively if and to what degree desirable properties of disentangled representations have been achieved. Inspired by the principle of independent mechanisms, we address this difficulty by introducing a novel set of tractable information-theoretic evaluation metrics. We demonstrate the usefulness of our metrics on illustrative toy examples and conduct an in-depth comparison of various normalizing flow architectures and -VAEs on the EMNIST dataset. Our method allows to sort latent features by importance and assess the amount of residual correlations of the resulting concepts. The most interesting finding of our experiments is a ranking of model architectures and training procedures in terms of their inductive bias to converge to aligned and disentangled representations during training.

Paper Structure

This paper contains 71 sections, 91 equations, 20 figures, 2 tables.

Figures (20)

  • Figure 1: The two moons distribution illustrates how manifold entropic metrics quantify DRL in terms of alignment and disentanglement. (Top left) The latent prior and a Cartesian grid spanned by the latent variables $Z_c$ (orange) and $Z_d$ (blue). The latent distribution is mapped to data space by three generative models with equal accuracy, but vastly different representations. This can be seen by the differences in the transformed grid spanned by the manifold random variables ${\boldsymbol{X}}_c$ (orange) and ${\boldsymbol{X}}_d$ (blue) in the top row, and the corresponding values of our metrics manifold entropy$H({\boldsymbol{X}}_c)$, $H({\boldsymbol{X}}_d)$ and manifold mutual information$\mathcal{I}({\boldsymbol{X}}_c, {\boldsymbol{X}}_d)$ in the bottom row. The total entropy of the distribution (gray) is the signed sum of the three terms. (A) The latent manifolds are entangled (and thus not interpretable), and our metric indicates this by high mutual information (brown). (B) The latent manifolds are locally orthogonal everywhere and have low mutual information. However, alignment is inconsistent (${\boldsymbol{X}}_d$ aligns with the upper moon, ${\boldsymbol{X}}_c$ with the lower), resulting in comparable manifold entropy of both variables. (C) The representation is disentangled and aligned. The manifold entropy is high for the important variable ${\boldsymbol{X}}_c$ (orange) and low for the noise variable ${\boldsymbol{X}}_d$ (blue), and their mutual information is small.
  • Figure 2: Application of our manifold entropic metrics in order to infer if the ground-truth DGP has been learned (b) or not (a). (Top) Manifold entropy spectrum and (Bottom) Pearson Correlation matrix and MCPMI matrix (left to right) comparing a trained model with the ground truth. As expected, statistical independence between latent variables (diagonal MCPMI) implies uncorrelated features (diagonal Pearson).
  • Figure 3: Manifold Entropy spectrum of NFs with unsorted (faded color) and sorted (full color) latent dimensions. Trained with additional reconstruction loss for $10$, $20$ and $100$ core dimensions ${\mathbb{C}}$.
  • Figure 4: Sorting latent dimensions by the standard deviation of each latent variable as in GIN can reveal a spurious notion of importance.
  • Figure 5: Manifold Entropy Spectra (left) with unsorted (faded color) and sorted (full color) latent dimensions, MPMI matrix of a trained GLOW-like NF (middle) and MPMI matrix of a Wavelet-flow (right).
  • ...and 15 more figures

Theorems & Definitions (6)

  • Definition A.1
  • Definition A.2
  • Definition A.3
  • Definition A.4
  • Definition A.5
  • Definition A.6