A Revisit of Total Correlation in Disentangled Variational Auto-Encoder with Partial Disentanglement
Chengrui Li, Yunmiao Wang, Yule Wang, Weihan Li, Dieter Jaeger, Anqi Wu
TL;DR
PDisVAE introduces a flexible, partially disentangled variational auto-encoder that replaces the full- independence TC penalty with a partial correlation (PC) penalty to enforce group-wise independence while allowing within-group entanglement. By unifying the grouping parameter G with latent dimensionality K, PDisVAE smoothly interpolates between standard VAEs and fully disentangled VAEs, and it naturally accommodates rank deficiencies within groups. The authors derive an optimal importance-sampling batch approximation for estimating the PC term, and validate the approach on synthetic, partial-dsprites, CelebA, and neural data, showing improved recovery of group structure and richer, more interpretable representations than fully disentangled methods. The framework offers a practical and versatile tool for learning latent representations that honor realistic, group-wise independence structures in complex data. This has broad implications for applications in computer vision and neuroscience where factors of variation exhibit partial, rather than complete, independence.
Abstract
A fully disentangled variational auto-encoder (VAE) aims to identify disentangled latent components from observations. However, enforcing full independence between all latent components may be too strict for certain datasets. In some cases, multiple factors may be entangled together in a non-separable manner, or a single independent semantic meaning could be represented by multiple latent components within a higher-dimensional manifold. To address such scenarios with greater flexibility, we develop the Partially Disentangled VAE (PDisVAE), which generalizes the total correlation (TC) term in fully disentangled VAEs to a partial correlation (PC) term. This framework can handle group-wise independence and can naturally reduce to either the standard VAE or the fully disentangled VAE. Validation through three synthetic experiments demonstrates the correctness and practicality of PDisVAE. When applied to real-world datasets, PDisVAE discovers valuable information that is difficult to find using fully disentangled VAEs, implying its versatility and effectiveness.
