Disentanglement Learning via Topology
Nikita Balabin, Daria Voronkova, Ilya Trofimov, Evgeny Burnaev, Serguei Barannikov
TL;DR
This work introduces TopDis, a topological regularizer for disentangled representation learning that operates in an unsupervised setting and remains effective even when factors of variation are correlated. By integrating a differentiable Representation Topology Divergence (RTD) term into VAE-type losses and employing Gaussian-preserving latent shifts via group(oid) actions, TopDis enforces topological similarity across latent traversals. A gradient orthogonalization step safeguards reconstruction quality while encouraging disentanglement. Across multiple benchmarks (dSprites, 3D Shapes, 3D Faces, MPI 3D, CelebA) and several VAE variants, TopDis consistently improves MIG, FactorVAE score, SAP, and DCI disentanglement metrics while preserving reconstruction, and it can uncover disentangled directions in pretrained StyleGAN. The approach offers a flexible, topology-driven inductive bias for disentanglement with potential applicability to diverse domains beyond images.
Abstract
We propose TopDis (Topological Disentanglement), a method for learning disentangled representations via adding a multi-scale topological loss term. Disentanglement is a crucial property of data representations substantial for the explainability and robustness of deep learning models and a step towards high-level cognition. The state-of-the-art methods are based on VAE and encourage the joint distribution of latent variables to be factorized. We take a different perspective on disentanglement by analyzing topological properties of data manifolds. In particular, we optimize the topological similarity for data manifolds traversals. To the best of our knowledge, our paper is the first one to propose a differentiable topological loss for disentanglement learning. Our experiments have shown that the proposed TopDis loss improves disentanglement scores such as MIG, FactorVAE score, SAP score, and DCI disentanglement score with respect to state-of-the-art results while preserving the reconstruction quality. Our method works in an unsupervised manner, permitting us to apply it to problems without labeled factors of variation. The TopDis loss works even when factors of variation are correlated. Additionally, we show how to use the proposed topological loss to find disentangled directions in a trained GAN.
