Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

Zijian Li; Shunxing Fan; Yujia Zheng; Ignavier Ng; Shaoan Xie; Guangyi Chen; Xinshuai Dong; Ruichu Cai; Kun Zhang

Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

Zijian Li, Shunxing Fan, Yujia Zheng, Ignavier Ng, Shaoan Xie, Guangyi Chen, Xinshuai Dong, Ruichu Cai, Kun Zhang

TL;DR

This work addresses identifiability in nonlinear ICA for disentangled representation learning under practical constraints. It introduces a unified framework that combines sufficient changes via auxiliary variables with a sparse mixing procedure, supported by theoretical results that yield subspace- and component-wise identifiability. The authors instantiate the theory with CG-VAE and CG-GAN, incorporating a domain-encoding network and a sparse mixing constraint, and validate the approach on synthetic and real multi-domain image data, showing improved disentanglement and domain-aware controllability. The method promises more broadly applicable identifiability guarantees in real-world settings where domain coverage and full sparsity are challenging, enabling more robust and interpretable generative models across domains.

Abstract

Disentangled representation learning aims to uncover latent variables underlying the observed data, and generally speaking, rather strong assumptions are needed to ensure identifiability. Some approaches rely on sufficient changes on the distribution of latent variables indicated by auxiliary variables such as domain indices, but acquiring enough domains is often challenging. Alternative approaches exploit structural sparsity assumptions on the mixing procedure, but such constraints are usually (partially) violated in practice. Interestingly, we find that these two seemingly unrelated assumptions can actually complement each other to achieve identifiability. Specifically, when conditioned on auxiliary variables, the sparse mixing procedure assumption provides structural constraints on the mapping from estimated to true latent variables and hence compensates for potentially insufficient distribution changes. Building on this insight, we propose an identifiability theory with less restrictive constraints regarding distribution changes and the sparse mixing procedure, enhancing applicability to real-world scenarios. Additionally, we develop an estimation framework incorporating a domain encoding network and a sparse mixing constraint and provide two implementations based on variational autoencoders and generative adversarial networks, respectively. Experiment results on synthetic and real-world datasets support our theoretical results.

Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

TL;DR

Abstract

Synergy Between Sufficient Changes and Sparse Mixing Procedure for Disentangled Representation Learning

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (6)

Theorems & Definitions (11)