Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

Sagar Shrestha; Xiao Fu

Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

Sagar Shrestha, Xiao Fu

TL;DR

This work addresses identifiability of latent content $oldsymbol{c}$ and domain-specific style $oldsymbol{s}^{(n)}$ from unaligned multi-domain data, where latent dimensions may be unknown. It introduces cross-domain latent distribution matching (LDM) to prove identifiability under relaxed conditions, notably relaxing componentwise independence and requiring only two domains in some cases. The authors show that with sparsity constraints, identifiability holds even when latent dimensions are not known, and they reformulate LDM as a sparsity-regularized multi-domain GAN (MDGAN) that is computationally efficient. Empirical results on image translation and generation tasks demonstrate reliable content-style disentanglement, high style diversity, and competitive generation quality, validating the theoretical claims and offering a practical pathway for dimension-agnostic content-style learning.

Abstract

Understanding identifiability of latent content and style variables from unaligned multi-domain data is essential for tasks such as domain translation and data generation. Existing works on content-style identification were often developed under somewhat stringent conditions, e.g., that all latent components are mutually independent and that the dimensions of the content and style variables are known. We introduce a new analytical framework via cross-domain \textit{latent distribution matching} (LDM), which establishes content-style identifiability under substantially more relaxed conditions. Specifically, we show that restrictive assumptions such as component-wise independence of the latent variables can be removed. Most notably, we prove that prior knowledge of the content and style dimensions is not necessary for ensuring identifiability, if sparsity constraints are properly imposed onto the learned latent representations. Bypassing the knowledge of the exact latent dimension has been a longstanding aspiration in unsupervised representation learning -- our analysis is the first to underpin its theoretical and practical viability. On the implementation side, we recast the LDM formulation into a regularized multi-domain GAN loss with coupled latent variables. We show that the reformulation is equivalent to LDM under mild conditions -- yet requiring considerably less computational resource. Experiments corroborate with our theoretical claims.

Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

TL;DR

Abstract

Content-Style Learning from Unaligned Domains: Identifiability under Unknown Latent Dimensions

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (16)

Theorems & Definitions (10)