Provable Affine Identifiability of Nonlinear CCA under Latent Distributional Priors
Zhiwei Han, Stefan Matthes, Hao Shen
TL;DR
The paper proves that nonlinear CCA can identifiably recover ground-truth latent factors up to an affine transform when latent priors are present and whitening is applied. It establishes population affine identifiability for Gaussian priors and shows ridge-regularized empirical CCA converges to the population solution as data grows. The analysis hinges on transporting the problem to source space and exploiting spectral separation via Hermite/Mehler expansions, with reparameterization invariance guaranteeing consistency across representations. Empirical results on synthetic and rendered data validate the theory and demonstrate the importance of whitening, while ablations reveal the limits under assumption violations. Overall, the work positions affine identifiability as a principled alternative to contrastive methods for disentangling multi-view latent structure in nonlinear settings.
Abstract
In this work, we establish conditions under which nonlinear CCA recovers the ground-truth latent factors up to an orthogonal transform after whitening. Building on the classical result that linear mappings maximize canonical correlations under Gaussian priors, we prove affine identifiability for a broad class of latent distributions in the population setting. Central to our proof is a reparameterization result that transports the analysis from observation space to source space, where identifiability becomes tractable. We further show that whitening is essential for ensuring boundedness and well-conditioning, thereby underpinning identifiability. Beyond the population setting, we prove that ridge-regularized empirical CCA converges to its population counterpart, transferring these guarantees to the finite-sample regime. Experiments on a controlled synthetic dataset and a rendered image dataset validate our theory and demonstrate the necessity of its assumptions through systematic ablations.
