Unsupervised discovery of the shared and private geometry in multi-view data
Sai Koukuntla, Joshua B. Julian, Jesse C. Kaminsky, Manuel Schottdorf, David W. Tank, Carlos D. Brody, Adam S. Charles
TL;DR
This work tackles the challenge of uncovering nonlinear, shared versus private latent structure across paired high-dimensional views without supervision. It introduces SPLICE, a two-step autoencoder framework that first disentangles shared and private latents via predictability minimization in a crossed-encoder setup, then preserves the intrinsic geometry of each submanifold by projecting data, estimating geodesic distances with landmark-based methods, and applying a geometry-preserving fine-tuning loss. The approach yields interpretable latent spaces that capture the geometry of both shared and private information, demonstrated across Rotated MNIST, synthetic LGN–V1 data, and real hippocampus–prefrontal cortex recordings, and outperforming several baselines. The method enables blind discovery of meaningful cross-view structure with robust dimensionality estimation, holding promise for neuroscience and cross-modal sensor fusion applications. Overall, SPLICE advances interpretable, geometry-aware multi-view representation learning in scientific contexts where true latent dimensionality is unknown a priori.
Abstract
Studying complex real-world phenomena often involves data from multiple views (e.g. sensor modalities or brain regions), each capturing different aspects of the underlying system. Within neuroscience, there is growing interest in large-scale simultaneous recordings across multiple brain regions. Understanding the relationship between views (e.g., the neural activity in each region recorded) can reveal fundamental insights into each view and the system as a whole. However, existing methods to characterize such relationships lack the expressivity required to capture nonlinear relationships, describe only shared sources of variance, or discard geometric information that is crucial to drawing insights from data. Here, we present SPLICE: a neural network-based method that infers disentangled, interpretable representations of private and shared latent variables from paired samples of high-dimensional views. Compared to competing methods, we demonstrate that SPLICE 1) disentangles shared and private representations more effectively, 2) yields more interpretable representations by preserving geometry, and 3) is more robust to incorrect a priori estimates of latent dimensionality. We propose our approach as a general-purpose method for finding succinct and interpretable descriptions of paired data sets in terms of disentangled shared and private latent variables.
