Multi-Way Representation Alignment
Akshit Achara, Tatiana Gaintseva, Mateo Mahaut, Pritish Chakraborty, Viktor Stenby Johansson, Melih Barsbey, Emanuele Rodolà, Donato Crisostomi
TL;DR
The paper addresses the challenge of aligning representations from multiple independently trained models by introducing a shared universe framework. It adapts Generalized Procrustes Analysis (GPA) to build an isometric universal space, but shows that pure geometry preservation can impair retrieval performance, motivating a consensus-based correction. The authors propose Geometry-Corrected Procrustes Alignment (GCPA), which uses GPA as a geometric scaffold and applies a shared residual correction to bridge geometry and cross-model agreement. Across multilingual, cross-camera, and multimodal benchmarks, GCPA achieves state-of-the-art any-to-any retrieval while enabling scalable universe extension and robust aggregation, demonstrating practical interoperability for model stitching, cross-modal transfer, and zero-shot composition.
Abstract
The Platonic Representation Hypothesis suggests that independently trained neural networks converge to increasingly similar latent spaces. However, current strategies for mapping these representations are inherently pairwise, scaling quadratically with the number of models and failing to yield a consistent global reference. In this paper, we study the alignment of $M \ge 3$ models. We first adapt Generalized Procrustes Analysis (GPA) to construct a shared orthogonal universe that preserves the internal geometry essential for tasks like model stitching. We then show that strict isometric alignment is suboptimal for retrieval, where agreement-maximizing methods like Canonical Correlation Analysis (CCA) typically prevail. To bridge this gap, we finally propose Geometry-Corrected Procrustes Alignment (GCPA), which establishes a robust GPA-based universe followed by a post-hoc correction for directional mismatch. Extensive experiments demonstrate that GCPA consistently improves any-to-any retrieval while retaining a practical shared reference space.
