On the Wasserstein alignment problem
Soumik Pal, Bodhisattva Sen, Ting-Kam Leonard Wong
TL;DR
This work introduces the Wasserstein downward alignment problem, where a family of transformations $\{T_{\theta}\}$ maps a source measure $\mu$ in $\mathcal{X}$ to a target space $\mathcal{Z}$ with measure $\nu$, and seeks the transformation minimizing the Wasserstein cost between $(T_{\theta})_{\#}\mu$ and $\nu$ via $\mathbb{W}_c^{\downarrow}(\mu,\nu)$. A key contribution is a generalized Kantorovich duality that yields a convex relaxation and a tractable dual formulation, enabling a linear-programming algorithm for empirical/discrete data and extensions to penalized objectives $\mathbb{W}_c^{\downarrow,R}(\mu,\nu)$. In the Euclidean setting, the paper shows an equivalence between downward and upward formulations under zero-mean, unit-covariance normalization, and derives a first-order optimality condition for orthogonal projections, including a concrete 2D example highlighting nonconvexity and local minima. The implementation sections provide a concrete LP approach for empirical measures and demonstrate alignment of 2D point clouds, illustrating the method’s robustness and ability to preserve geometric structure. Overall, the results offer a principled, convex-analytic framework for cross-space distribution alignment with practical LP-based computability and broad applicability to shape analysis and related tasks.
Abstract
Suppose we are given two metric spaces and a family of continuous transformations from one to the other. Given a probability distribution on each of these two spaces - namely the source and the target measures - the Wasserstein alignment problem seeks the transformation that minimizes the optimal transport cost between its pushforward of the source distribution and the target distribution, ensuring the closest possible alignment in a probabilistic sense. Examples of interest include two distributions on two Euclidean spaces $\mathbb{R}^n$ and $\mathbb{R}^d$, and we want a spatial embedding of the $n$-dimensional source measure in $\mathbb{R}^d$ that is closest in some Wasserstein metric to the target distribution on $\mathbb{R}^d$. Similar data alignment problems also commonly arise in shape analysis and computer vision. In this paper we show that this nonconvex optimal transport projection problem admits a convex Kantorovich-type dual. This allows us to characterize the set of projections and devise a linear programming algorithm. For certain special examples, such as orthogonal transformations on Euclidean spaces of unequal dimensions and the $2$-Wasserstein cost, we characterize the covariance of the optimal projections. Our results also cover the generalization when we penalize each transformation by a function. An example is the inner product Gromov-Wasserstein distance minimization problem which has recently gained popularity.
