On the Wasserstein alignment problem

Soumik Pal; Bodhisattva Sen; Ting-Kam Leonard Wong

On the Wasserstein alignment problem

Soumik Pal, Bodhisattva Sen, Ting-Kam Leonard Wong

TL;DR

This work introduces the Wasserstein downward alignment problem, where a family of transformations $\{T_{\theta}\}$ maps a source measure $\mu$ in $\mathcal{X}$ to a target space $\mathcal{Z}$ with measure $\nu$, and seeks the transformation minimizing the Wasserstein cost between $(T_{\theta})_{\#}\mu$ and $\nu$ via $\mathbb{W}_c^{\downarrow}(\mu,\nu)$. A key contribution is a generalized Kantorovich duality that yields a convex relaxation and a tractable dual formulation, enabling a linear-programming algorithm for empirical/discrete data and extensions to penalized objectives $\mathbb{W}_c^{\downarrow,R}(\mu,\nu)$. In the Euclidean setting, the paper shows an equivalence between downward and upward formulations under zero-mean, unit-covariance normalization, and derives a first-order optimality condition for orthogonal projections, including a concrete 2D example highlighting nonconvexity and local minima. The implementation sections provide a concrete LP approach for empirical measures and demonstrate alignment of 2D point clouds, illustrating the method’s robustness and ability to preserve geometric structure. Overall, the results offer a principled, convex-analytic framework for cross-space distribution alignment with practical LP-based computability and broad applicability to shape analysis and related tasks.

Abstract

Suppose we are given two metric spaces and a family of continuous transformations from one to the other. Given a probability distribution on each of these two spaces - namely the source and the target measures - the Wasserstein alignment problem seeks the transformation that minimizes the optimal transport cost between its pushforward of the source distribution and the target distribution, ensuring the closest possible alignment in a probabilistic sense. Examples of interest include two distributions on two Euclidean spaces $\mathbb{R}^n$ and $\mathbb{R}^d$, and we want a spatial embedding of the $n$-dimensional source measure in $\mathbb{R}^d$ that is closest in some Wasserstein metric to the target distribution on $\mathbb{R}^d$. Similar data alignment problems also commonly arise in shape analysis and computer vision. In this paper we show that this nonconvex optimal transport projection problem admits a convex Kantorovich-type dual. This allows us to characterize the set of projections and devise a linear programming algorithm. For certain special examples, such as orthogonal transformations on Euclidean spaces of unequal dimensions and the $2$-Wasserstein cost, we characterize the covariance of the optimal projections. Our results also cover the generalization when we penalize each transformation by a function. An example is the inner product Gromov-Wasserstein distance minimization problem which has recently gained popularity.

On the Wasserstein alignment problem

TL;DR

This work introduces the Wasserstein downward alignment problem, where a family of transformations

maps a source measure

to a target space

with measure

, and seeks the transformation minimizing the Wasserstein cost between

and

via

. A key contribution is a generalized Kantorovich duality that yields a convex relaxation and a tractable dual formulation, enabling a linear-programming algorithm for empirical/discrete data and extensions to penalized objectives

. In the Euclidean setting, the paper shows an equivalence between downward and upward formulations under zero-mean, unit-covariance normalization, and derives a first-order optimality condition for orthogonal projections, including a concrete 2D example highlighting nonconvexity and local minima. The implementation sections provide a concrete LP approach for empirical measures and demonstrate alignment of 2D point clouds, illustrating the method’s robustness and ability to preserve geometric structure. Overall, the results offer a principled, convex-analytic framework for cross-space distribution alignment with practical LP-based computability and broad applicability to shape analysis and related tasks.

Abstract

and

, and we want a spatial embedding of the

-dimensional source measure in

that is closest in some Wasserstein metric to the target distribution on

. Similar data alignment problems also commonly arise in shape analysis and computer vision. In this paper we show that this nonconvex optimal transport projection problem admits a convex Kantorovich-type dual. This allows us to characterize the set of projections and devise a linear programming algorithm. For certain special examples, such as orthogonal transformations on Euclidean spaces of unequal dimensions and the

-Wasserstein cost, we characterize the covariance of the optimal projections. Our results also cover the generalization when we penalize each transformation by a function. An example is the inner product Gromov-Wasserstein distance minimization problem which has recently gained popularity.

On the Wasserstein alignment problem

TL;DR

Abstract

On the Wasserstein alignment problem

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (4)

Theorems & Definitions (25)