On Wasserstein distances for affine transformations of random vectors

Keaton Hamm; Andrzej Korzeniowski

On Wasserstein distances for affine transformations of random vectors

Keaton Hamm, Andrzej Korzeniowski

TL;DR

The paper addresses the problem of quantifying the Wasserstein distance $W_2$ between a distribution and its affine transformations, with a focus on rotations and their role in rotational manifolds within Wasserstein space. It leverages the Dowson–Landau lower bound and Gelbrich’s equality framework to derive explicit, covariance-based lower bounds, and in 2D derives a closed-form rotation bound using the trace of square roots of covariance products; it also provides an upper bound for compositions of translations, rotations, and dilations. Key contributions include a closed-form expression for $\operatorname{tr}[(\Sigma_{R_\theta X}\Sigma_{R_\varphi X})^{1/2}]$ and sharp lower bounds that are attained for Gaussian or elliptically contoured distributions, together with practical upper bounds for affine compositions and concrete demonstrations on 1D submanifolds and handwriting-inspired data for manifold learning via Wassmap. The work advances understanding of Wasserstein geometry under affine deformations and offers practical tools for synthetic data generation and dimensionality reduction benchmarking in applications like handwriting analysis and image-based manifold learning, while highlighting open questions about centering, symmetry, and the influence of data geometry on the tightness of the bounds.

Abstract

We expound on some known lower bounds of the quadratic Wasserstein distance between random vectors in $\mathbb{R}^n$ with an emphasis on affine transformations that have been used in manifold learning of data in Wasserstein space. In particular, we give concrete lower bounds for rotated copies of random vectors in $\mathbb{R}^2$ by computing the Bures metric between the covariance matrices. We also derive upper bounds for compositions of affine maps which yield a fruitful variety of diffeomorphisms applied to an initial data measure. We apply these bounds to various distributions including those lying on a 1-dimensional manifold in $\mathbb{R}^2$ and illustrate the quality of the bounds. Finally, we give a framework for mimicking handwritten digit or alphabet datasets that can be applied in a manifold learning framework.

On Wasserstein distances for affine transformations of random vectors

TL;DR

The paper addresses the problem of quantifying the Wasserstein distance

between a distribution and its affine transformations, with a focus on rotations and their role in rotational manifolds within Wasserstein space. It leverages the Dowson–Landau lower bound and Gelbrich’s equality framework to derive explicit, covariance-based lower bounds, and in 2D derives a closed-form rotation bound using the trace of square roots of covariance products; it also provides an upper bound for compositions of translations, rotations, and dilations. Key contributions include a closed-form expression for

and sharp lower bounds that are attained for Gaussian or elliptically contoured distributions, together with practical upper bounds for affine compositions and concrete demonstrations on 1D submanifolds and handwriting-inspired data for manifold learning via Wassmap. The work advances understanding of Wasserstein geometry under affine deformations and offers practical tools for synthetic data generation and dimensionality reduction benchmarking in applications like handwriting analysis and image-based manifold learning, while highlighting open questions about centering, symmetry, and the influence of data geometry on the tightness of the bounds.

Abstract

We expound on some known lower bounds of the quadratic Wasserstein distance between random vectors in

with an emphasis on affine transformations that have been used in manifold learning of data in Wasserstein space. In particular, we give concrete lower bounds for rotated copies of random vectors in

by computing the Bures metric between the covariance matrices. We also derive upper bounds for compositions of affine maps which yield a fruitful variety of diffeomorphisms applied to an initial data measure. We apply these bounds to various distributions including those lying on a 1-dimensional manifold in

and illustrate the quality of the bounds. Finally, we give a framework for mimicking handwritten digit or alphabet datasets that can be applied in a manifold learning framework.

Paper Structure (13 sections, 16 theorems, 79 equations, 11 figures)

This paper contains 13 sections, 16 theorems, 79 equations, 11 figures.

Introduction
Notation and definitions
Lower bounds for the Wasserstein distance
Examples of Wasserstein distances between transformations of a fixed random vector
Translations
Dilations
Rotations
Illustrating the Lower bound for Rotations
Compositions of affine maps
Applications to 1-dimensional submanifolds
Toward handwriting analysis
Wassmap embeddings
Conclusion

Key Result

Proposition 1.1

Given $\mu,\nu\in\mathcal{P}(\mathbb{R}^n)$ with corresponding random vectors $X,Y\in\mathbb{R}^n$, we have

Figures (11)

Figure 1: Rotation of $[0,1]^2$ from Example \ref{['EX:01Rotation']}. (Left) $W_2(X,R_\theta X)$ and the lower bound $\sqrt{1-\cos(\theta)}$ for $\theta\in[0,\frac{\pi}{2}]$. (Right) Relative error $(W_2(X,R_\theta X)-\sqrt{1-\cos(\theta)})/W_2(X,R_\theta X)$ for $\theta\in[0,\frac{\pi}{2}]$.
Figure 2: Rotation of $[-\frac{1}{2},\frac{1}{2}]^2$ from Example \ref{['EX:CenteredCubeRotation']}. Plotted is $W_2(X,R_\theta X)$ for $\theta\in[0,\frac{\pi}{2}]$.
Figure 3: Rotation of $[0,2]\times[0,1]$ from Example \ref{['EX:RectangleRotation']}. (Left) $W_2(X,R_\theta X)$ and the lower bound of \ref{['EQN:RectangleLowerBound']} for $\theta\in[0,\frac{\pi}{2}]$. (Right) Relative error $(W_2(X,R_\theta X)-$lower bound)$/W_2(X,R_\theta X)$ for $\theta\in[0,\frac{\pi}{2}]$.
Figure 4: Figure illustrating Example \ref{['EX:GaussianComposition']}. (Top) Sample of $X\sim\mathcal{N}(0,\Sigma_1)$ under the map $T_\alpha\circ R_\theta\circ S_\lambda$ for $\theta\in[0,\frac{\pi}{2}]$ with $\alpha = 0$, $\lambda = \frac{1}{2}2$. (Bottom) Relative error (upper bound $- W_2(T_\alpha\circ R_\theta\circ S_\lambda X,X))/W_2(T_\alpha\circ R_\theta\circ S_\lambda X,X)$ for $X\sim(0,\Sigma_i)$, $i=1,2,3$ using the upper bound of Theorem \ref{['THM:CompositionW2']} (which is valid because Gaussians satisfy equality in Theorem \ref{['THM:RotationW2']}).
Figure 5: (Left) $X$ drawn from the uniform distribution on the unit circle in $\mathbb{R}^2$ as in Example \ref{['EX:CircleComposition']}. Shown is the relative error between the Wasserstein distance and the upper bound. (Right) Relative error between Wasserstein distance and upper bound for $X$ drawn from the uniform distribution on $[-\frac{1}{2},\frac{1}{2}]$ in $\mathbb{R}^2$ as in Example \ref{['EX:LineComposition']}.
...and 6 more figures

Theorems & Definitions (39)

Proposition 1.1
proof
Theorem 1.2: dowson1982frechet
Corollary 1.3
Proposition 1.4
proof
Proposition 2.1
proof
Proposition 2.2
proof
...and 29 more

On Wasserstein distances for affine transformations of random vectors

TL;DR

Abstract

On Wasserstein distances for affine transformations of random vectors

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (11)

Theorems & Definitions (39)