Table of Contents
Fetching ...

Generalized Dimension Reduction Using Semi-Relaxed Gromov-Wasserstein Distance

Ranthony A. Clark, Tom Needham, Thomas Weighill

TL;DR

The paper introduces a manifold-valued extension of multidimensional scaling by exploiting the semi-relaxed Gromov-Wasserstein (srGW) distance, linking dimensionality reduction to optimal transport and generalized Gromov-Hausdorff distances. It proves the existence of Monge maps that realize srGW, showing srGW generalizes classical MDS and equivalently relates to modified Gromov-Hausdorff distances, enabling embeddings into diverse target spaces beyond Euclidean ones. The authors develop SRGW+GD, an efficient algorithm that initializes with a discretized srGW embedding and refines via gradient descent, and demonstrate its effectiveness on MNIST, rotated MNIST, and geodesic-circle embeddings of city data. They further showcase a redistricting application where ensembles of districting plans are visualized on a circle to expose typical patterns and outliers, highlighting the practical utility of manifold-valued dimensionality reduction for complex non-Euclidean data.

Abstract

Dimension reduction techniques typically seek an embedding of a high-dimensional point cloud into a low-dimensional Euclidean space which optimally preserves the geometry of the input data. Based on expert knowledge, one may instead wish to embed the data into some other manifold or metric space in order to better reflect the geometry or topology of the point cloud. We propose a general method for manifold-valued multidimensional scaling based on concepts from optimal transport. In particular, we establish theoretical connections between the recently introduced semi-relaxed Gromov-Wasserstein (srGW) framework and multidimensional scaling by solving the Monge problem in this setting. We also derive novel connections between srGW distance and Gromov-Hausdorff distance. We apply our computational framework to analyze ensembles of political redistricting plans for states with two Congressional districts, achieving an effective visualization of the ensemble as a distribution on a circle which can be used to characterize typical neutral plans, and to flag outliers.

Generalized Dimension Reduction Using Semi-Relaxed Gromov-Wasserstein Distance

TL;DR

The paper introduces a manifold-valued extension of multidimensional scaling by exploiting the semi-relaxed Gromov-Wasserstein (srGW) distance, linking dimensionality reduction to optimal transport and generalized Gromov-Hausdorff distances. It proves the existence of Monge maps that realize srGW, showing srGW generalizes classical MDS and equivalently relates to modified Gromov-Hausdorff distances, enabling embeddings into diverse target spaces beyond Euclidean ones. The authors develop SRGW+GD, an efficient algorithm that initializes with a discretized srGW embedding and refines via gradient descent, and demonstrate its effectiveness on MNIST, rotated MNIST, and geodesic-circle embeddings of city data. They further showcase a redistricting application where ensembles of districting plans are visualized on a circle to expose typical patterns and outliers, highlighting the practical utility of manifold-valued dimensionality reduction for complex non-Euclidean data.

Abstract

Dimension reduction techniques typically seek an embedding of a high-dimensional point cloud into a low-dimensional Euclidean space which optimally preserves the geometry of the input data. Based on expert knowledge, one may instead wish to embed the data into some other manifold or metric space in order to better reflect the geometry or topology of the point cloud. We propose a general method for manifold-valued multidimensional scaling based on concepts from optimal transport. In particular, we establish theoretical connections between the recently introduced semi-relaxed Gromov-Wasserstein (srGW) framework and multidimensional scaling by solving the Monge problem in this setting. We also derive novel connections between srGW distance and Gromov-Hausdorff distance. We apply our computational framework to analyze ensembles of political redistricting plans for states with two Congressional districts, achieving an effective visualization of the ensemble as a distribution on a circle which can be used to characterize typical neutral plans, and to flag outliers.
Paper Structure (31 sections, 8 theorems, 66 equations, 19 figures, 3 tables, 1 algorithm)

This paper contains 31 sections, 8 theorems, 66 equations, 19 figures, 3 tables, 1 algorithm.

Key Result

Theorem 2

Let $(X,d_X,\mu_X)$ be a metric measure space with $X$ finite and $\mu_X$ fully supported and let $(Y,d_Y)$ be a proper metric space with a cocompact action by isometries by some group $G$. Then for any $p \in [1,\infty]$, there exists a function $f: X \to Y$ such that Moreover, if $p < \infty$, then any semi-coupling with the same distortion as $\mu_f$ is induced by a function.

Figures (19)

  • Figure 1: Circle and planar embeddings for the R-MNIST9 dataset (above), and plots comparing the true angle of rotation vs the inferred angular coordinate from the embedding (below). Color indicates the true angle of rotation.
  • Figure 2: Embedding onto a geodesic sphere of 20 world cities.
  • Figure 3: Circle embeddings of 1,000-plan ensembles using SRGW+GD. Heat maps indicate the average district location for plans in each part of the circle.
  • Figure 4: Summary of the relationships between various notions of dissimilarity defined in Sections \ref{['sec:gwdistances']} and \ref{['sec:gromov-type_distances']}.
  • Figure 5: Circle and planar embeddings for images of 0 s in the form of Figure \ref{['fig:scatterplots']}.
  • ...and 14 more figures

Theorems & Definitions (25)

  • Definition 1: Semi-Relaxed Gromov-Wasserstein Distance
  • Theorem 2: Existence of Monge Maps
  • Remark 3
  • Remark 4: The Monge Problem
  • Example 5
  • Corollary 6
  • Definition 7: Semi-Relaxed Gromov-Hausdorff Distance
  • Theorem 8: Equivalence of Gromov-Hausdorff Distances
  • Theorem 9: Equivalence of srGW and srGH
  • Remark 10
  • ...and 15 more