Degrees of the Wasserstein Distance to Small Toric Models
Greg DePaul, Serkan Hoşten, Nilava Metya, Ikenna Nometa
TL;DR
This work studies the algebraic complexity of computing Wasserstein distances from discrete empirical distributions to small toric models. It introduces the Wasserstein degree $w(X,F)$, the number of complex critical points of a linear objective on a translated face intersected with a toric model, and shows $w(X,F)$ is bounded by the polar degrees of the underlying variety. The authors derive explicit polar-degree formulas for rational normal scrolls and a range of graphical models (star trees, the binary 4-path, the binary 4-cycle, and no-three-way interaction models) and present an algorithm to compute Wasserstein degrees without Lagrange multipliers. Computational experiments, implemented in SageMath and Macaulay2, demonstrate that $w(X,F)$ is often smaller than the corresponding polar degrees, offering a practical handle on the complexity of Wasserstein-distance computations in discrete settings and informing distance-based estimation and learning tasks.
Abstract
The study of the closest point(s) on a statistical model from a given distribution in the probability simplex with respect to a fixed Wasserstein metric gives rise to a polyhedral norm distance optimization problem. There are two components to the complexity of determining the Wasserstein distance from a data point to a model. One is the combinatorial complexity that is governed by the combinatorics of the Lipschitz polytope of the finite metric to be used. Another is the algebraic complexity, which is governed by the polar degrees of the Zariski closure of the model. We find formulas for the polar degrees of rational normal scrolls and graphical models whose underlying graphs are star trees. Also, the polar degrees of the graphical models with four binary random variables where the graphs are a path on four vertices and the four-cycle, as well as for small, no-three-way interaction models, were computed. We investigate the algebraic degree of computing the Wasserstein distance to a small subset of these models. It was observed that this algebraic degree is typically smaller than the corresponding polar degree.
