Learning Elastic Costs to Shape Monge Displacements
Michal Klein, Aram-Alexandre Pooladian, Pierre Ablin, Eugène Ndiaye, Jonathan Niles-Weed, Marco Cuturi
TL;DR
This work extends optimal transport by introducing elastic costs $h(z)=\tfrac{1}{2}\|z\|^2+\gamma\tau(z)$ to shape Monge map displacements via the proximal operator of $\tau$. It provides a practical route to compute OT maps for any elastic cost using the MBO estimator, and introduces a bilevel learning scheme to infer the regularizer parameter $\theta$ that enforces low-dimensional displacement subspaces. By focusing on subspace elastic costs $\tau_{A^\perp}$, the authors establish statistical guarantees, connect to the spiked transport model, and show that the effective estimation rate can depend on the subspace dimension rather than the ambient dimension. Through synthetic and single-cell data experiments, they demonstrate ground-truth map generation, subspace recovery, and improved predictive performance when learning displacement structure, highlighting the method's potential for structured OT in high dimensions.
Abstract
Given a source and a target probability measure supported on $\mathbb{R}^d$, the Monge problem asks to find the most efficient way to map one distribution to the other. This efficiency is quantified by defining a \textit{cost} function between source and target data. Such a cost is often set by default in the machine learning literature to the squared-Euclidean distance, $\ell^2_2(\mathbf{x},\mathbf{y})=\tfrac12|\mathbf{x}-\mathbf{y}|_2^2$. Recently, Cuturi et. al '23 highlighted the benefits of using elastic costs, defined through a regularizer $τ$ as $c(\mathbf{x},\mathbf{y})=\ell^2_2(\mathbf{x},\mathbf{y})+τ(\mathbf{x}-\mathbf{y})$. Such costs shape the \textit{displacements} of Monge maps $T$, i.e., the difference between a source point and its image $T(\mathbf{x})-\mathbf{x})$, by giving them a structure that matches that of the proximal operator of $τ$. In this work, we make two important contributions to the study of elastic costs: (i) For any elastic cost, we propose a numerical method to compute Monge maps that are provably optimal. This provides a much-needed routine to create synthetic problems where the ground truth OT map is known, by analogy to the Brenier theorem, which states that the gradient of any convex potential is always a valid Monge map for the $\ell_2^2$ cost; (ii) We propose a loss to \textit{learn} the parameter $θ$ of a parameterized regularizer $τ_θ$, and apply it in the case where $τ_{A}(\mathbf{z})=|A^\perp \mathbf{z}|^2_2$. This regularizer promotes displacements that lie on a low dimensional subspace of $\mathbb{R}^d$, spanned by the $p$ rows of $A\in\mathbb{R}^{p\times d}$.
