Structured Matching via Cost-Regularized Unbalanced Optimal Transport
Emanuele Pardini, Katerina Papagiannouli
TL;DR
The paper tackles aligning measures across heterogeneous spaces when total mass differs and the ground cost is unknown. It introduces cost-regularized unbalanced OT (CR-UOT), a framework that jointly optimizes a transport plan and a convex cost regularizer, and establishes existence and convergence results; it further specializes to inner-product costs (GW-IP) and proves that optimal couplings can be induced by Monge maps under mild conditions. The authors develop entropic-regularized algorithms, including a block-coordinate descent method, and derive entropic maps that converge to deterministic Monge maps in appropriate regimes. Empirically, CR-UOT improves cross-modality alignment in single-cell multiomics (scGEM and SNAREseq), particularly when data are unbalanced or lack direct one-to-one correspondences, demonstrating practical impact for heterogeneous biological data integration.
Abstract
Unbalanced optimal transport (UOT) provides a flexible way to match or compare nonnegative finite Radon measures. However, UOT requires a predefined ground transport cost, which may misrepresent the data's underlying geometry. Choosing such a cost is particularly challenging when datasets live in heterogeneous spaces, often motivating practitioners to adopt Gromov-Wasserstein formulations. To address this challenge, we introduce cost-regularized unbalanced optimal transport (CR-UOT), a framework that allows the ground cost to vary while allowing mass creation and removal. We show that CR-UOT incorporates unbalanced Gromov-Wasserstein type problems through families of inner-product costs parameterized by linear transformations, enabling the matching of measures or point clouds across Euclidean spaces. We develop algorithms for such CR-UOT problems using entropic regularization and demonstrate that this approach improves the alignment of heterogeneous single-cell omics profiles, especially when many cells lack direct matches.
