A numerical algorithm with linear complexity for Multi-marginal Optimal Transport with $L^1$ Cost
Chunhui Chen, Jing Chen, Baojia Luo, Shi Jin, Hao Wu
TL;DR
The authors address the prohibitive cost of solving multi-marginal optimal transport with $L^1$ cost by developing a linear-time algorithm that replaces dense tensor-vector products with recursive, dynamic-programming based computations. By exploiting the kernel structure $K_{ijk}=e^{-c_{ijk}/oldsymbol{ε}}$ and partitioning index triples into six order-sets to remove absolute values, they derive two fast tensor-vector solvers (FTVP-1 and FTVP-2) and integrate them into a fast Sinkhorn framework (FS-3), with an optional log-domain stabilized variant (FTVP-LOG). The method extends naturally to higher dimensions and arbitrary numbers of marginals ($l$-marginal MMOT), preserving $O(N)$ complexity per update, and is validated through extensive 1D, 2D, and image-matching experiments showing substantial speedups while maintaining accuracy. Overall, this work enables scalable MMOT computations in practical settings such as image processing, tomography, and machine learning by reducing the dominant bottleneck from $O(N^l)$ to $O(N)$.
Abstract
Numerically solving multi-marginal optimal transport (MMOT) problems is computationally prohibitive, even for moderate-scale instances involving $l\ge4$ marginals with support sizes of $N\ge1000$. The cost in MMOT is represented as a tensor with $N^l$ elements. Even accessing each element once incurs a significant computational burden. In fact, many algorithms require direct computation of tensor-vector products, leading to a computational complexity of $O(N^l)$ or beyond. In this paper, inspired by our previous work [$Comm. \ Math. \ Sci.$, 20 (2022), pp. 2053 - 2057], we observe that the costly tensor-vector products in the Sinkhorn Algorithm can be computed with a recursive process by separating summations and dynamic programming. Based on this idea, we propose a fast tensor-vector product algorithm to solve the MMOT problem with $L^1$ cost, achieving a miraculous reduction in the computational cost of the entropy regularized solution to $O(N)$. Numerical experiment results confirm such high performance of this novel method which can be several orders of magnitude faster than the original Sinkhorn algorithm.
