HOT: An Efficient Halpern Accelerating Algorithm for Optimal Transport Problems
Guojun Zhang, Zhexuan Gu, Yancheng Yuan, Defeng Sun
TL;DR
This work tackles the computational bottleneck of the Kantorovich–Wasserstein distance for discrete 2D histograms by solving an equivalent reduced OT model with a Halpern-accelerated method (HOT). HOT achieves an $O(1/\varepsilon)$ iteration complexity for the reduced problem, while a novel linear-time procedure solves the involved linear systems without forming $AA^{\top}$, yielding per-iteration cost $O(N)$ and memory $O(N)$, and overall complexity $O(M^{1.5}/\varepsilon)$ for $M$ supports. A transport-plan recovery method extends the reduced-model solution to the original OT problem, enabling practical use in color transfer and related tasks. Implemented in PyTorch, HOT demonstrates strong empirical performance on large-scale benchmarks (e.g., DOTmark), outperforming Sinkhorn-type, Network Simplex, and ADMM approaches in both speed and memory usage, with substantial gains at higher resolutions and in plan-based applications. The results suggest HOT as a scalable tool for OT-based similarity measures and transport-based applications in computer vision and data analysis, with potential extensions to higher dimensions and Wasserstein barycenters.
Abstract
This paper proposes HOT, an efficient algorithm for solving optimal transport (OT) problems with finite supports. We focus in particular on an efficient implementation of HOT for the case where the supports lie in $\mathbb{R}^2$ and the ground distances are given by the squared $L_2$ norm ($L_2^2$). Specifically, we design a Halpern accelerating algorithm to solve an equivalent reduced model of the discrete OT problem. Moreover, we derive a novel procedure that solves the linear systems arising in the HOT algorithm in linear time. Consequently, we can obtain an $\varepsilon$-approximate solution to the optimal transport problem with $M$ supports in $O(M^{1.5}/\varepsilon)$ flops, which significantly improves on the best-known computational complexity. We further propose an efficient procedure to recover an optimal transport plan for the original OT problem from a solution to the reduced model, thereby overcoming the limitations of the reduced OT model in applications that require the transport plan. We implement the HOT algorithm in PyTorch, and extensive numerical results show that it outperforms existing state-of-the-art algorithms for solving OT problems.
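For readers unfamiliar with Halpern iteration, the following is a minimal sketch of the generic scheme $x_{k+1} = \frac{1}{k+2} x_0 + \frac{k+1}{k+2} T(x_k)$ for a nonexpansive operator $T$. It is not the HOT algorithm: the operator used here (a 90-degree rotation of the plane, whose only fixed point is the origin) and all function names are illustrative assumptions, chosen only to show how anchoring to $x_0$ drives the fixed-point residual $\|x_k - T(x_k)\|$ to zero at an $O(1/k)$ rate, while the plain iteration $x_{k+1} = T(x_k)$ makes no progress on the same operator.

```python
import math


def rotate90(p):
    """Toy nonexpansive operator (assumption): rotate a 2D point by 90 degrees."""
    x, y = p
    return (-y, x)


def residual(T, p):
    """Fixed-point residual ||p - T(p)|| in the Euclidean norm."""
    q = T(p)
    return math.hypot(p[0] - q[0], p[1] - q[1])


def halpern(T, x0, num_iters):
    """Generic Halpern iteration: x_{k+1} = x0/(k+2) + (k+1)/(k+2) * T(x_k)."""
    x = x0
    for k in range(num_iters):
        w = 1.0 / (k + 2)  # weight on the anchor point x0
        tx = T(x)
        x = (w * x0[0] + (1.0 - w) * tx[0], w * x0[1] + (1.0 - w) * tx[1])
    return x


def picard(T, x0, num_iters):
    """Plain fixed-point iteration x_{k+1} = T(x_k), shown for comparison."""
    x = x0
    for _ in range(num_iters):
        x = T(x)
    return x


x0 = (1.0, 0.0)
K = 1000
halpern_res = residual(rotate90, halpern(rotate90, x0, K))
picard_res = residual(rotate90, picard(rotate90, x0, K))
print("Halpern residual after", K, "iterations:", halpern_res)
print("Picard residual after", K, "iterations:", picard_res)
```

In this toy setting the Halpern residual respects the known bound $2\|x_0 - x^*\|/(K+1)$ with $x^* = 0$, whereas the plain iteration only rotates the starting point around the origin and its residual never shrinks.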
