Overcoming Spurious Solutions in Semi-Dual Neural Optimal Transport: A Smoothing Approach for Learning the Optimal Transport Plan
Jaemoo Choi, Jaewoong Choi, Dohyun Kwon
TL;DR
This work addresses spurious solutions in semi-dual neural optimal transport (SNOT) by proving a sufficient condition on the source measure that guarantees recovery of the true OT map from the max-min solution. When the condition fails, it introduces OTP, a smoothing-based method that learns the full OT plan (potentially stochastic) by gradually annealing the smoothed distribution back to the original, with convergence guarantees along subsequences. The approach yields accurate OT plans in failure cases and delivers state-of-the-art results in unpaired image-to-image translation, including stochastic colorization tasks where deterministic maps fall short. The results underscore the practical value of learning transport plans and provide a pathway to robust neural OT methods in settings with inherent stochasticity and complex support structures.
Abstract
We address the convergence problem in learning the Optimal Transport (OT) map, where the OT Map refers to a map from one distribution to another while minimizing the transport cost. Semi-dual Neural OT, a widely used approach for learning OT Maps with neural networks, often generates spurious solutions that fail to transfer one distribution to another accurately. We identify a sufficient condition under which the max-min solution of Semi-dual Neural OT recovers the true OT Map. Moreover, to address cases when this sufficient condition is not satisfied, we propose a novel method, OTP, which learns both the OT Map and the Optimal Transport Plan, representing the optimal coupling between two distributions. Under sharp assumptions on the distributions, we prove that our model eliminates the spurious solution issue and correctly solves the OT problem. Our experiments show that the OTP model recovers the optimal transport map where existing methods fail and outperforms current OT-based models in image-to-image translation tasks. Notably, the OTP model can learn stochastic transport maps when deterministic OT Maps do not exist, such as one-to-many tasks like colorization.
