Improving Neural Optimal Transport via Displacement Interpolation
Jaemoo Choi, Yongxin Chen, Jaewoong Choi
TL;DR
The paper addresses instability in neural optimal transport map learning by leveraging displacement interpolation from dynamic OT theory. It derives a dual formulation for displacement interpolation and links time-evolving interpolation potentials through a Hamilton–Jacobi–Bellman condition, enabling training over the full interpolation trajectory. The proposed method, DIOTM, parametrizes intermediate maps via boundary maps and introduces an HJB-based regularizer to achieve stable, accurate OT map estimation, demonstrated on synthetic maps and image-to-image translation benchmarks with strong performance and stability. Limitations include the quadratic cost restriction and the need for bidirectional transport maps, suggesting avenues for extending the framework to more general costs and applications.
Abstract
Optimal Transport (OT) theory investigates the cost-minimizing transport map that moves a source distribution to a target distribution. Recently, several approaches have emerged for learning the optimal transport map for a given cost function using neural networks. We refer to these approaches as the OT Map. OT Map provides a powerful tool for diverse machine learning tasks, such as generative modeling and unpaired image-to-image translation. However, existing methods that utilize max-min optimization often experience training instability and sensitivity to hyperparameters. In this paper, we propose a novel method to improve stability and achieve a better approximation of the OT Map by exploiting displacement interpolation, dubbed Displacement Interpolation Optimal Transport Model (DIOTM). We derive the dual formulation of displacement interpolation at specific time $t$ and prove how these dual problems are related across time. This result allows us to utilize the entire trajectory of displacement interpolation in learning the OT Map. Our method improves the training stability and achieves superior results in estimating optimal transport maps. We demonstrate that DIOTM outperforms existing OT-based models on image-to-image translation tasks.
