Arbitrary Distributions Mapping via SyMOT-Flow: A Flow-based Approach Integrating Maximum Mean Discrepancy and Optimal Transport
Zhe Xiong, Qiaoqiao Ding, Xiaoqun Zhang
TL;DR
The paper tackles learning a transformation between two unknown distributions from finite samples by developing SyMOT-Flow, a symmetric flow-based model that leverages maximum mean discrepancy (MMD) and optimal transport (OT) regularization. The method learns an invertible transformation via invertible neural networks (INNs) and imposes a symmetric MMD term together with a transport cost to approximate the OT constraint in the original data space. The authors provide theoretical results linking the relaxed OT objective to the true OT solution through $\Gamma$-convergence and convergence of $\mathrm{OT}_\lambda$ to $\mathrm{OT}$ as $\lambda \to \infty$, ensuring feasibility and stability. Empirically, SyMOT-Flow achieves accurate and interpretable mappings on toy 2D datasets, MNIST/Fashion-MNIST feature spaces, and high-dimensional medical imaging modalities, outperforming baselines in forward/backward consistency and quality of generated samples, with ablations confirming the importance of the symmetric design and OT regularization.
Abstract
Finding a transformation between two unknown probability distributions from finite samples is crucial for modeling complex data distributions and performing tasks such as sample generation, domain adaptation and statistical inference. One powerful framework for such transformations is normalizing flow, which transforms an unknown distribution into a standard normal distribution using an invertible network. In this paper, we introduce a novel model called SyMOT-Flow that trains an invertible transformation by minimizing the symmetric maximum mean discrepancy between samples from two unknown distributions, and an optimal transport cost is incorporated as regularization to obtain a short-distance and interpretable transformation. The resulted transformation leads to more stable and accurate sample generation. Several theoretical results are established for the proposed model and its effectiveness is validated with low-dimensional illustrative examples as well as high-dimensional bi-modality medical image generation through the forward and reverse flows.
