Diversified Flow Matching with Translation Identifiability
Sagar Shrestha, Xiao Fu
TL;DR
This work addresses content misalignment in unpaired domain translation by enforcing translation identifiability via diversified distribution matching (DDM). It introduces diversified flow matching (DFM), an ODE-based flow matching method that uses a bilevel learning loss and private nonlinear interpolants to guarantee DDM identifiability, while also producing explicit transport trajectories. A non-overlapping support assumption enables a practical, two-stage implementation that avoids the computational burden of a full bilevel optimization. Experiments on synthetic data, unpaired image translation, and swarm navigation demonstrate that DFM achieves translation identifiability, superior trajectory consistency, and robust performance compared to GAN- and diffusion-based baselines, with potential for trajectory-aware domain transfer in real-world applications.
Abstract
Diversified distribution matching (DDM) finds a unified translation function mapping a diverse collection of conditional source distributions to their target counterparts. DDM was proposed to resolve content misalignment issues in unpaired domain translation, achieving translation identifiability. However, DDM has only been implemented using GANs due to its constraints on the translation function. GANs are often unstable to train and do not provide transport trajectory information, yet such trajectories are useful in applications such as single-cell evolution analysis and robot route planning. This work introduces diversified flow matching (DFM), an ODE-based framework for DDM. Adapting flow matching (FM) to enforce a unified translation function as in DDM is challenging, as FM learns the translation function's velocity rather than the translation function itself. A custom bilevel optimization-based training loss, a nonlinear interpolant, and a structural reformulation are proposed to address these challenges, yielding a practical implementation. To our knowledge, DFM is the first ODE-based approach guaranteeing translation identifiability. Experiments on synthetic and real-world datasets validate the proposed method.
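To make concrete the remark that FM learns a velocity rather than the translation function itself, the following is a minimal sketch of generic flow matching on a 1D toy problem. It is not the DFM method: it uses a plain linear interpolant, a single source/target pair, and no bilevel loss. The Gaussian source and target, the per-time-step linear regression standing in for a neural velocity network, and the names `fit_velocity` and `translate` are all assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)
n, steps = 20000, 100

# Unpaired toy samples (assumed): source ~ N(0, 1), target ~ N(3, 1).
x0 = rng.normal(0.0, 1.0, n)
x1 = rng.normal(3.0, 1.0, n)

# Time grid for the Euler solver: t = 0, 1/steps, ..., (steps - 1)/steps.
ts = np.arange(steps) / steps

def fit_velocity(t):
    """Regress the interpolant's time derivative on its position at time t."""
    x_t = (1.0 - t) * x0 + t * x1   # linear interpolant between the two samples
    target = x1 - x0                # d/dt of the linear interpolant
    slope, intercept = np.polyfit(x_t, target, 1)  # v(x, t) ~ slope * x + intercept
    return slope, intercept

# What is learned is a velocity field, one linear model per time step.
coeffs = [fit_velocity(t) for t in ts]

def translate(x):
    """Recover the translation by Euler-integrating the learned velocity."""
    x = np.array(x, dtype=float)
    traj = [x.copy()]
    for slope, intercept in coeffs:
        x = x + (slope * x + intercept) / steps
        traj.append(x.copy())
    return x, np.stack(traj)

# Push fresh source samples through the ODE; traj holds the transport paths.
y, traj = translate(rng.normal(0.0, 1.0, 5000))
print(y.mean(), y.std(), traj.shape)
```

In this sketch the translation function never appears as a trained object. It exists only as the solution map of the ODE, which is why constraining it directly, as DDM requires, is not straightforward. The intermediate states in `traj` are the transport trajectories that a GAN-based translator does not expose.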
