Momentum Multi-Marginal Schrödinger Bridge Matching
Panagiotis Theodoropoulos, Augustinos D. Saravanos, Evangelos A. Theodorou, Guan-Horng Liu
TL;DR
3MSBM addresses the challenge of reconstructing trajectories from sparse multi-snapshot data by lifting dynamics to phase space and solving a multi-marginal Schrödinger bridge. It introduces a two-stage, simulation-free training scheme: (i) compute a conditional multi-marginal bridge (3MBB) enforcing multiple positional constraints, and (ii) learn a drift a_t^\theta that matches this path while preserving all marginals, enabling scalable high-dimensional trajectory interpolation. The framework provides closed-form conditional accelerations with convergence guarantees to the unique mmSB, and demonstrates superior temporal coherence and marginal fidelity across LV, Gulf of Mexico, Beijing air quality, and single-cell datasets compared to state-of-the-art baselines. These results underscore 3MSBM's potential for robust dynamic interpolation in domains with sparse temporal observations, from biology to geoscience, while highlighting future work on dense image/video settings.
Abstract
Understanding complex systems by inferring trajectories from sparse sample snapshots is a fundamental challenge in a wide range of domains, e.g., single-cell biology, meteorology, and economics. Despite advancements in Bridge and Flow matching frameworks, current methodologies rely on pairwise interpolation between adjacent snapshots. This hinders their ability to capture long-range temporal dependencies and potentially affects the coherence of the inferred trajectories. To address these issues, we introduce \textbf{Momentum Multi-Marginal Schrödinger Bridge Matching (3MSBM)}, a novel matching framework that learns smooth measure-valued splines for stochastic systems that satisfy multiple positional constraints. This is achieved by lifting the dynamics to phase space and generalizing stochastic bridges to be conditioned on several points, forming a multi-marginal conditional stochastic optimal control problem. The underlying dynamics are then learned by minimizing a variational objective, having fixed the path induced by the multi-marginal conditional bridge. As a matching approach, 3MSBM learns transport maps that preserve intermediate marginals throughout training, significantly improving convergence and scalability. Extensive experimentation in a series of real-world applications validates the superior performance of 3MSBM compared to existing methods in capturing complex dynamics with temporal dependencies, opening new avenues for training matching frameworks in multi-marginal settings.
