Adapted optimal transport between Gaussian processes in discrete time
Madhu Gunasingam, Ting-Kam Leonard Wong
TL;DR
This work derives an explicit adapted 2-W Wasserstein distance between non-degenerate Gaussian measures in discrete time under time-respecting (bicausal) constraints, and provides a complete characterization of the optimal couplings. The main result expresses AW_2^2 as the sum of the squared distance between means and an adapted Bures-Wasserstein distance between covariances: AW_2^2(mu,nu) = ||a-b||^2 + d_ABW^2(A,B), with d_ABW^2(A,B) = trace(A) + trace(B) - 2 || diag(L^T M) ||_1. The authors develop a dynamic programming principle to reduce the problem to 1D conditional OT problems, identify the sign of diag(L^T M) as guiding the optimal coupling structure, and connect the adapted geometry to Knothe-Rosenblatt transport, including a detailed discussion of when KR is AW_2-optimal. These results illuminate the geometry of time-respecting transport for Gaussian processes and provide tools for applications in filtering, robust optimization, and stochastic control under adapted metrics; extensions to multivariate processes with entropic regularization and continuous-time settings are highlighted as future directions.
Abstract
We derive explicitly the adapted $2$-Wasserstein distance between non-degenerate Gaussian distributions on $\mathbb{R}^N$ and characterize the optimal bicausal coupling(s). This leads to an adapted version of the Bures-Wasserstein distance on the space of positive definite matrices.
