An operator splitting analysis of Wasserstein--Fisher--Rao gradient flows
Francesca Romana Crucinio, Sahani Pathiraja
TL;DR
The paper analyzes Wasserstein-Fisher-Rao gradient flows for sampling, focusing on exact operator splitting of the W and FR components. It derives variational formulae for two canonical splitting orders (W-FR and FR-W), establishes Gaussian-case covariances and KL decay analyses, and proves log-concavity preservation for the WFR flow under suitable conditions. A key finding is that, with an appropriate step size, splitting can accelerate convergence to the target distribution beyond the exact WFR dynamics, as quantified by KL and Jeffreys divergence rates; the improvement depends on initialization, target geometry, and operator ordering. The results provide theoretical guidance for designing split-flow samplers that exploit splitting bias to achieve faster convergence in practice, especially in strongly log-concave settings.
Abstract
Wasserstein-Fisher-Rao (WFR) gradient flows have been recently proposed as a powerful sampling tool that combines the advantages of pure Wasserstein (W) and pure Fisher-Rao (FR) gradient flows. Existing algorithmic developments implicitly make use of operator splitting techniques to numerically approximate the WFR partial differential equation, whereby the W flow is evaluated over a given step size and then the FR flow (or vice versa). This works investigates the impact of the order in which the W and FR operator are evaluated and aims to provide a quantitative analysis. Somewhat surprisingly, we show that with a judicious choice of step size and operator ordering, the split scheme can converge to the target distribution faster than the exact WFR flow (in terms of model time). We obtain variational formulae describing the evolution over one time step of both sequential splitting schemes and investigate in which settings the W-FR split should be preferred to the FR-W split. As a step towards this goal we show that the WFR gradient flow preserves log-concavity and obtain the first sharp decay bound for WFR.
