Sequential Monte Carlo approximations of Wasserstein--Fisher--Rao gradient flows
Francesca R. Crucinio, Sahani Pathiraja
TL;DR
The paper tackles efficient sampling from a target distribution $\\pi$ by casting it as minimizing $\\mathrm{KL}(\\mu||\\pi)$ and leveraging gradient-flow structures in Wasserstein, Fisher--Rao, and their combination (WFR). It introduces a stable, sampling-based approximation to the WFR flow by alternating a Wasserstein-like diffusion with a Fisher--Rao reweighting step, implemented via an SMC framework that yields convergence guarantees (Prop. is_wfr and Prop. lp). The authors connect SMC samplers to FR-type flows, explore tempering-based variants, and derive computationally favorable approximations (SMC-ULA, SMC-MALA) while highlighting the trade-offs. Extensive experiments on multimodal and high-dimensional targets show that the proposed SMC-WFR method can outperform birth-death Langevin dynamics and other Monte Carlo methods in convergence speed and robustness, providing practical guidelines for when to deploy WFR-based sampling.
Abstract
We consider the problem of sampling from a probability distribution $π$. It is well known that this can be written as an optimisation problem over the space of probability distribution in which we aim to minimise the Kullback--Leibler divergence from $π$. We consider several partial differential equations (PDEs) whose solution is a minimiser of the Kullback--Leibler divergence from $π$ and connect them to well-known Monte Carlo algorithms. We focus in particular on PDEs obtained by considering the Wasserstein--Fisher--Rao geometry over the space of probabilities and show that these lead to a natural implementation using importance sampling and sequential Monte Carlo. We propose a novel algorithm to approximate the Wasserstein--Fisher--Rao flow of the Kullback--Leibler divergence and conduct an extensive empirical study to identify when these algorithms outperforms other popular Monte Carlo algorithms.
