Sampling in Unit Time with Kernel Fisher-Rao Flow
Aimee Maurais, Youssef Marzouk
TL;DR
We address sampling from an unnormalized target $\pi_1$ using a gradient-free, finite-time transport from a reference $\pi_0$ by evolving along the unit-time geometric path $\pi_t \propto \pi_0^{1-t}\pi_1^t$. The core contribution is a mean-field ODE for a velocity field $v_t$ obtained by solving a Poisson equation in an RKHS, yielding a kernelized, gradient-free interacting particle system (KFRFlow) with discrete-time (KFRFlow-I) and stochastic (KFRD) variants that transport samples toward $\pi_1$. The approach unifies a discrete-time sample-driven optimal transport view with a continuous Fisher--Rao gradient flow perspective, and empirical results demonstrate strong performance against gradient-free baselines and competitiveness with gradient-based methods, especially in higher dimensions. This framework provides a scalable, gradient-free tool for challenging sampling tasks under unnormalized densities, with potential extensions to kernel acceleration, dimension reduction, and adaptive time stepping.
Abstract
We introduce a new mean-field ODE and corresponding interacting particle systems (IPS) for sampling from an unnormalized target density. The IPS are gradient-free, available in closed form, and only require the ability to sample from a reference density and compute the (unnormalized) target-to-reference density ratio. The mean-field ODE is obtained by solving a Poisson equation for a velocity field that transports samples along the geometric mixture of the two densities, which is the path of a particular Fisher-Rao gradient flow. We employ a RKHS ansatz for the velocity field, which makes the Poisson equation tractable and enables discretization of the resulting mean-field ODE over finite samples. The mean-field ODE can be additionally be derived from a discrete-time perspective as the limit of successive linearizations of the Monge-Ampère equations within a framework known as sample-driven optimal transport. We introduce a stochastic variant of our approach and demonstrate empirically that our IPS can produce high-quality samples from varied target distributions, outperforming comparable gradient-free particle systems and competitive with gradient-based alternatives.
