Measure estimation on a manifold explored by a diffusion process
Vincent Divol, Hélène Guérin, Dinh-Toan Nguyen, Viet Chi Tran
TL;DR
This work addresses estimating the invariant measure of a diffusion on a compact manifold from a single diffusion path. It introduces a kernel-smoothed occupation measure estimator $\widehat{\mu}_{T,h}$ and proves nonasymptotic upper bounds in the Wasserstein-2 metric that balance a variance term (scaling like $h^{4-d}/T$ for $d\ge5$) and a bias term tied to the density's Sobolev regularity $\ell$. When the stationary density $p$ lies in Sobolev space $H^\ell(\mathcal{M})$ with $\ell\ge2$, choosing $h\asymp T^{-1/(2\ell+d-2)}$ yields an overall rate of $\mathcal{O}\big(T^{-(2\ell+2)/(2\ell+d-2)}\big)$, which is minimax optimal in this regime. A minimax lower bound shows these rates are optimal for $d\ge5$ (with a slower rate for $d\le4$ where the empirical occupation measure can be competitive). The analysis hinges on elliptic operator theory on manifolds (Laplace–Beltrami, Green function), ultracontractivity, and a careful variance-bias decomposition, with results that extend beyond Langevin diffusions to a broad class of uniformly elliptic generators $\mathcal{A}$. This provides a principled, nonparametric route to density estimation on manifolds from diffusion data, with explicit finite-sample guarantees.
Abstract
From the observation of a diffusion path $(X_t)_{t\in [0,T]}$ on a compact connected $d$-dimensional manifold $\mathcal{M}$ without boundary, we consider the problem of estimating the stationary measure $μ$ of the process. Wang and Zhu (2023) showed that for the Wasserstein metric $\mathcal{W}_2$ and for $d\geq 5$, the convergence rate of $T^{-1/(d-2)}$ is attained by the occupation measure of the path $(X_t)_{t\in [0,T]}$ when $(X_t)_{t\in [0,T]}$ is a Langevin diffusion. We extend their result in several directions. First, we show that the rate of convergence holds for a large class of diffusion paths, whose generators are uniformly elliptic. Second, the regularity of the density $p$ of the stationary measure $μ$ with respect to the volume measure of $\mathcal{M}$ can be leveraged to obtain faster estimators: when $p$ belongs to a Sobolev space of order $\ell\geq 2$, smoothing the occupation measure by convolution with a kernel yields an estimator whose rate of convergence is of order $T^{-(\ell+1)/(2\ell+d-2)}$. We further show that this rate is the minimax rate of estimation for this problem.
