Statistical algorithms for low-frequency diffusion data: A PDE approach
Matteo Giordano, Sven Wang
TL;DR
The paper introduces a PDE-based framework for nonparametric inference of multi-dimensional diffusion coefficients from low-frequency data by treating transition densities as heat-kernel solutions to the Fokker-Planck equation. A central advance is the closed-form gradient of the likelihood, derived via a variation-of-constants approach and expressed in a spectral basis of the elliptic generator, which reduces computation to finite-element-based elliptic eigenvalue problems. This enables efficient likelihood evaluation and gradient calculations, supporting both gradient-based (ULA, MAP) and gradient-free (pCN) posterior inference under Gaussian process priors, with extensive simulations demonstrating accurate conductivity recovery. The work also outlines extensions to more general reversible models with potential energy, discusses gradient-stability-based complexity considerations, and connects to small-time asymptotics and diffusion-generative modeling, highlighting broad methodological and practical implications. Overall, the PDE perspective provides scalable, principled tools for nonparametric diffusion inference in low-frequency regimes where traditional likelihood methods struggle.
Abstract
We consider the problem of making nonparametric inference in a class of multi-dimensional diffusions in divergence form, from low-frequency data. Statistical analysis in this setting is notoriously challenging due to the intractability of the likelihood and its gradient, and computational methods have thus far largely resorted to expensive simulation-based techniques. In this article, we propose a new computational approach which is motivated by PDE theory and is built around the characterisation of the transition densities as solutions of the associated heat (Fokker-Planck) equation. Employing optimal regularity results from the theory of parabolic PDEs, we prove a novel characterisation for the gradient of the likelihood. Using these developments, for the nonlinear inverse problem of recovering the diffusivity, we then show that the numerical evaluation of the likelihood and its gradient can be reduced to standard elliptic eigenvalue problems, solvable by powerful finite element methods. This enables the efficient implementation of a large class of popular statistical algorithms, including (i) preconditioned Crank-Nicolson and Langevin-type methods for posterior sampling, and (ii) gradient-based descent optimisation schemes to compute maximum likelihood and maximum-a-posteriori estimates. We showcase the effectiveness of these methods via extensive simulation studies in a nonparametric Bayesian model with Gaussian process priors, in which both the proposed optimisation and sampling schemes provide good numerical recovery. The reproducible code is available online at https://github.com/MattGiord/LF-Diffusion.
