On the numerical integration of the Fokker-Planck equation driven by a mechanical force and the Bismut-Elworthy-Li formula
Julia Sanders, Paolo Muratore-Ginanneschi
TL;DR
This work develops and validates numerical methods for two key PDEs in stochastic optimal control: the Fokker-Planck equation with a time-dependent mechanical potential and the Hamilton-Jacobi-Bellman equation for the value function. It combines a Girsanov-based Monte Carlo approach to FP with backward diffusion and a Bismut-Elworthy-Li gradient representation to compute optimal protocols, including extensions to degenerate diffusion. The paper demonstrates analytic and numerical examples, including Schrödinger bridge problems and a machine-learning-inspired prototype that solves for optimal drifts without spatial discretization, and confirms consistency with existing iterative methods. The methods offer scalable, high-dimensional alternatives to traditional spatial discretization, with demonstrated speedups and applicability to nanoscale thermodynamics and diffusion-model-based learning tasks.
Abstract
Optimal control theory aims to find an optimal protocol to steer a system between assigned boundary conditions while minimizing a given cost functional in finite time. Equations arising from these types of problems are often non-linear and difficult to solve numerically. In this note, we describe numerical methods of integration for two partial differential equations that commonly arise in optimal control theory: the Fokker-Planck equation driven by a mechanical potential for which we use Girsanov theorem; and the Hamilton-Jacobi-Bellman, or dynamic programming, equation for which we find the gradient of its solution using the Bismut-Elworthy-Li formula. The computation of the gradient is necessary to specify the optimal protocol. Finally, we give an example application of the numerical techniques to solving an optimal control problem without spacial discretization using machine learning.
