Diffusion Tempering Improves Parameter Estimation with Probabilistic Integrators for Ordinary Differential Equations
Jonas Beck, Nathanael Bosch, Michael Deistler, Kyra L. Kadhim, Jakob H. Macke, Philipp Hennig, Philipp Berens
TL;DR
The paper tackles the problem of gradient-based parameter estimation in nonlinear ODEs, where local minima and sensitivity to initialization hinder reliable optimization. It introduces diffusion tempering, a schedule-based regularization for probabilistic ODE solvers that starts with a high diffusion $\kappa$ to smooth the loss surface and progressively lowers $\kappa$ to emphasize the true IVP solution, thereby guiding optimization toward the global optimum. Empirical results on a simple pendulum and the Hodgkin–Huxley model show that diffusion tempering substantially improves convergence and parameter recovery over traditional RK-based methods and prior Fenrir approaches, including high-parameter HH settings. The approach offers a principled pathway to gradient-based, data-efficient parameter inference in complex dynamical systems, with practical impact for scientific modeling and systems biology.
Abstract
Ordinary differential equations (ODEs) are widely used to describe dynamical systems in science, but identifying parameters that explain experimental measurements is challenging. In particular, although ODEs are differentiable and would allow for gradient-based parameter optimization, the nonlinear dynamics of ODEs often lead to many local minima and extreme sensitivity to initial conditions. We therefore propose diffusion tempering, a novel regularization technique for probabilistic numerical methods which improves convergence of gradient-based parameter optimization in ODEs. By iteratively reducing a noise parameter of the probabilistic integrator, the proposed method converges more reliably to the true parameters. We demonstrate that our method is effective for dynamical systems of different complexity and show that it obtains reliable parameter estimates for a Hodgkin-Huxley model with a practically relevant number of parameters.
