Entropic Matching for Expectation Propagation of Markov Jump Processes
Yannick Eich, Bastian Alt, Heinz Koeppl
TL;DR
The paper tackles latent-state inference for continuous-time Markov jump processes by marrying entropic matching with expectation propagation, enabling tractable, closed-form updates for a practical variational family. By applying this framework to chemical reaction networks with mass-action kinetics and a linear-Gaussian observation model, the authors derive FFBS-like dynamics for filtering and smoothing and integrate EP site updates to tighten posterior approximations, along with an EM-style scheme for parameter learning. Empirical results on Lotka–Volterra, motility, gene transcription, and enzyme-kinetics CRNs show improved accuracy of posterior means over several baselines and demonstrate scalability relative to particle-based methods. The approach provides a principled, fast alternative for complex continuous-time Bayesian inference in high-dimensional CRN models, while acknowledging limitations of the product-Poisson approximation and suggesting avenues for more expressive variational families and broader MJPs in future work.
Abstract
We propose a novel, tractable latent state inference scheme for Markov jump processes, for which exact inference is often intractable. Our approach is based on an entropic matching framework that can be embedded into the well-known expectation propagation algorithm. We demonstrate the effectiveness of our method by providing closed-form results for a simple family of approximate distributions and apply it to the general class of chemical reaction networks, which are a crucial tool for modeling in systems biology. Moreover, we derive closed-form expressions for point estimation of the underlying parameters using an approximate expectation maximization procedure. We evaluate our method across various chemical reaction networks and compare it to multiple baseline approaches, demonstrating superior performance in approximating the mean of the posterior process. Finally, we discuss the limitations of our method and potential avenues for future improvement, highlighting its promising direction for addressing complex continuous-time Bayesian inference problems.
