Optimal control of stochastic reaction networks with entropic control cost and emergence of mode-switching strategies
Shuhei A. Horiguchi, Tetsuya J. Kobayashi
TL;DR
This work develops a unified framework for the optimal control of stochastic reaction networks with discrete, nonnegative counts and absorbing states, using $f$-divergence–based costs and, in particular, KL divergence to linearize the HJB via the Cole–Hopf transformation. It derives a time-dependent, state-feedback controller $k^{\dagger}_r(t,n,\beta)=k^0_r\exp(\overline{\nabla}_{s_r}V_t(n,\beta))$ and a linear backward equation for $Z_t(n,\beta)=\exp(V_t(n,\beta))$, enabling efficient computation and probabilistic interpretation through the Feynman–Kac representation. The framework is demonstrated on interacting random walkers, Moran processes, and SIR epidemic models, exposing mode-switching behavior in optimal controls and showing that a finite per-time control cost can prevent extinction in Moran-type dynamics. By connecting control with path-measure optimization, the method offers scalable, extinction-aware strategies with broad applicability to RN-based biology and epidemiology, while outlining directions for risk-sensitive extensions and partially observed settings.
Abstract
Controlling the stochastic dynamics of biological populations is a challenge that arises across various biological contexts. However, these dynamics are inherently nonlinear and involve a discrete state space, i.e., the number of molecules, cells, or organisms. Additionally, the possibility of extinction has a significant impact on both dynamics and control strategies, particularly when the population size is small. These factors hamper the direct application of conventional control theories to biological systems. To address these challenges, we formulate the optimal control problem for stochastic population dynamics by utilizing control cost functions based on the f-divergence, which naturally accounts for population-specific factors. If Kullback-Leibler (KL) divergence is adopted for the cost function, the complex nonlinear Hamilton-Jacobi-Bellman equation is simplified into a linear form, facilitating efficient computation of optimal solutions. We demonstrate the effectiveness of our approach by applying it to the control of interacting random walkers, Moran processes, and SIR models, and observe the mode-switching phenomena in the control strategies. Our approach provides new opportunities for applying control theory to a wide range of biological problems.
