Expert-Guided POMDP Learning for Data-Efficient Modeling in Healthcare
Marco Locatelli, Arjen Hommersom, Roberto Clemens Cerioli, Daniela Besozzi, Fabio Stella
TL;DR
The paper tackles learning POMDP parameters from limited healthcare data by introducing Fuzzy MAP EM, which injects expert knowledge into the M-step of EM as fuzzy pseudo-counts, turning the update into a MAP estimate. Takagi-Sugeno fuzzy models encode the domain rules and produce the pseudo-counts $N_T^{\text{fuzzy}}$ and $N_O^{\text{fuzzy}}$, which regularize the parameter updates through the hyperparameters $\lambda_T,\lambda_O$; the resulting augmented counts $\tilde{N}_T$ and $\tilde{N}_O$, together with related sufficient statistics, drive the updates. On synthetic data, the method improves transition accuracy and observation fit over standard EM in low-data and high-noise regimes. A Myasthenia Gravis case study recovers a clinically coherent two-state POMDP and captures the drug effect of Ravulizumab, pointing to data-efficient decision support in rare diseases, where data are scarce and noisy.
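The pseudo-count idea can be illustrated with a minimal sketch. It assumes the augmented counts take the additive form $\tilde{N} = N + \lambda N^{\text{fuzzy}}$ followed by row normalization, which this summary does not spell out; the function name `map_m_step`, the array shapes, and all numbers are hypothetical and are not taken from the paper.

```python
import numpy as np

def map_m_step(expected_counts, fuzzy_counts, lam):
    """Normalize expected counts augmented with weighted fuzzy pseudo-counts.

    ASSUMPTION: augmented counts are N + lam * N_fuzzy (additive form).
    expected_counts: E-step expected counts; the last axis indexes the outcome
                     (next state for transitions, symbol for observations).
    fuzzy_counts:    expert-derived pseudo-counts with the same shape.
    lam:             non-negative weight (plays the role of lambda_T or lambda_O).
    """
    expected_counts = np.asarray(expected_counts, dtype=float)
    fuzzy_counts = np.asarray(fuzzy_counts, dtype=float)
    augmented = expected_counts + lam * fuzzy_counts
    totals = augmented.sum(axis=-1, keepdims=True)
    # Avoid division by zero: rows with no mass fall back to a uniform distribution.
    safe_totals = np.where(totals > 0, totals, 1.0)
    uniform = np.full_like(augmented, 1.0 / augmented.shape[-1])
    return np.where(totals > 0, augmented / safe_totals, uniform)

# Hypothetical toy example: 2 hidden states, 1 action, shape (state, action, next_state).
N_T = np.array([[[3.0, 1.0]],    # state 0 was visited in the data
                [[0.0, 0.0]]])   # state 1 was never visited
N_T_fuzzy = np.array([[[1.0, 1.0]],
                      [[1.0, 3.0]]])  # made-up expert pseudo-counts

T_em = map_m_step(N_T, N_T_fuzzy, lam=0.0)   # lam = 0 reduces to plain normalized counts
T_map = map_m_step(N_T, N_T_fuzzy, lam=2.0)  # expert pseudo-counts weighted by lam = 2
```

With `lam=0.0` the unvisited state has no evidence at all and the sketch falls back to a uniform row, whereas with `lam=2.0` the pseudo-counts alone determine that row. This mirrors the low-data setting the method targets: the expert term matters most where the expected counts are small.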
Abstract
Learning the parameters of Partially Observable Markov Decision Processes (POMDPs) from limited data is a significant challenge. We introduce the Fuzzy MAP EM algorithm, a novel approach that incorporates expert knowledge into the parameter estimation process by enriching the Expectation Maximization (EM) framework with fuzzy pseudo-counts derived from an expert-defined fuzzy model. This integration naturally reformulates the problem as a Maximum A Posteriori (MAP) estimation, effectively guiding learning in environments with limited data. In synthetic medical simulations, our method consistently outperforms the standard EM algorithm under both low-data and high-noise conditions. Furthermore, a case study on Myasthenia Gravis illustrates the ability of the Fuzzy MAP EM algorithm to recover a clinically coherent POMDP, demonstrating its potential as a practical tool for data-efficient modeling in healthcare.
