Developing hybrid mechanistic and data-driven personalized prediction models for platelet dynamics
Marie Steinacker, Yuri Kheifetz, Markus Scholz
TL;DR
Chemotherapy-induced thrombocytopenia presents strong inter-patient variability, challenging personalized risk prediction. The authors compare hybrid mechanistic-UDE frameworks based on the Friberg hematopoiesis model with purely data-driven ARX-GRU/NARX approaches across diverse data regimes, using a NHL-B patient dataset. They find that data-driven ARX-GRU predictions excel when data are plentiful, especially for high-toxicity trajectories, while hybrid and mechanistic methods provide robust performance under data sparsity and help preserve physiological plausibility. The work offers a generalizable framework for individualized prediction of treatment-related toxicities and provides public code to extend the approach to other hematopoietic toxicities and interventions.
Abstract
Hematotoxicity, drug-induced damage to the blood-forming system, is a frequent side effect of cytotoxic chemotherapy and poses a significant challenge in clinical practice due to its high inter-patient variability and limited predictability. Current mechanistic models often struggle to accurately forecast outcomes for patients with irregular or atypical trajectories. In this study, we develop and compare hybrid mechanistic and data-driven approaches for individualized time series modeling of platelet counts during chemotherapy. We consider hybrid models that combine mechanistic models with neural networks, known as universal differential equations. As a purely data-driven alternative, we utilize a nonlinear autoregressive exogenous model using gated recurrent units as the underlying architecture. These models are evaluated across a range of real patient scenarios, varying in data availability and sparsity, to assess predictive performance. Our findings demonstrate that data-driven methods, when provided with sufficient data, significantly improve prediction accuracy, particularly for high-risk patients with irregular platelet dynamics. This highlights the potential of data-driven approaches in enhancing clinical decision-making. In contrast, hybrid and mechanistic models are superior in scenarios with limited or sparse data. The proposed modeling and comparison framework is generalizable and could be extended to predict other treatment-related toxicities, offering broad applicability in personalized medicine.
