How Consistent are Clinicians? Evaluating the Predictability of Sepsis Disease Progression with Dynamics Models
Unnseo Park, Venkatesh Sivaraman, Adam Perer
TL;DR
The paper investigates whether clinician actions are diverse enough to influence sepsis progression and whether action information improves offline predictive models. Using transformer-based dynamics models trained on MIMIC-IV and eICU data, they assess predictions of future disease severity with and without future action inputs. They find that incorporating action information does not materially improve model fit, suggesting limited action-diversity signals in this dataset and that action-prediction analyses show some predictability but not enough to explain outcome differences. The work highlights the need for richer, clinically-informed action representations and diverse data sources to enable reliable RL-based sepsis optimization.
Abstract
Reinforcement learning (RL) is a promising approach to generate treatment policies for sepsis patients in intensive care. While retrospective evaluation metrics show decreased mortality when these policies are followed, studies with clinicians suggest their recommendations are often spurious. We propose that these shortcomings may be due to lack of diversity in observed actions and outcomes in the training data, and we construct experiments to investigate the feasibility of predicting sepsis disease severity changes due to clinician actions. Preliminary results suggest incorporating action information does not significantly improve model performance, indicating that clinician actions may not be sufficiently variable to yield measurable effects on disease progression. We discuss the implications of these findings for optimizing sepsis treatment.
