Rethinking Human-AI Collaboration in Complex Medical Decision Making: A Case Study in Sepsis Diagnosis
Shao Zhang, Jianing Yu, Xuhai Xu, Changchang Yin, Yuxuan Lu, Bingsheng Yao, Melanie Tory, Lace M. Padilla, Jeffrey Caterino, Ping Zhang, Dakuo Wang
TL;DR
This work examines the gap between high predictive performance of AI models and their real-world deployment in sepsis diagnosis. It introduces SepsisLab, a human-centered AI system that not only predicts current and near-term sepsis risk but also visualizes uncertainty and recommends actionable lab tests to reduce ambiguity, thereby supporting intermediate decision-making stages such as hypothesis generation and data gathering. Grounded in a formative study with clinicians who critique the existing Epic Sepsis Module, SepsisLab reframes AI as a collaborator rather than a competitor and demonstrates improved perceived collaboration, transparency, and utility. The findings suggest that shifting AI focus to intermediate decision-support tasks can generalize to other high-stakes, time-sensitive domains, offering practical guidance for deploying trustworthy AI-CDSS in complex clinical workflows.
Abstract
Today's AI systems for medical decision support often succeed on benchmark datasets in research papers but fail in real-world deployment. This work focuses on the decision making of sepsis, an acute life-threatening systematic infection that requires an early diagnosis with high uncertainty from the clinician. Our aim is to explore the design requirements for AI systems that can support clinical experts in making better decisions for the early diagnosis of sepsis. The study begins with a formative study investigating why clinical experts abandon an existing AI-powered Sepsis predictive module in their electrical health record (EHR) system. We argue that a human-centered AI system needs to support human experts in the intermediate stages of a medical decision-making process (e.g., generating hypotheses or gathering data), instead of focusing only on the final decision. Therefore, we build SepsisLab based on a state-of-the-art AI algorithm and extend it to predict the future projection of sepsis development, visualize the prediction uncertainty, and propose actionable suggestions (i.e., which additional laboratory tests can be collected) to reduce such uncertainty. Through heuristic evaluation with six clinicians using our prototype system, we demonstrate that SepsisLab enables a promising human-AI collaboration paradigm for the future of AI-assisted sepsis diagnosis and other high-stakes medical decision making.
