Two-step interpretable modeling of Intensive Care Acquired Infections
Giacomo Lancia, Meri Varkila, Olaf Cremer, Cristian Spitoni
TL;DR
The paper tackles dynamic prediction of ICU-acquired infections with interpretability by integrating minute-level EHR signals into a landmarking competing risks framework (LM-CR). It introduces a two-step approach: Step 1 uses a convolutional neural network (CNN) to extract a high-frequency risk score $Z_{CNN}$ from 1-minute vital signs, and Step 2 embeds this score within a Deep LM-CR model to produce time-dependent CIFs $F_{j,LM}(t_{hor}|\cdot)$. Explainability is provided via Saliency Map Order Equivalent (SMOE) scale-based saliency maps to link CNN patterns to clinical conditions, with data-driven clustering identifying patterns corresponding to tachycardia, hypotension, desaturation, and hyperventilation. Compared with a full ANN model, the Deep LM-CR offers similar accuracy with greater stability and interpretable covariate effects, achieving a global AUROC around 0.75 and a slightly improved Brier score, thereby enabling earlier and more reliable infection warnings in ICU settings.
Abstract
We present a novel methodology for integrating high resolution longitudinal data with the dynamic prediction capabilities of survival models. The aim is two-fold: to improve the predictive power while maintaining interpretability of the models. To go beyond the black box paradigm of artificial neural networks, we propose a parsimonious and robust semi-parametric approach (i.e., a landmarking competing risks model) that combines routinely collected low-resolution data with predictive features extracted from a convolutional neural network, that was trained on high resolution time-dependent information. We then use saliency maps to analyze and explain the extra predictive power of this model. To illustrate our methodology, we focus on healthcare-associated infections in patients admitted to an intensive care unit.
