Going beyond explainability in multi-modal stroke outcome prediction models
Jonas Brändli, Maurice Schneeberger, Lisa Herzog, Loran Avci, Nordin Dari, Martin Häansel, Hakim Baazaoui, Pascal Bühler, Susanne Wegener, Beate Sick
TL;DR
This study addresses trustworthy, interpretable stroke outcome prediction by integrating imaging and tabular patient data via deep Transformation Models (dTMs). By adapting Grad-CAM and Occlusion to multi-modal dTMs, the authors produce explanation maps that highlight relevant brain regions and enable error analysis, while retaining interpretable parameters for tabular features as log-odds-like effects. The results show that tabular-plus-imaging dTMs achieve near 0.8 AUC, with interpretable beta coefficients confirming known risk factors such as pre-stroke functional dependence and NIHSS on admission. Overall, the approach balances high predictive performance with explanation capabilities, supporting clinical trust and enabling hypothesis generation about image-based predictors of stroke outcome.
Abstract
Aim: This study aims to enhance interpretability and explainability of multi-modal prediction models integrating imaging and tabular patient data. Methods: We adapt the xAI methods Grad-CAM and Occlusion to multi-modal, partly interpretable deep transformation models (dTMs). DTMs combine statistical and deep learning approaches to simultaneously achieve state-of-the-art prediction performance and interpretable parameter estimates, such as odds ratios for tabular features. Based on brain imaging and tabular data from 407 stroke patients, we trained dTMs to predict functional outcome three months after stroke. We evaluated the models using different discriminatory metrics. The adapted xAI methods were used to generated explanation maps for identification of relevant image features and error analysis. Results: The dTMs achieve state-of-the-art prediction performance, with area under the curve (AUC) values close to 0.8. The most important tabular predictors of functional outcome are functional independence before stroke and NIHSS on admission, a neurological score indicating stroke severity. Explanation maps calculated from brain imaging dTMs for functional outcome highlighted critical brain regions such as the frontal lobe, which is known to be linked to age which in turn increases the risk for unfavorable outcomes. Similarity plots of the explanation maps revealed distinct patterns which give insight into stroke pathophysiology, support developing novel predictors of stroke outcome and enable to identify false predictions. Conclusion: By adapting methods for explanation maps to dTMs, we enhanced the explainability of multi-modal and partly interpretable prediction models. The resulting explanation maps facilitate error analysis and support hypothesis generation regarding the significance of specific image regions in outcome prediction.
