"It depends": Configuring AI to Improve Clinical Usefulness Across Contexts
Hubert D. Zając, Jorge M. N. Ribeiro, Silvia Ingala, Simona Gentile, Ruth Wanjohi, Samuel N. Gitau, Jonathan F. Carlsen, Michael B. Nielsen, Tariq O. Andersen
TL;DR
The paper tackles the gap between lab-level AI performance and real-world clinical usefulness in radiology. It employs a design-through-research approach across Denmark and Kenya, iterating three AI prototype versions with 13 radiologists to identify four configurable technical dimensions (AI functionality, AI medical focus, AI decision threshold, AI Explainability) and ten sociotechnical dependencies (social dimensions) that condition usefulness. The main contributions are four concrete design recommendations and a dependency-informed framework for configuring AI before-use and in-use to align with clinic type, expertise level, patient context, and workload. The findings have practical significance for deploying AI in diverse clinical settings, suggesting that configurable AI can improve adoption, trust, and potentially patient outcomes by tailoring AI support to local practice. The work also highlights limitations and points to future exploration of richer clinical data integration and the potential role of large language models in radiology support.
Abstract
Artificial Intelligence (AI) repeatedly match or outperform radiologists in lab experiments. However, real-world implementations of radiological AI-based systems are found to provide little to no clinical value. This paper explores how to design AI for clinical usefulness in different contexts. We conducted 19 design sessions and design interventions with 13 radiologists from 7 clinical sites in Denmark and Kenya, based on three iterations of a functional AI-based prototype. Ten sociotechnical dependencies were identified as crucial for the design of AI in radiology. We conceptualised four technical dimensions that must be configured to the intended clinical context of use: AI functionality, AI medical focus, AI decision threshold, and AI Explainability. We present four design recommendations on how to address dependencies pertaining to the medical knowledge, clinic type, user expertise level, patient context, and user situation that condition the configuration of these technical dimensions.
