Improving Clinical Imaging Systems using Cognition based Approaches
Kailas Dayanandan, Brejesh Lall
TL;DR
The paper tackles the problem of safely integrating AI into clinical imaging by proposing a cognition-based approach that mirrors radiologists’ analytical workflows. It presents a think-along system that replicates the ABCDE regional analysis, leveraging deep learning to reveal context around affected areas through an effective receptive field, thereby supporting System 2 deliberation. Through qualitative clinician interviews and extrinsic datasets (VQA-RAD, MIMIC-CXR, VinDr-CXR), it identifies hard-to-diagnose diseases and demonstrates how context-aware machine diagnoses can reduce inattentional blindness and supervision burden. The findings suggest design guidelines for creating complementary AI that enhances diagnostic accuracy and efficiency in real-world settings, with practical implications for rural and resource-limited environments. Overall, the work advances human–AI collaboration in medical imaging by coupling cognitive insights with context-rich visual explanations to support clinician supervision rather than replace it.
Abstract
Clinical systems operate in safety-critical environments and are not intended to function autonomously; however, they are currently designed to replicate clinicians' diagnoses rather than assist them in the diagnostic process. To enable better supervision of system-generated diagnoses, we replicate radiologists' systematic approach used to analyze chest X-rays. This approach facilitates comprehensive analysis across all regions of clinical images and can reduce errors caused by inattentional blindness and under reading. Our work addresses a critical research gap by identifying difficult-to-diagnose diseases for clinicians using insights from human vision, enabling these systems to serve as an effective "second pair of eyes". These improvements make the clinical imaging systems more complementary and combine the strengths of human and machine vision. Additionally, we leverage effective receptive fields in deep learning models to present machine-generated diagnoses with sufficient context, making it easier for clinicians to evaluate them.
