Leveraging Expert Input for Robust and Explainable AI-Assisted Lung Cancer Detection in Chest X-rays
Amy Rafferty, Rishi Ramaesh, Ajitha Rajan
TL;DR
The paper addresses the challenge of deploying AI for lung cancer detection in chest X-rays by evaluating existing XAI approaches and robustness methods. It reveals that post-hoc image-based and text-based XAI techniques often fail to provide clinically meaningful explanations and that expert input is crucial for clinical relevance. The authors introduce ClinicXAI, an expert-driven Concept Bottleneck Model that uses radiologist-curated concepts to produce interpretable explanations while preserving high diagnostic accuracy and robustness to adversarial attacks. ClinicXAI achieves superior concept reliability and radiologist-assessed clinical utility, as well as improved resistance to adversarial perturbations, demonstrating the practical value of domain-guided interpretable AI in healthcare. The work highlights a path toward more trustworthy AI systems in medical diagnostics through close collaboration with clinicians and domain-specific concept definitions.
Abstract
Deep learning models show significant potential for advancing AI-assisted medical diagnostics, particularly in detecting lung cancer through medical image modalities such as chest X-rays. However, the black-box nature of these models poses challenges to their interpretability and trustworthiness, limiting their adoption in clinical practice. This study examines both the interpretability and robustness of a high-performing lung cancer detection model based on InceptionV3, utilizing a public dataset of chest X-rays and radiological reports. We evaluate the clinical utility of multiple explainable AI (XAI) techniques, including both post-hoc and ante-hoc approaches, and find that existing methods often fail to provide clinically relevant explanations, displaying inconsistencies and divergence from expert radiologist assessments. To address these limitations, we collaborated with a radiologist to define diagnosis-specific clinical concepts and developed ClinicXAI, an expert-driven approach leveraging the concept bottleneck methodology. ClinicXAI generated clinically meaningful explanations which closely aligned with the practical requirements of clinicians while maintaining high diagnostic accuracy. We also assess the robustness of ClinicXAI in comparison to the original InceptionV3 model by subjecting both to a series of widely utilized adversarial attacks. Our analysis demonstrates that ClinicXAI exhibits significantly greater resilience to adversarial perturbations. These findings underscore the importance of incorporating domain expertise into the design of interpretable and robust AI systems for medical diagnostics, paving the way for more trustworthy and effective AI solutions in healthcare.
