Improving Model's Interpretability and Reliability using Biomarkers
Gautam Rajendrakumar Gare, Tom Fox, Beam Chansangavej, Amita Krishnan, Ricardo Luis Rodriguez, Bennett P deBoisblanc, Deva Kannan Ramanan, John Michael Galeotti
TL;DR
This work addresses the need for interpretable AI in safety-critical medicine by proposing a biomarker-based lung ultrasound diagnostic pipeline that enforces an interpretable biomarker bottleneck before downstream decision rules. A decision-tree classifier on clinically established biomarkers is compared with Grad-CAM saliency maps to explain predictions, with a user study involving three LUS clinicians. Results indicate that decision-tree explanations help detect false positives, while saliency maps more readily assist with true positives; combining both explanations yields the most consistent clinician judgments. The findings support deploying biomarker-driven interpretability tools to enhance the reliability and safety of lung ultrasound AI in clinical practice.
Abstract
Accurate and interpretable diagnostic models are crucial in the safety-critical field of medicine. We investigate the interpretability of our proposed biomarker-based lung ultrasound diagnostic pipeline to enhance clinicians' diagnostic capabilities. The objective of this study is to assess whether explanations from a decision tree classifier, utilizing biomarkers, can improve users' ability to identify inaccurate model predictions compared to conventional saliency maps. Our findings demonstrate that decision tree explanations, based on clinically established biomarkers, can assist clinicians in detecting false positives, thus improving the reliability of diagnostic models in medicine.
