Unifying Interpretability and Explainability for Alzheimer's Disease Progression Prediction
Raja Farrukh Ali, Stephanie Milani, John Woods, Emmanuel Adenij, Ayesha Farooq, Clayton Mansel, Jeffrey Burns, William Hsu
TL;DR
This work addresses predicting Alzheimer's disease progression by unifying interpretability and explainability through an interpretable, mechanistic RL framework (IXRL) that models cognition trajectories over a decade using baseline data. By embedding domain knowledge via differential equations and a two-node brain network (HC and PFC), and applying SHAP for post-hoc explanations, the study evaluates four RL algorithms (TRPO, PPO, DDPG, SAC) and demonstrates that TRPO most accurately captures long-term cognition decline, while explanations reveal limited reliance on amyloid among the decisions. The findings highlight both the promise and limits of RL in healthcare: predictive accuracy can be achieved with domain-guided models, yet pathophysiological biomarkers like amyloid may be underemphasized in decisions, underscoring the need for richer biomarkers and more comprehensive brain models. The IXRL framework advances actionable, transparent AD progression modeling, offering clinicians both global and patient-level insights while pointing to practical improvements for future predictive and explanatory AI in healthcare.
Abstract
Reinforcement learning (RL) has recently shown promise in predicting Alzheimer's disease (AD) progression due to its unique ability to model domain knowledge. However, it is not clear which RL algorithms are well-suited for this task. Furthermore, these methods are not inherently explainable, limiting their applicability in real-world clinical scenarios. Our work addresses these two important questions. Using a causal, interpretable model of AD, we first compare the performance of four contemporary RL algorithms in predicting brain cognition over 10 years using only baseline (year 0) data. We then apply SHAP (SHapley Additive exPlanations) to explain the decisions made by each algorithm in the model. Our approach combines interpretability with explainability to provide insights into the key factors influencing AD progression, offering both global and individual, patient-level analysis. Our findings show that only one of the RL methods is able to satisfactorily model disease progression, but the post-hoc explanations indicate that all methods fail to properly capture the importance of amyloid accumulation, one of the pathological hallmarks of Alzheimer's disease. Our work aims to merge predictive accuracy with transparency, assisting clinicians and researchers in enhancing disease progression modeling for informed healthcare decisions. Code is available at https://github.com/rfali/xrlad.
