Using AI to Measure Parkinson's Disease Severity at Home
Md Saiful Islam, Wasifur Rahman, Abdelrahman Abdelkader, Phillip T. Yang, Sangwu Lee, Jamie L. Adams, Ruth B. Schneider, E. Ray Dorsey, Ehsan Hoque
TL;DR
This work tackles remote, home-based assessment of Parkinson's disease severity by enabling individuals to perform a finger-tapping task in front of a webcam and have their motor impairment scored automatically. It combines MediaPipe hand tracking with a curated set of interpretable digital biomarkers, selecting 22 informative features out of 53 and training a LightGBM regressor under leave-one-patient-out cross-validation; the model achieves a mean absolute error of $0.58$ and a Pearson correlation of $0.66$ with ground-truth scores, approaching but not fully matching expert ratings. The study demonstrates strong inter-expert reliability (ICC $=0.88$, Krippendorff's $ ext{alpha}=0.69$) and shows that SHAP explanations align with clinically meaningful signals, suggesting potential for wide accessibility in resource-limited settings. Limitations include a relatively small, imbalanced dataset with few severe cases and tremor confounds; nonetheless, the approach offers a scalable, interpretable pathway to extend digital biomarkers and remote monitoring to other movement disorders and tasks.
Abstract
We present an artificial intelligence system to remotely assess the motor performance of individuals with Parkinson's disease (PD). Participants performed a motor task (i.e., tapping fingers) in front of a webcam, and data from 250 global participants were rated by three expert neurologists following the Movement Disorder Society Unified Parkinson's Disease Rating Scale (MDS-UPDRS). The neurologists' ratings were highly reliable, with an intra-class correlation coefficient (ICC) of 0.88. We developed computer algorithms to obtain objective measurements that align with the MDS-UPDRS guideline and are strongly correlated with the neurologists' ratings. Our machine learning model trained on these measures outperformed an MDS-UPDRS certified rater, with a mean absolute error (MAE) of 0.59 compared to the rater's MAE of 0.79. However, the model performed slightly worse than the expert neurologists (0.53 MAE). The methodology can be replicated for similar motor tasks, providing the possibility of evaluating individuals with PD and other movement disorders remotely, objectively, and in areas with limited access to neurological care.
