Federated Learning for Predicting Mild Cognitive Impairment to Dementia Conversion
Gaurang Sharma, Elaheh Moradi, Juha Pajula, Mika Hilvo, Jussi Tohka
TL;DR
This study tackles the privacy challenge of predicting progression from Mild Cognitive Impairment to dementia by applying Federated Learning to cognitive and socio-demographic data from the ADNI dataset. It compares centralized ML, site-specific training, and two FL architectures—client-server with FedAvg and P2P with FedGD—under horizontal FL with 10-fold cross-validation across 3, 4, and 5 year follow-ups. The results show FL achieves performance comparable to centralized ML and superior generalization to site-specific models, with FedGD offering benefits in collaborative normalization and FedAvg delivering computational efficiency. The work demonstrates that privacy-preserving FL can enable scalable, non-invasive dementia prognosis across multiple sites without sharing sensitive data.
Abstract
Dementia is a progressive condition that impairs an individual's cognitive health and daily functioning, with mild cognitive impairment (MCI) often serving as its precursor. The prediction of MCI to dementia conversion has been well studied, but previous studies have almost always focused on traditional Machine Learning (ML) based methods that require sharing sensitive clinical information to train predictive models. This study proposes a privacy-enhancing solution using Federated Learning (FL) to train predictive models for MCI to dementia conversion without sharing sensitive data, leveraging socio demographic and cognitive measures. We simulated and compared two network architectures, Peer to Peer (P2P) and client-server, to enable collaborative learning. Our results demonstrated that FL had comparable predictive performance to centralized ML, and each clinical site showed similar performance without sharing local data. Moreover, the predictive performance of FL models was superior to site specific models trained without collaboration. This work highlights that FL can eliminate the need for data sharing without compromising model efficacy.
