On the challenges of detecting MCI using EEG in the wild
Aayush Mishra, David Joffe, Sankara Surendra Telidevara, David S Oakley, Anqi Liu
TL;DR
The paper critically assesses the practicality of EEG-based MCI detection in real-world clinical settings by comparing a large, expert-labeled CAUEEG dataset with a smaller, general-practice GENEEG dataset. It shows that models trained on small data exhibit high variance and overconfidence, and that cross-domain shifts substantially degrade performance, revealing fundamental limits due to overlap between MCI and control feature distributions. The study finds CNNs on time-domain data to be the most effective among tested architectures, but performances remain modest in the presence of domain shift and data scarcity, underscoring the need for high-quality data collection and uncertainty-aware methods. Overall, the work argues for realistic goals, robust cross-domain validation, and uncertainty-aware modeling to advance non-invasive EEG-based MCI detection toward practical clinical use.
Abstract
Recent studies have shown promising results in the detection of Mild Cognitive Impairment (MCI) using easily accessible Electroencephalogram (EEG) data which would help administer early and effective treatment for dementia patients. However, the reliability and practicality of such systems remains unclear. In this work, we investigate the potential limitations and challenges in developing a robust MCI detection method using two contrasting datasets: 1) CAUEEG, collected and annotated by expert neurologists in controlled settings and 2) GENEEG, a new dataset collected and annotated in general practice clinics, a setting where routine MCI diagnoses are typically made. We find that training on small datasets, as is done by most previous works, tends to produce high variance models that make overconfident predictions, and are unreliable in practice. Additionally, distribution shifts between datasets make cross-domain generalization challenging. Finally, we show that MCI detection using EEG may suffer from fundamental limitations because of the overlapping nature of feature distributions with control groups. We call for more effort in high-quality data collection in actionable settings (like general practice clinics) to make progress towards this salient goal of non-invasive MCI detection.
