A Review of Deep Learning Approaches for Non-Invasive Cognitive Impairment Detection
Muath Alsuhaibani, Ali Pourramezan Fard, Jian Sun, Farida Far Poor, Peter S. Pressman, Mohammad H. Mahoor
TL;DR
This paper surveys deep learning approaches for non-invasive cognitive impairment detection, emphasizing speech as the most developed modality while showcasing visual and movement-based signals and their multimodal integration. It catalogs public and private datasets, discusses feature extraction and modeling techniques across acoustic, linguistic, visual, and movement data, and evaluates performance trends by modality. The review highlights that combining acoustic and linguistic speech features yields robust detection, identifies recurring challenges in data standardization, model explainability, and clinical adoption, and proposes directions such as language-agnostic analysis, multi-modal diagnostics, and privacy-preserving learning. Overall, the work maps current progress, clarifies obstacles to clinical translation, and outlines practical pathways to improve early diagnosis and patient outcomes.
Abstract
This review paper explores recent advances in deep learning approaches for non-invasive cognitive impairment detection. We examine various non-invasive indicators of cognitive decline, including speech and language, facial, and motoric mobility. The paper provides an overview of relevant datasets, feature-extracting techniques, and deep-learning architectures applied to this domain. We have analyzed the performance of different methods across modalities and observed that speech and language-based methods generally achieved the highest detection performance. Studies combining acoustic and linguistic features tended to outperform those using a single modality. Facial analysis methods showed promise for visual modalities but were less extensively studied. Most papers focused on binary classification (impaired vs. non-impaired), with fewer addressing multi-class or regression tasks. Transfer learning and pre-trained language models emerged as popular and effective techniques, especially for linguistic analysis. Despite significant progress, several challenges remain, including data standardization and accessibility, model explainability, longitudinal analysis limitations, and clinical adaptation. Lastly, we propose future research directions, such as investigating language-agnostic speech analysis methods, developing multi-modal diagnostic systems, and addressing ethical considerations in AI-assisted healthcare. By synthesizing current trends and identifying key obstacles, this review aims to guide further development of deep learning-based cognitive impairment detection systems to improve early diagnosis and ultimately patient outcomes.
