Designing and Evaluating a Conversational Agent for Early Diagnosis of Alzheimer's Disease and Related Dementias
Andrew G. Breithaupt, Nayoung Choi, James D. Finch, Jeanne M. Powell, Arin L. Nelson, Oz A. Alon, Howard J. Rosen, Jinho D. Choi
TL;DR
The paper tackles the problem of enabling timely ADRD diagnosis by collecting rich patient narratives via a voice-based conversational agent powered by large language models. It develops a socio-technical interview system tailored for older adults and multi-party sessions, and evaluates it in a within-subject study (n=30) against clinician interviews. Results show promising agreement in symptom elicitation (83.5% sensitivity, 100% specificity) and high user acceptance, with sequential prompting improving depth but risking topic coverage. The work demonstrates feasibility and provides design considerations for integrating agent-led history taking into clinical workflows, while acknowledging limitations and outlining directions for scaling, separation of voices, and validation of clinical utility.
Abstract
Early diagnosis of Alzheimer's disease and related dementias (ADRD) is critical for timely intervention, yet most diagnoses are delayed until advanced stages. While comprehensive patient narratives are essential for accurate diagnosis, prior work has largely focused on screening studies that classify cognitive status from interactions rather than supporting the diagnostic process. We designed voice-interactive conversational agents, leveraging large language models (LLMs), to elicit narratives relevant to ADRD from patients and informants. We evaluated the agent with 30 adults with suspected ADRD through conversation analysis, user surveys, and analysis of symptom elicitation compared to blinded specialist interviews. Symptoms detected by the agent showed promising agreement with those identified by specialists. Users appreciated the agent's patience and systematic questioning, which supported engagement and expression of complex, hard-to-describe experiences. While these findings suggest potential for conversational agents as structured diagnostic support tools, further validation with larger samples and assessment of clinical utility is needed before deployment.
