DementiaBank-Emotion: A Multi-Rater Emotion Annotation Corpus for Alzheimer's Disease Speech (Version 1.0)
Cheonkam Jeong, Jessica Liao, Audrey Lu, Yutong Song, Christopher Rashidian, Donna Krogh, Erik Krogh, Mahkameh Rasouli, Jung-Ah Lee, Nikil Dutt, Lisa M Gibbs, David Sultzer, Julie Rousseau, Jocelyn Ludlow, Margaret Galvez, Alexander Nuth, Chet Khay, Sabine Brunswicker, Adeline Nyamathi
TL;DR
DementiaBank-Emotion (v1.0) introduces the first multi-rater emotion annotation corpus for Alzheimer's disease speech, annotating 1,492 utterances from 108 speakers for Ekman’s basic emotions plus neutral. The study finds AD speech expresses more non-neutral emotions (16.9%) than controls (5.7%), with exploratory evidence of acoustic flattening in sadness (reduced F0 modulation) and partially preserved emotion-prosody mappings within AD as indicated by higher loudness for certain emotions. A hierarchical adjudication protocol and calibration workshops support label reliability, while qualitative analyses reveal laughter as a coping mechanism and emotion-aware interpretations tied to the Cookie Theft task. The dataset, guidelines, and calibration materials enable more nuanced emotion recognition research in clinical populations and set the stage for v2.0 enhancements including longitudinal data and finer-grained phonetic analyses.
Abstract
We present DementiaBank-Emotion, the first multi-rater emotion annotation corpus for Alzheimer's disease (AD) speech. Annotating 1,492 utterances from 108 speakers for Ekman's six basic emotions and neutral, we find that AD patients express significantly more non-neutral emotions (16.9%) than healthy controls (5.7%; p < .001). Exploratory acoustic analysis suggests a possible dissociation: control speakers showed substantial F0 modulation for sadness (Delta = -3.45 semitones from baseline), whereas AD speakers showed minimal change (Delta = +0.11 semitones; interaction p = .023), though this finding is based on limited samples (sadness: n=5 control, n=15 AD) and requires replication. Within AD speech, loudness differentiates emotion categories, indicating partially preserved emotion-prosody mappings. We release the corpus, annotation guidelines, and calibration workshop materials to support research on emotion recognition in clinical populations.
