From Coarse to Fine-Grained Emotion Annotation: An Immediate Recall Paradigm with Validation through Physiological Evidence and Recognition Performance

Hao Tang; Songyun Xie; Xinzhou Xie; Can Liao; Xin Zhang; Bohan Li; Zhongyu Tian; Dalu Zheng

TL;DR

This work addresses label noise in video-induced emotion datasets by introducing an immediate recall paradigm that yields fine-grained, timestamped emotion annotations anchored to the moment of subjective experience. The FIRMED dataset combines synchronized EEG, ECG, GSR, PPG, and facial data with an immediate replay phase in which participants mark discrete event timestamps $t_{event}$, each defining a 4-s window; the annotations are validated against central nervous system (CNS) and autonomic nervous system (ANS) physiological markers. Models trained on FIRMED labels outperform those trained on traditional whole-trial labels across EEG-only and multimodal configurations, with notable gains when modalities are fused and when the window is centered on $t_{event}$. The findings indicate that annotation precision can outweigh data scale in determining emotion recognition performance, and that the approach reduces annotation uncertainty compared with delayed recall methods. The paradigm supports more ecologically valid emotion labeling and has implications for building more reliable affective computing systems in real-world settings.

Abstract

Traditional video-induced emotion physiological datasets often use whole-trial annotation, assigning a single emotion label to all data collected during a trial. This coarse-grained approach is misaligned with the dynamic, temporally localized nature of emotional responses as they unfold with the video narrative, and it introduces label noise that limits both the evaluation and the performance of emotion recognition algorithms. To reduce this label noise, we propose a fine-grained annotation method based on an immediate recall paradigm. The paradigm adds an immediate video replay phase after the initial stimulus viewing, allowing participants to mark the onset timestamp, emotion label, and intensity of each emotional event while their recall is still fresh. We validate the paradigm through physiological evidence and recognition performance. Physiological validation of multimodal signals within participant-marked windows revealed rhythm-specific EEG patterns and arousal-dependent GSR responses: skin conductance responses (SCRs) appeared in 91% of high-arousal versus 6% of low-arousal emotion windows. These objective physiological changes aligned strongly with the subjective annotations, supporting their precision. For recognition performance, classification experiments showed that models trained on fine-grained annotations achieved 9.7% higher accuracy than models trained on traditional whole-trial labels, despite using less data. This work not only addresses label noise through fine-grained annotation but also demonstrates that annotation precision outweighs data scale in determining emotion recognition performance.
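To make the contrast between the two labeling schemes concrete, the following is a minimal sketch, not the authors' pipeline. It assumes a single-channel recording stored as a plain Python sequence, a hypothetical sampling rate of 250 Hz, and a 4-s window centered on the participant-marked timestamp. The function names `event_window` and `whole_trial_windows`, and the non-overlapping segmentation used for the whole-trial baseline, are illustrative assumptions.

```python
def event_window(samples, fs, t_event, window_s=4.0):
    """Return the samples in a window of window_s seconds centered on t_event.

    samples : sequence of samples from one channel of one trial
    fs      : sampling rate in Hz (assumed constant)
    t_event : participant-marked event time in seconds from trial start
    """
    half = round(window_s * fs / 2)   # half-window length in samples
    center = round(t_event * fs)      # event position in samples
    start, stop = center - half, center + half
    if start < 0 or stop > len(samples):
        raise ValueError("window extends beyond the recorded trial")
    return samples[start:stop]


def whole_trial_windows(samples, fs, window_s=4.0):
    """Coarse baseline (assumed): cut the whole trial into non-overlapping
    windows that would all inherit the single trial-level label."""
    n = round(window_s * fs)
    return [samples[i * n:(i + 1) * n] for i in range(len(samples) // n)]


# Synthetic 60-s trial; sample values equal their index for easy inspection.
fs = 250                       # hypothetical sampling rate
trial = list(range(60 * fs))
fine = event_window(trial, fs, t_event=23.5)   # one precisely labeled window
coarse = whole_trial_windows(trial, fs)        # many windows, one shared label
```

In this sketch the fine-grained scheme keeps a single window per marked event, while the whole-trial scheme produces many windows that share one label whether or not the emotion was present in them, which is the source of label noise the abstract describes.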
