Multi-label Class Incremental Emotion Decoding with Augmented Emotional Semantics Learning
Kaicheng Fu, Changde Du, Xiaoyu Chen, Jie Peng, Huiguang He
TL;DR
This work tackles multi-label class incremental emotion decoding (MLCIL) in dynamic human-computer interaction by introducing the AESL framework, which integrates an augmented emotional relation graph with label disambiguation, semantic-guided feature decoupling, and relation-based knowledge distillation from affective dimensions. A graph-autoencoder-based emotional semantics learning module yields emotion embeddings that guide label-aware feature extraction, while RKD aligns model representations with the affective space to mitigate future-missing labels. The method demonstrates strong, consistent improvements over state-of-the-art baselines across Brain27, Video27, and Audio28 under several incremental protocols, supported by ablations, visualization, and statistical tests. These results suggest AESL’s potential for robust, scalable emotion decoding in real-world, continually evolving affective systems.
Abstract
Emotion decoding plays an important role in affective human-computer interaction. However, previous studies ignored the dynamic real-world scenario, where human experience a blend of multiple emotions which are incrementally integrated into the model, leading to the multi-label class incremental learning (MLCIL) problem. Existing methods have difficulty in solving MLCIL issue due to notorious catastrophic forgetting caused by partial label problem and inadequate label semantics mining. In this paper, we propose an augmented emotional semantics learning framework for multi-label class incremental emotion decoding. Specifically, we design an augmented emotional relation graph module with label disambiguation to handle the past-missing partial label problem. Then, we leverage domain knowledge from affective dimension space to alleviate future-missing partial label problem by knowledge distillation. Besides, an emotional semantics learning module is constructed with a graph autoencoder to obtain emotion embeddings in order to guide the semantic-specific feature decoupling for better multi-label learning. Extensive experiments on three datasets show the superiority of our method for improving emotion decoding performance and mitigating forgetting on MLCIL problem.
