Adaptive Temporal Dynamics for Personalized Emotion Recognition: A Liquid Neural Network Approach
Anindya Bhattacharjee, Nittya Ananda Biswas, K. A. Shahriar, Adib Rahman
TL;DR
This work tackles the challenge of personalized emotion recognition from non-stationary physiological signals by introducing an attention-augmented Liquid Time-Constant Network (LTC) framework that jointly models EEG and peripheral signals. The approach uses CNN front-ends for raw EEG, LTC-based temporal dynamics with learnable time constants, and a self-attention mechanism to summarize temporal information, paired with an autoencoder-based cross-modal fusion that includes personality context. On PhyMER, it achieves 95.45% accuracy across seven discrete emotions, with near-perfect class separability (AUCs ≈ 0.99) and interpretable temporal signatures via attention and t-SNE visualizations. The results reveal distinct temporal neuron groups and emotion-specific attention patterns, demonstrating the method’s potential for efficient, edge-friendly, personalized affective computing while outlining avenues for subject-independent generalization and lighter fusion strategies.
Abstract
Emotion recognition from physiological signals remains challenging due to their non-stationary, noisy, and subject-dependent characteristics. This work presents, to the best of our knowledge, the first comprehensive application of liquid neural networks for EEG-based emotion recognition. The proposed multimodal framework combines convolutional feature extraction, liquid neural networks with learnable time constants, and attention-guided fusion to model temporal EEG dynamics with complementary peripheral physiological and personality features. Dedicated subnetworks are used to process EEG features and auxiliary modalities, and a shared autoencoder-based fusion module is used to learn discriminative latent representations before classification. Subject-dependent experiments conducted on the PhyMER dataset across seven emotional classes achieve an accuracy of 95.45%, surpassing previously reported results. Furthermore, temporal attention analysis provides interpretable insights into emotion-specific temporal relevance, and t-SNE visualizations demonstrate enhanced class separability, highlighting the effectiveness of the proposed approach. Finally, statistical analysis of temporal dynamics confirms that the network self-organizes into distinct functional groups with specialized fast and slow neurons, proving it independently tunes learnable time constants and memory dominance to effectively capture complex emotion artifacts.
