One Model for All: Universal Pre-training for EEG based Emotion Recognition across Heterogeneous Datasets and Paradigms
Xiang Li, You Li, Yazhou Zhang
TL;DR
The paper tackles profound heterogeneity in EEG-based emotion recognition across datasets by proposing a universal pre-training framework. It decouples learning into univariate channel-wise pre-training using contrastive SSL with a Unified Channel Schema (UCS), followed by per-subject fine-tuning with an Adaptive Resampling Transformer (ART) and Graph Attention Network (GAT) classifier. The approach achieves state-of-the-art within-subject performance on SEED, DEAP, and DREAMER, and demonstrates strong cross-dataset transfer, even surpassing within-domain upper bounds in some cases. Ablation studies show the necessity of pre-training, the critical role of GAT in handling noisy data, and the advantage of the ART encoder, collectively enabling scalable, universal EEG foundation models for diverse analysis tasks.
Abstract
EEG-based emotion recognition is hampered by profound dataset heterogeneity (channel/subject variability), hindering generalizable models. Existing approaches struggle to transfer knowledge effectively. We propose 'One Model for All', a universal pre-training framework for EEG analysis across disparate datasets. Our paradigm decouples learning into two stages: (1) Univariate pre-training via self-supervised contrastive learning on individual channels, enabled by a Unified Channel Schema (UCS) that leverages the channel union (e.g., SEED-62ch, DEAP-32ch); (2) Multivariate fine-tuning with a novel 'ART' (Adaptive Resampling Transformer) and 'GAT' (Graph Attention Network) architecture to capture complex spatio-temporal dependencies. Experiments show universal pre-training is an essential stabilizer, preventing collapse on SEED (vs. scratch) and yielding substantial gains on DEAP (+7.65%) and DREAMER (+3.55%). Our framework achieves new SOTA performance on all within-subject benchmarks: SEED (99.27%), DEAP (93.69%), and DREAMER (93.93%). We also show SOTA cross-dataset transfer, achieving 94.08% (intersection) and 93.05% (UCS) on the unseen DREAMER dataset, with the former surpassing the within-domain pre-training benchmark. Ablation studies validate our architecture: the GAT module is critical, yielding a +22.19% gain over GCN on the high-noise DEAP dataset, and its removal causes a catastrophic -16.44% performance drop. This work paves the way for more universal, scalable, and effective pre-trained models for diverse EEG analysis tasks.
