Enhancing EEG Signal-Based Emotion Recognition with Synthetic Data: Diffusion Model Approach
Gourav Siddhad, Masakazu Iwamura, Partha Pratim Roy
TL;DR
This work tackles data scarcity in EEG-based emotion recognition by introducing a conditional denoising diffusion model that augments real EEG with noise-augmented synthetic data. The method generates raw EEG-like signals and, when combined with real data, improves classifier performance across DL and non-DL models on DEAP and SADT, outperforming GAN-based and vanilla DDPM baselines. Quantitative gains are supported by qualitative analyses (t-SNE) and interpretability (SHAP), while ablation studies reveal optimal synthetic-data proportions. The approach demonstrates potential for data-efficient EEG affective computing, with future work addressing broader emotion spaces, additional datasets, and efficiency optimizations.
Abstract
Emotions are crucial in human life, influencing perceptions, relationships, behaviour, and choices. Emotion recognition using Electroencephalography (EEG) in the Brain-Computer Interface (BCI) domain presents significant challenges, particularly the need for extensive datasets. This study aims to generate synthetic EEG samples similar to real samples but distinct by augmenting noise to a conditional denoising diffusion probabilistic model, thus addressing the prevalent issue of data scarcity in EEG research. The proposed method is tested on the DEAP and SADT datasets, showcasing up to 5.6% improvement in classification accuracy when using synthetic data with DEAP and similar positive results with SADT. This is higher compared to the traditional Generative Adversarial Network (GAN) based and Denoising Diffusion Probabilistic Model (DDPM) based approaches. This study further evaluates the effectiveness of state-of-the-art classifiers on EEG data, employing both real and synthetic data with varying noise levels, and utilizes t-SNE and SHAP for detailed analysis and interpretability. The proposed diffusion-based approach for EEG data generation appears promising in refining the accuracy of emotion recognition systems and marks a notable contribution to EEG-based emotion recognition.
