Improving EEG Classification Through Randomly Reassembling Original and Generated Data with Transformer-based Diffusion Models
Mingzhi Chen, Yiyu Gui, Yuqi Su, Yuesheng Zhu, Guibo Luo, Yuchao Yang
TL;DR
The paper addresses the scarcity and quality issues in EEG data for classification by introducing a Transformer-based denoising diffusion probabilistic model (EEG Diffusion Transformer) augmented with Multi-Scale Convolution and Dynamic Fourier Spectrum Information to generate high-quality EEG signals. It then pairs generation with a novel Generated-Original Signals Reassembled (GO) augmentation, which reconstructs labels for generated data and fabricates vicinal data by reassembling original and generated signals, optimizing both empirical and vicinal risks. Across four EEG datasets and two backbone classifiers, the approach improves generation quality (lower EEG-FID) and classification accuracy, with notable gains on Bonn, SleepEDF-20, FACED, and Shu datasets. The method demonstrates universality across tasks, offering a practical, scalable pathway for enhancing EEG-based diagnostics and brain-computer interfaces, while acknowledging current limitations to EEG and proposing broader time-series extensions in future work.
Abstract
Electroencephalogram (EEG) classification has been widely used in various medical and engineering applications, where it is important for understanding brain function, diagnosing diseases, and assessing mental health conditions. However, the scarcity of EEG data severely restricts the performance of EEG classification networks, and generative model-based data augmentation methods have emerged as potential solutions to overcome this challenge. There are two problems with existing methods: (1) The quality of the generated EEG signals is not high; (2) The enhancement of EEG classification networks is not effective. In this paper, we propose a Transformer-based denoising diffusion probabilistic model and a generated data-based augmentation method to address the above two problems. For the characteristics of EEG signals, we propose a constant-factor scaling method to preprocess the signals, which reduces the loss of information. We incorporated Multi-Scale Convolution and Dynamic Fourier Spectrum Information modules into the model, improving the stability of the training process and the quality of the generated data. The proposed augmentation method randomly reassemble the generated data with original data in the time-domain to obtain vicinal data, which improves the model performance by minimizing the empirical risk and the vicinal risk. We verify the proposed augmentation method on four EEG datasets for four tasks and observe significant accuracy performance improvements: 14.00% on the Bonn dataset; 6.38% on the SleepEDF-20 dataset; 9.42% on the FACED dataset; 2.5% on the Shu dataset. We will make the code of our method publicly accessible soon.
