CwA-T: A Channelwise AutoEncoder with Transformer for EEG Abnormality Detection
Youshen Zhao, Keiji Iramina
TL;DR
This work tackles the challenge of detecting EEG abnormalities from long-term, high-dimensional recordings by introducing CwA-T, a hybrid framework that combines a channelwise CNN-based autoencoder with a lightweight single-head transformer classifier. The channelwise autoencoder preserves channel independence while drastically reducing data dimensionality, enabling efficient compression, and the single-head transformer models long-range dependencies in the latent space. On the TUH Abnormal EEG Corpus, CwA-T achieves strong per-case accuracy ($85.0\%$) with high specificity ($91.2\%$) and balanced sensitivity ($76.2\%$), while requiring only $202$M FLOPs and $2.9$M parameters, outperforming several baselines and substantially reducing computational cost compared with a full transformer. The framework maintains interpretability through its channelwise structure, and results suggest potential for real-time clinical deployment and advanced neuroscience research, with future work exploring attention-based connectivity and multi-modal extensions.
Abstract
Electroencephalogram (EEG) signals are critical for detecting abnormal brain activity, but their high dimensionality and complexity pose significant challenges for effective analysis. In this paper, we propose CwA-T, a novel framework that combines a channelwise CNN-based autoencoder with a single-head transformer classifier for efficient EEG abnormality detection. The channelwise autoencoder compresses raw EEG signals while preserving channel independence, reducing computational costs and retaining biologically meaningful features. The compressed representations are then fed into the transformer-based classifier, which efficiently models long-term dependencies to distinguish between normal and abnormal signals. Evaluated on the TUH Abnormal EEG Corpus, the proposed model achieves 85.0% accuracy, 76.2% sensitivity, and 91.2% specificity at the per-case level, outperforming baseline models such as EEGNet, Deep4Conv, and FusionCNN. Furthermore, CwA-T requires only 202M FLOPs and 2.9M parameters, making it significantly more efficient than transformer-based alternatives. The framework retains interpretability through its channelwise design, demonstrating great potential for future applications in neuroscience research and clinical practice. The source code is available at https://github.com/YossiZhao/CAE-T.
