EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

Yuqi Chen; Kan Ren; Kaitao Song; Yansen Wang; Yifan Wang; Dongsheng Li; Lili Qiu

EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

Yuqi Chen, Kan Ren, Kaitao Song, Yansen Wang, Yifan Wang, Dongsheng Li, Lili Qiu

TL;DR

EEGFormer tackles the challenge of learning transferable, interpretable representations from large-scale unlabeled EEG data. It introduces a discrete vector-quantized pretraining framework that tokenizes multivariate EEG patches via a codebook and trains a lightweight decoder to reconstruct inputs. Pretraining on 1.7TB TUH data yields universal representations that transfer to multiple downstream tasks, including seizure detection and neonatal seizure localization, with improved metrics over baselines. The model supports interpretability by analyzing discrete tokens and codebook indices, enabling localization of salient patterns without heavy supervision.

Abstract

Self-supervised learning has emerged as a highly effective approach in the fields of natural language processing and computer vision. It is also applicable to brain signals such as electroencephalography (EEG) data, given the abundance of available unlabeled data that exist in a wide spectrum of real-world medical applications ranging from seizure detection to wave analysis. The existing works leveraging self-supervised learning on EEG modeling mainly focus on pretraining upon each individual dataset corresponding to a single downstream task, which cannot leverage the power of abundant data, and they may derive sub-optimal solutions with a lack of generalization. Moreover, these methods rely on end-to-end model learning which is not easy for humans to understand. In this paper, we present a novel EEG foundation model, namely EEGFormer, pretrained on large-scale compound EEG data. The pretrained model cannot only learn universal representations on EEG signals with adaptable performance on various downstream tasks but also provide interpretable outcomes of the useful patterns within the data. To validate the effectiveness of our model, we extensively evaluate it on various downstream tasks and assess the performance under different transfer settings. Furthermore, we demonstrate how the learned model exhibits transferable anomaly detection performance and provides valuable interpretability of the acquired patterns via self-supervised learning.

EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

TL;DR

Abstract

Paper Structure (21 sections, 1 equation, 3 figures, 2 tables)

This paper contains 21 sections, 1 equation, 3 figures, 2 tables.

Introduction
Related Work
Pretraining for Time-Series Data
Pretraining for EEG data
EEGFormer: Vector-Quantized Pretraining Transformer for EEG Data
Feature Preprocessing
Slice & Encode
Vector Quantizer
Decode & Reconstruct
Training Loss
Downstream Fine-Tuning
Experimental Results
Datasets Description
Parameter Setting
Compared Baselines
...and 6 more sections

Figures (3)

Figure 1: Overview of EEGFormer. Initially, multi-variate EEG signals are segmented into patches, which are then passed through a Transformer encoder. Subsequently, a vector-quantized model is employed to generate discrete indices. These indices are then fed into a shallow Transformer decoder.
Figure 2: Influence of pretrain epochs on two TUH corpus.
Figure 3: Interpretation results from naive Bayes model.

EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

TL;DR

Abstract

EEGFormer: Towards Transferable and Interpretable Large-Scale EEG Foundation Model

Authors

TL;DR

Abstract

Table of Contents

Figures (3)