SpellerSSL: Self-Supervised Learning with P300 Aggregation for Speller BCIs

Jiazhen Hong; Geoff Mackellar; Soheila Ghane

SpellerSSL: Self-Supervised Learning with P300 Aggregation for Speller BCIs

Jiazhen Hong, Geoff Mackellar, Soheila Ghane

TL;DR

SpellerSSL addresses the core bottlenecks of EEG-based P300 spellers—low SNR, poor generalization, and lengthy calibration—by integrating self-supervised pretraining on a customized 1D U-Net with a lightweight ERP-Head for P300 detection, plus a P300 aggregation scheme that denoises training signals. Pretraining uses a reconstruction objective with time masking and a frequency-domain consistency term across cross-domain and in-domain EEG data, followed by downstream fine-tuning on subject data. Results show that in-domain SSL with moderate aggregation (G=2) delivers state-of-the-art CRR (94% at 7 repetitions) and high ITR (up to 21.86 bits/min), while substantially reducing calibration needs (up to 60%). Cross-domain SSL also demonstrates strong transferability, highlighting the potential for EEG foundation models in P300 speller BCIs and practical improvements in efficiency and generalization.

Abstract

Electroencephalogram (EEG)-based P300 speller brain-computer interfaces (BCIs) face three main challenges: low signal-to-noise ratio (SNR), poor generalization, and time-consuming calibration. We propose SpellerSSL, a framework that combines self-supervised learning (SSL) with P300 aggregation to address these issues. First, we introduce an aggregation strategy to enhance SNR. Second, to achieve generalization in training, we employ a customized 1D U-Net backbone and pretrain the model on both cross-domain and in-domain EEG data. The pretrained model is subsequently fine-tuned with a lightweight ERP-Head classifier for P300 detection, which adapts the learned representations to subject-specific data. Our evaluations on calibration time demonstrate that combining the aggregation strategy with SSL significantly reduces the calibration burden per subject and improves robustness across subjects. Experimental results show that SSL learns effective EEG representations in both in-domain and cross-domain, with in-domain achieving a state-of-the-art character recognition rate of 94% with only 7 repetitions and the highest information transfer rate (ITR) of 21.86 bits/min on the public II-B dataset. Moreover, in-domain SSL with P300 aggregation reduces the required calibration size by 60% while maintaining a comparable character recognition rate. To the best of our knowledge, this is the first study to apply SSL to P300 spellers, highlighting its potential to improve both efficiency and generalization in speller BCIs and paving the way toward an EEG foundation model for P300 speller BCIs.

SpellerSSL: Self-Supervised Learning with P300 Aggregation for Speller BCIs

TL;DR

Abstract

SpellerSSL: Self-Supervised Learning with P300 Aggregation for Speller BCIs

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (8)