Table of Contents
Fetching ...

Subject-Independent Deep Architecture for EEG-based Motor Imagery Classification

Shadi Sartipi, Mujdat Cetin

TL;DR

This work tackles the problem of subject variability and limited labeled data in EEG-based motor imagery classification by introducing a subject-independent semi-supervised deep architecture (SSDA). The approach combines a columnar spatio-temporal auto-encoder (CST-AE) for unsupervised latent feature learning with a supervised classifier, trained end-to-end using both labeled and unlabeled samples, and strengthened by a center loss term and dimensional scaling. Empirical results on PhysioNet and BCI IV 2a show state-of-the-art performance in the subject-independent setting, with strong gains even when only a small fraction of labels is available, indicating potential to reduce calibration needs and accelerate practical BCI deployment. The method’s robustness to raw EEG and its generative–discriminative combination suggest broad applicability to related EEG decoding tasks and other non-invasive neural interfaces.

Abstract

Motor imagery (MI) classification based on electroencephalogram (EEG) is a widely-used technique in non-invasive brain-computer interface (BCI) systems. Since EEG recordings suffer from heterogeneity across subjects and labeled data insufficiency, designing a classifier that performs the MI independently from the subject with limited labeled samples would be desirable. To overcome these limitations, we propose a novel subject-independent semi-supervised deep architecture (SSDA). The proposed SSDA consists of two parts: an unsupervised and a supervised element. The training set contains both labeled and unlabeled data samples from multiple subjects. First, the unsupervised part, known as the columnar spatiotemporal auto-encoder (CST-AE), extracts latent features from all the training samples by maximizing the similarity between the original and reconstructed data. A dimensional scaling approach is employed to reduce the dimensionality of the representations while preserving their discriminability. Second, a supervised part learns a classifier based on the labeled training samples using the latent features acquired in the unsupervised part. Moreover, we employ center loss in the supervised part to minimize the embedding space distance of each point in a class to its center. The model optimizes both parts of the network in an end-to-end fashion. The performance of the proposed SSDA is evaluated on test subjects who were not seen by the model during the training phase. To assess the performance, we use two benchmark EEG-based MI task datasets. The results demonstrate that SSDA outperforms state-of-the-art methods and that a small number of labeled training samples can be sufficient for strong classification performance.

Subject-Independent Deep Architecture for EEG-based Motor Imagery Classification

TL;DR

This work tackles the problem of subject variability and limited labeled data in EEG-based motor imagery classification by introducing a subject-independent semi-supervised deep architecture (SSDA). The approach combines a columnar spatio-temporal auto-encoder (CST-AE) for unsupervised latent feature learning with a supervised classifier, trained end-to-end using both labeled and unlabeled samples, and strengthened by a center loss term and dimensional scaling. Empirical results on PhysioNet and BCI IV 2a show state-of-the-art performance in the subject-independent setting, with strong gains even when only a small fraction of labels is available, indicating potential to reduce calibration needs and accelerate practical BCI deployment. The method’s robustness to raw EEG and its generative–discriminative combination suggest broad applicability to related EEG decoding tasks and other non-invasive neural interfaces.

Abstract

Motor imagery (MI) classification based on electroencephalogram (EEG) is a widely-used technique in non-invasive brain-computer interface (BCI) systems. Since EEG recordings suffer from heterogeneity across subjects and labeled data insufficiency, designing a classifier that performs the MI independently from the subject with limited labeled samples would be desirable. To overcome these limitations, we propose a novel subject-independent semi-supervised deep architecture (SSDA). The proposed SSDA consists of two parts: an unsupervised and a supervised element. The training set contains both labeled and unlabeled data samples from multiple subjects. First, the unsupervised part, known as the columnar spatiotemporal auto-encoder (CST-AE), extracts latent features from all the training samples by maximizing the similarity between the original and reconstructed data. A dimensional scaling approach is employed to reduce the dimensionality of the representations while preserving their discriminability. Second, a supervised part learns a classifier based on the labeled training samples using the latent features acquired in the unsupervised part. Moreover, we employ center loss in the supervised part to minimize the embedding space distance of each point in a class to its center. The model optimizes both parts of the network in an end-to-end fashion. The performance of the proposed SSDA is evaluated on test subjects who were not seen by the model during the training phase. To assess the performance, we use two benchmark EEG-based MI task datasets. The results demonstrate that SSDA outperforms state-of-the-art methods and that a small number of labeled training samples can be sufficient for strong classification performance.
Paper Structure (21 sections, 13 equations, 7 figures, 7 tables)

This paper contains 21 sections, 13 equations, 7 figures, 7 tables.

Figures (7)

  • Figure 1: Motor imagery EEG acquisition experiment.
  • Figure 2: Block-diagram of the proposed subject-independent semi-supervised deep architecture.
  • Figure 3: Proposed columnar spatio-temporal auto-encoder (CST-AE) architecture. Yellow and grey blocks represent CNN and LSTM layers, respectively. Orange blocks are the latent representations that are the outputs of the attention mechanism. (col: Column).
  • Figure 4: Normalized confusion matrix when $N_l\ll N$, (left) $10\%$ and (right) $30\%$ labeled training data samples on the two-class PhysioNet dataset.
  • Figure 5: Normalized confusion matrix when $N_l\ll N$, (left) $10\%$ and (right) $30\%$ labeled training data samples on the four-class BCI competition IV 2a dataset.
  • ...and 2 more figures