GCMCG: A Clustering-Aware Graph Attention and Expert Fusion Network for Multi-Paradigm, Multi-task, and Cross-Subject EEG Decoding
Yiqiao Chen, Zijian Huang, Juchi He, Fazheng Xu, Zhenghui Feng
TL;DR
The paper tackles robust decoding of MI-ME EEG signals across subjects and paradigms by introducing GCMCG, a graph-guided, clustering-aware Mixture-of-Experts framework. It combines ICA-WT denoising, a learnable electrode tokenizer with Graph Attention networks, spectral clustering to form functional regions, region-specific CNN-GRU experts, and entropy-regularized MoE fusion, all trained with a three-stage strategy to improve cross-subject generalization. Empirical results across EEGmmidb, BCIC-IV 2a, and M3CV show state-of-the-art performance on challenging datasets and strong cross-dataset stability, along with extensive ablations and interpretability analyses. The work advances practical BCI by enabling flexible electrode layouts, variable sequence lengths, and robust multi-paradigm decoding, paving the way for real-world, plug-and-play EEG-based control systems.
Abstract
Brain-Computer Interfaces (BCIs) based on Motor Execution (ME) and Motor Imagery (MI) electroencephalogram (EEG) signals offer a direct pathway for human-machine interaction. However, developing robust decoding models remains challenging due to the complex spatio-temporal dynamics of EEG, its low signal-to-noise ratio, and the limited generalizability of many existing approaches across subjects and paradigms. To address these issues, this paper proposes Graph-guided Clustering Mixture-of-Experts CNN-GRU (GCMCG), a novel unified framework for MI-ME EEG decoding. Our approach integrates a robust preprocessing stage using Independent Component Analysis and Wavelet Transform (ICA-WT) for effective denoising. We further introduce a pre-trainable graph tokenization module that dynamically models electrode relationships via a Graph Attention Network (GAT), followed by unsupervised spectral clustering to decompose signals into interpretable functional brain regions. Each region is processed by a dedicated CNN-GRU expert network, and a gated fusion mechanism with L1 regularization adaptively combines these local features with a global expert. This Mixture-of-Experts (MoE) design enables deep spatio-temporal fusion and enhances representational capacity. A three-stage training strategy incorporating focal loss and progressive sampling is employed to improve cross-subject generalization and handle class imbalance. Evaluated on three public datasets of varying complexity (EEGmmidb-BCI2000, BCI-IV 2a, and M3CV), GCMCG achieves overall accuracies of 86.60%, 98.57%, and 99.61%, respectively, which demonstrates its superior effectiveness and strong generalization capability for practical BCI applications.
