Efficient Transformer-Integrated Deep Neural Architectures for Robust EEG Decoding of Complex Visual Imagery
Byoung-Hee Kwon
TL;DR
Problem: decoding complex visual imagery from non-invasive EEG for BCI applications is challenged by spatial variability and limited data. Approach: the authors propose a PLV-guided functional connectivity neural network (FCDN) that combines a CNN-based temporal encoder with a DeiT transformer for spatial feature learning, leveraging delta, theta, and alpha bands and a distillation-based DeiT for data efficiency. Contributions: offline analyses show an average accuracy of 0.7234 across 15 subjects, LOSO cross-validation yields 0.4960, and pseudo-online decoding surpasses 0.75, with the functional connectivity block enhancing spatial discriminability. Significance: the method demonstrates robust, subject-independent EEG-based control suitable for EEG-driven robotic arms, with a validated 3D-BCI training setup supporting near real-time decoding and broader applicability.
Abstract
This study introduces a pioneering approach in brain-computer interface (BCI) technology, featuring our novel concept of complex visual imagery for non-invasive electroencephalography (EEG)-based communication. Complex visual imagery, as proposed in our work, involves the user engaging in the mental visualization of complex upper limb movements. This innovative approach significantly enhances the BCI system, facilitating the extension of its applications to more sophisticated tasks such as EEG-based robotic arm control. By leveraging this advanced form of visual imagery, our study opens new horizons for intricate and intuitive mind-controlled interfaces. We developed an advanced deep learning architecture that integrates functional connectivity metrics with a convolutional neural network-image transformer. This framework is adept at decoding subtle user intentions, addressing the spatial variability in complex visual tasks, and effectively translating these into precise commands for robotic arm control. Our comprehensive offline and pseudo-online evaluations demonstrate the framework's efficacy in real-time applications, including the nuanced control of robotic arms. The robustness of our approach is further validated through leave-one-subject-out cross-validation, marking a significant step towards versatile, subject-independent BCI applications. This research highlights the transformative impact of advanced visual imagery and deep learning in enhancing the usability and adaptability of BCI systems, particularly in robotic arm manipulation.
