Large Cognition Model: Towards Pretrained EEG Foundation Model
Chi-Sheng Chen, Ying-Jung Chen, Aidan Hung-Wen Tsai
TL;DR
EEG data suffer from low SNR and high inter-subject variability, limiting scalable foundation models. The authors propose Large Cognition Model (LCM), a transformer-based EEG foundation model trained with self-supervised contrastive learning and cross-montage encoding to enable generalization across datasets and tasks. Key contributions include a learnable cross-montage channel mapping, a dual-encoder contrastive framework with masked feature reconstruction, and strong cross-dataset performance that surpasses several pretrained baselines even without extensive pretraining. This work advances universal EEG representations with potential impact on neuroscience, personalized medicine, and brain-computer interface technologies.
Abstract
Electroencephalography provides a non-invasive window into brain activity, offering valuable insights for neurological research, brain-computer interfaces, and clinical diagnostics. However, the development of robust machine learning models for EEG analysis is hindered by the scarcity of large-scale, well-annotated datasets and the inherent variability of EEG signals across subjects and recording conditions. Inspired by the success of foundation models in natural language processing and computer vision, we propose the Large Cognition Model-a transformer-based foundation model designed to generalize across diverse EEG datasets and downstream tasks. Unlike traditional approaches, our proposed transformer-based architecture demonstrates strong generalization capabilities across datasets and tasks, even without pretraining, surpassing some existing EEG universal models on specific downstream applications. LCM leverages large-scale self-supervised learning techniques to capture universal EEG representations, enabling efficient fine-tuning for applications such as cognitive state decoding, disease classification, and neurofeedback systems. We introduce a novel architecture that integrates temporal and spectral attention mechanisms, optimizing the model's ability to extract meaningful features from raw EEG signals. Extensive evaluations demonstrate that LCM outperforms state-of-the-art approaches across multiple EEG benchmarks, exhibiting strong cross-subject and cross-task generalization. Our findings highlight the potential of pretrained EEG foundation models to accelerate advancements in neuroscience, personalized medicine, and BCI technology.
