Neurocognitive Modeling for Text Generation: Deep Learning Architecture for EEG Data
Khushiyant
TL;DR
This work addresses the challenge of EEG-based text generation by introducing a data- and compute-efficient framework that couples an RNN-based EEG feature extractor with a classifier-LLM pipeline built around the Gemma 2B model. By extracting compact EEG representations and leveraging pre-trained language modeling capabilities, the approach enables EEG-conditioned text generation with substantially less training data than traditional end-to-end methods, while maintaining competitive performance and robustness to resource constraints. The study demonstrates notable results on the ImageNet of the Brain dataset, including 2-class and 3-class classification performance and perplexity-based text generation metrics, and highlights practical potential for assistive BCIs. It also discusses limitations in long-range text coherence and emphasizes future directions such as multi-modal inputs, personalized calibration, and real-time deployment to enhance real-world applicability.
Abstract
Text generating capabilities have undergone a substantial transformation with the introduction of large language models (LLMs). Electroencephalography (EEG)-based text production is still difficult, though, because it requires a lot of data and processing power. This paper introduces a new method that combines the use of the Gemma 2B LLM with a classifier-LLM architecture to incorporate a Recurrent Neural Network (RNN) encoder. Our approach drastically lowers the amount of data and compute power needed while achieving performance close to that of cutting-edge methods. Notably, compared to current methodologies, our methodology delivers an overall performance improvement of 10%. The suggested architecture demonstrates the possibility of effective transfer learning for EEG-based text production, remaining strong and functional even in the face of data limits. This work highlights the potential of integrating LLMs with EEG decoding to improve assistive technologies and improve independence and communication for those with severe motor limitations. Our method pushes the limits of present capabilities and opens new paths for research and application in brain-computer interfaces by efficiently using the strengths of pre-trained language models. This makes EEG-based text production more accessible and efficient.
