EEG-to-Text Translation: A Model for Deciphering Human Brain Activity
Saydul Akbar Murad, Ashim Dahal, Nick Rahimi
TL;DR
This work tackles EEG-to-text translation by introducing R1 Translator, an encoder–decoder model that combines a Bi-LSTM EEG encoder with a pretrained BART text generator. It employs a two-step fine-tuning strategy to adapt the language model to EEG inputs, evaluated on ZuCo V1 and V2 datasets with strong, consistent improvements over T5 and Brain Translator across BLEU, ROUGE, WER, and CER metrics. The results establish state-of-the-art performance for EEG-to-text decoding on these datasets and demonstrate both robustness and practical potential, while also outlining limitations and directions for real-time, scalable brain-computer interfaces. Future work includes expanding the EEG corpus for broader generalization and integrating the system into real-time, decentralized architectures for instantaneous brain-to-text communication.
Abstract
With the rapid advancement of large language models like Gemini, GPT, and others, bridging the gap between the human brain and language processing has become an important area of focus. To address this challenge, researchers have developed various models to decode EEG signals into text. However, these models still face significant performance limitations. To overcome these shortcomings, we propose a new model, R1 Translator, which aims to improve the performance of EEG-to-text decoding. The R1 Translator model combines a bidirectional LSTM encoder with a pretrained transformer-based decoder, utilizing EEG features to produce high-quality text outputs. The model processes EEG embeddings through the LSTM to capture sequential dependencies, which are then fed into the transformer decoder for effective text generation. The R1 Translator excels in ROUGE metrics, outperforming both T5 (previous research) and Brain Translator. Specifically, R1 achieves a ROUGE-1 score of 38.00% (P), which is up to 9% higher than T5 (34.89%) and 3% better than Brain (35.69%). It also leads in ROUGE-L, with a F1 score of 32.51%, outperforming T5 by 3% (29.67%) and Brain by 2% (30.38%). In terms of CER, R1 achieves a CER of 0.5795, which is 2% lower than T5 (0.5917) and 4% lower than Brain (0.6001). Additionally, R1 performs better in WER with a score of 0.7280, outperforming T5 by 4.3% (0.7610) and Brain by 3.6% (0.7553). Code is available at https://github.com/Mmurrad/EEG-To-text.
