EEG-GPT: Exploring Capabilities of Large Language Models for EEG Classification and Interpretation
Jonathan W. Kim, Ahmed Alaa, Danilo Bernardo
TL;DR
This work investigates EEG-GPT, an LLM-based framework for EEG classification and interpretation that operates effectively with minimal labeled data. By converting EEG features into verbal prompts and fine-tuning a base LLM, it achieves competitive normal/abnormal classification in a few-shot setting and maintains non-trivial zero-shot capability via in-context learning. The approach further demonstrates interpretable, stepwise reasoning through Tree of Thought and access to specialized EEG tools (seizure/spike detectors and qEEG comparison), providing transparent decision-making and potential clinical trust. Although DL models trained on raw EEG still outperform EEG-GPT in some cases, EEG-GPT offers data efficiency and explainability, highlighting a promising direction for trustworthy AI-assisted EEG analysis in clinical contexts.
Abstract
In conventional machine learning (ML) approaches applied to electroencephalography (EEG), this is often a limited focus, isolating specific brain activities occurring across disparate temporal scales (from transient spikes in milliseconds to seizures lasting minutes) and spatial scales (from localized high-frequency oscillations to global sleep activity). This siloed approach limits the development EEG ML models that exhibit multi-scale electrophysiological understanding and classification capabilities. Moreover, typical ML EEG approaches utilize black-box approaches, limiting their interpretability and trustworthiness in clinical contexts. Thus, we propose EEG-GPT, a unifying approach to EEG classification that leverages advances in large language models (LLM). EEG-GPT achieves excellent performance comparable to current state-of-the-art deep learning methods in classifying normal from abnormal EEG in a few-shot learning paradigm utilizing only 2% of training data. Furthermore, it offers the distinct advantages of providing intermediate reasoning steps and coordinating specialist EEG tools across multiple scales in its operation, offering transparent and interpretable step-by-step verification, thereby promoting trustworthiness in clinical contexts.
