Open-vocabulary Auditory Neural Decoding Using fMRI-prompted LLM
Xiaoyu Chen, Changde Du, Che Liu, Yizhe Wang, Huiguang He
TL;DR
This work tackles open-vocabulary auditory text decoding from fMRI by introducing BP-GPT, which uses brain-derived prompts to steer GPT-2 and generate target text. It couples a text-to-text baseline to derive an optimal text prompt with a brain-to-text pathway and a contrastive alignment objective to bridge modality gaps. Empirical results show BP-GPT achieves meaningful gains in METEOR and BERTScore over prior methods, validating the viability of brain-prompts for LLM-driven neural decoding and highlighting the role of prompt design and alignment. The method offers a flexible, future-proof framework that can adapt to stronger LLMs and broader neural-decoding tasks as imaging modalities and models evolve.
Abstract
Decoding language information from brain signals represents a vital research area within brain-computer interfaces, particularly in the context of deciphering the semantic information from the fMRI signal. However, many existing efforts concentrate on decoding small vocabulary sets, leaving space for the exploration of open vocabulary continuous text decoding. In this paper, we introduce a novel method, the \textbf{Brain Prompt GPT (BP-GPT)}. By using the brain representation that is extracted from the fMRI as a prompt, our method can utilize GPT-2 to decode fMRI signals into stimulus text. Further, we introduce a text-to-text baseline and align the fMRI prompt to the text prompt. By introducing the text-to-text baseline, our BP-GPT can extract a more robust brain prompt and promote the decoding of pre-trained LLM. We evaluate our BP-GPT on the open-source auditory semantic decoding dataset and achieve a significant improvement up to $4.61\%$ on METEOR and $2.43\%$ on BERTScore across all the subjects compared to the state-of-the-art method. The experimental results demonstrate that using brain representation as a prompt to further drive LLM for auditory neural decoding is feasible and effective.
