Prompt Your Brain: Scaffold Prompt Tuning for Efficient Adaptation of fMRI Pre-trained Model
Zijian Dong, Yilei Wu, Zijiao Chen, Yichi Zhang, Yueming Jin, Juan Helen Zhou
TL;DR
This work tackles the problem of adapting large-scale fMRI pre-trained representations to low-resource downstream tasks, where full fine-tuning risks overfitting and distorting learned features. It introduces Scaffold Prompt Tuning (ScaPT), a parameter-efficient framework with a hierarchical prompt structure and a Deeply-conditioned Input-Prompt (DIP) mapping that maps inputs into prompt spaces while freezing the backbone. ScaPT operates in two stages—Source Training to build group-level phenotype prompts from modular prompts, and Target Training to fuse these knowledge prompts with a new target prompt via attention-guided interpolation—resulting in updates to only about 2% of parameters. Experiments on resting-state fMRI data for neurodegenerative diagnosis/prognosis and personality trait prediction show ScaPT outperforms fine-tuning and multitask prompt baselines, with interpretable attention patterns aligning with known brain-behavior relationships. Overall, ScaPT offers an efficient, interpretable path for deploying large-scale fMRI representations in low-resource clinical and behavioral neuroimaging tasks.
Abstract
We introduce Scaffold Prompt Tuning (ScaPT), a novel prompt-based framework for adapting large-scale functional magnetic resonance imaging (fMRI) pre-trained models to downstream tasks, with high parameter efficiency and improved performance compared to fine-tuning and baselines for prompt tuning. The full fine-tuning updates all pre-trained parameters, which may distort the learned feature space and lead to overfitting with limited training data which is common in fMRI fields. In contrast, we design a hierarchical prompt structure that transfers the knowledge learned from high-resource tasks to low-resource ones. This structure, equipped with a Deeply-conditioned Input-Prompt (DIP) mapping module, allows for efficient adaptation by updating only 2% of the trainable parameters. The framework enhances semantic interpretability through attention mechanisms between inputs and prompts, and it clusters prompts in the latent space in alignment with prior knowledge. Experiments on public resting state fMRI datasets reveal ScaPT outperforms fine-tuning and multitask-based prompt tuning in neurodegenerative diseases diagnosis/prognosis and personality trait prediction, even with fewer than 20 participants. It highlights ScaPT's efficiency in adapting pre-trained fMRI models to low-resource tasks.
