MindSimulator: Exploring Brain Concept Localization via Synthetic FMRI
Guangyin Bao, Qi Zhang, Zixuan Gong, Zhuojia Wu, Duoqian Miao
TL;DR
MindSimulator tackles the challenge of localizing concept-selective regions in the visual cortex under limited and biased real fMRI data by generating synthetic fMRI conditioned on concept-oriented visual stimuli. It introduces a three-component generative encoding model—a fMRI Autoencoder, a Diffusion Estimator with $T$ timesteps, and an Inference Sampler with multi-trial enhancement and correlated noise—to learn conditional fMRI distributions in a latent space aligned with image representations. Trained on the NSD dataset with CLIP-based cross-modal alignment, MindSimulator yields voxel-level and semantic-level encoding performance that surpass baselines and generalizes to out-of-distribution images such as CIFAR datasets, enabling large-scale localization of both known and novel concept-selective regions. This data-driven synthetic approach broadens neuroscience inquiry by providing priors for concept localization and paving the way for an expanding brain concept atlas that complements traditional fLoc methods.
Abstract
Concept-selective regions within the human cerebral cortex exhibit significant activation in response to specific visual stimuli associated with particular concepts. Precisely localizing these regions stands as a crucial long-term goal in neuroscience to grasp essential brain functions and mechanisms. Conventional experiment-driven approaches hinge on manually constructed visual stimulus collections and corresponding brain activity recordings, constraining the support and coverage of concept localization. Additionally, these stimuli often consist of concept objects in unnatural contexts and are potentially biased by subjective preferences, thus prompting concerns about the validity and generalizability of the identified regions. To address these limitations, we propose a data-driven exploration approach. By synthesizing extensive brain activity recordings, we statistically localize various concept-selective regions. Our proposed MindSimulator leverages advanced generative technologies to learn the probability distribution of brain activity conditioned on concept-oriented visual stimuli. This enables the creation of simulated brain recordings that reflect real neural response patterns. Using the synthetic recordings, we successfully localize several well-studied concept-selective regions and validate them against empirical findings, achieving promising prediction accuracy. The feasibility opens avenues for exploring novel concept-selective regions and provides prior hypotheses for future neuroscience research.
