Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

Haomiao Chen; Keith W Jamison; Mert R. Sabuncu; Amy Kuceyeski

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

Haomiao Chen, Keith W Jamison, Mert R. Sabuncu, Amy Kuceyeski

TL;DR

The paper tackles the inefficiency and limited generalization of traditional voxel-grid neural encoders by introducing the Neural Response Function (NRF), a coordinate-based implicit representation that predicts fMRI responses as a continuous function over standardized MNI space: $\ hat = \Phi(M,\mathbf{x})$. NRF combines a multi-scale image feature extractor $G$ with a coordinate-conditioned MLP $P$, using Fourier-encoded coordinates to achieve resolution-agnostic predictions. The authors demonstrate data-efficient single-subject encoding and cross-subject transfer via fine-tuning and voxelwise ensemble, exploiting local anatomical smoothness and cross-subject alignment. Empirically, NRF outperforms baselines in low-data regimes, matches or exceeds them with full data, and supports flexible adaptation to new subjects with minimal data, offering a path toward an anatomically grounded, resolution-agnostic digital twin of the brain.

Abstract

Neural encoding models aim to predict fMRI-measured brain responses to natural images. fMRI data is acquired as a 3D volume of voxels, where each voxel has a defined spatial location in the brain. However, conventional encoding models often flatten this volume into a 1D vector and treat voxel responses as independent outputs. This removes spatial context, discards anatomical information, and ties each model to a subject-specific voxel grid. We introduce the Neural Response Function (NRF), a framework that models fMRI activity as a continuous function over anatomical space rather than a flat vector of voxels. NRF represents brain activity as a continuous implicit function: given an image and a spatial coordinate (x, y, z) in standardized MNI space, the model predicts the response at that location. This formulation decouples predictions from the training grid, supports querying at arbitrary spatial resolutions, and enables resolution-agnostic analyses. By grounding the model in anatomical space, NRF exploits two key properties of brain responses: (1) local smoothness -- neighboring voxels exhibit similar response patterns; modeling responses continuously captures these correlations and improves data efficiency, and (2) cross-subject alignment -- MNI coordinates unify data across individuals, allowing a model pretrained on one subject to be fine-tuned on new subjects. In experiments, NRF outperformed baseline models in both intrasubject encoding and cross-subject adaptation, achieving high performance while reducing the data size needed by orders of magnitude. To our knowledge, NRF is the first anatomically aware encoding model to move beyond flattened voxels, learning a continuous mapping from images to brain responses in 3D space.

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

TL;DR

Abstract

Beyond Grid-Locked Voxels: Neural Response Functions for Continuous Brain Encoding

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)