AVM: Towards Structure-Preserving Neural Response Modeling in the Visual Cortex Across Stimuli and Individuals
Qi Xu, Shuai Gong, Xuming Ran, Haihua Luo, Yangfan Hu
TL;DR
The paper tackles the brittleness of neural response models under stimulus and subject variability by proposing AVM, a structure-function decoupled framework that freezes a Vision Transformer backbone while introducing modular, condition-aware modulation units. This design enables localized, interpretable adaptation without retraining core representations. Across stimulus changes, cross-subject transfer, and cross-dataset adaptation on two large-scale mouse V1 datasets, AVM achieves superior predictive accuracy with significantly fewer trainable parameters, demonstrating robust generalization and efficiency. The work offers a biologically inspired approach to cortical modeling with implications for neuroscience and biologically grounded AI systems.
Abstract
While deep learning models have shown strong performance in simulating neural responses, they often fail to clearly separate stable visual encoding from condition-specific adaptation, which limits their ability to generalize across stimuli and individuals. We introduce the Adaptive Visual Model (AVM), a structure-preserving framework that enables condition-aware adaptation through modular subnetworks, without modifying the core representation. AVM keeps a Vision Transformer-based encoder frozen to capture consistent visual features, while independently trained modulation paths account for neural response variations driven by stimulus content and subject identity. We evaluate AVM in three experimental settings, including stimulus-level variation, cross-subject generalization, and cross-dataset adaptation, all of which involve structured changes in inputs and individuals. Across two large-scale mouse V1 datasets, AVM outperforms the state-of-the-art V1T model by approximately 2% in predictive correlation, demonstrating robust generalization, interpretable condition-wise modulation, and high architectural efficiency. Specifically, AVM achieves a 9.1% improvement in explained variance (FEVE) under the cross-dataset adaptation setting. These results suggest that AVM provides a unified framework for adaptive neural modeling across biological and experimental conditions, offering a scalable solution under structural constraints. Its design may inform future approaches to cortical modeling in both neuroscience and biologically inspired AI systems.
