Cognitive Neural Architecture Search Reveals Hierarchical Entailment
Lukas Kuhn, Sari Saba-Sadiya, Gemma Roig
TL;DR
This work investigates whether hierarchical entailment in the primate ventral stream can emerge when CNN architectures are evolved to align with late-ventral representations. It employs an evolutionary neural architecture search (NAS) framework that optimizes architecture for brain-alignment using fMRI data from the NSD dataset, evaluating fitness via ridge-encoded fMRI predictions. The evolved architectures (notably EvoV2, EvoV4, EvoIT) achieve brain-alignment superior to several manually designed CNNs and reveal subnetworks indicative of a hierarchical processing cascade, with IT-aligned models containing components predictive of earlier regions. Training the best-evolved architectures on CIFAR-10 shows competitive classification performance and highlights that gradient-based learning can modulate brain-alignment metrics, suggesting NAS as a powerful tool for computational cognitive neuroscience with potential to reduce manually designed architectures.
Abstract
Recent research has suggested that the brain is more shallow than previously thought, challenging the traditionally assumed hierarchical structure of the ventral visual pathway. Here, we demonstrate that optimizing convolutional network architectures for brain-alignment via evolutionary neural architecture search results in models with clear representational hierarchies. Despite having random weights, the identified models achieve brain-alignment scores surpassing even those of pretrained classification models - as measured by both regression and representational similarity analysis. Furthermore, through traditional supervised training, architectures optimized for alignment with late ventral regions become competitive classification models. These findings suggest that hierarchical structure is a fundamental mechanism of primate visual processing. Finally, this work demonstrates the potential of neural architecture search as a framework for computational cognitive neuroscience research that could reduce the field's reliance on manually designed convolutional networks.
