TopoNets: High Performing Vision and Language Models with Brain-Like Topography
Mayukh Deb, Mainak Deb, N. Apurva Ratan Murty
TL;DR
TopoNets address the lack of brain-like topography in artificial networks by introducing TopoLoss, a brain-inspired inductive bias that reshapes weights into cortical sheets and enforces locality through a differentiable blurring objective. The approach generalizes across vision and language architectures (CNNs and transformers) and yields high task performance while producing brain-like, low-dimensional representations. Empirically, TopoNets outperform prior topo methods on ImageNet and BrainScore benchmarks, enable parameter-efficient representations via pruning and downsampling, and reproduce key brain topographic signatures in both visual and language cortices. This work advances efficient, interpretable AI that more closely mirrors human cortical computation, with potential impact on scalable deployment and neuroscientific modeling.
Abstract
Neurons in the brain are organized such that nearby cells tend to share similar functions. AI models lack this organization, and past efforts to introduce topography have often led to trade-offs between topography and task performance. In this work, we present TopoLoss, a new loss function that promotes spatially organized topographic representations in AI models without significantly sacrificing task performance. TopoLoss is highly adaptable and can be seamlessly integrated into the training of leading model architectures. We validate our method on both vision (ResNet-18, ResNet-50, ViT) and language models (GPT-Neo-125M, NanoGPT), collectively TopoNets. TopoNets are the highest-performing supervised topographic models to date, exhibiting brain-like properties such as localized feature processing, lower dimensionality, and increased efficiency. TopoNets also predict responses in the brain and replicate the key topographic signatures observed in the brain's visual and language cortices. Together, this work establishes a robust and generalizable framework for integrating topography into leading model architectures, advancing the development of high-performing models that more closely emulate the computational strategies of the human brain.
