Beyond topography: Topographic regularization improves robustness and reshapes representations in convolutional neural networks
Nhut Truong, Uri Hasson
TL;DR
This study investigates how local topographic regularization shapes robustness and internal representations in end-to-end trained CNNs. By contrasting Weight Similarity (WS) and Activation Similarity (AS) losses, the authors show WS yields smooth, functionally localized maps and greater robustness to weight perturbations, while AS produces non-smooth, striped activation patterns and distinct representational changes. Both regularizers improve robustness relative to non-topographic controls, yet they reshape the latent space differently, influencing activation entropy, dimensionality, and the distribution of category-selective expert units. The work demonstrates that topographic constraints can enhance robustness without necessarily sacrificing performance and provides a framework for understanding how cortical-like spatial organization emerges from learning, with implications for pruning, compression, and future extensions to larger architectures and unsupervised objectives.
Abstract
Topographic convolutional neural networks (TCNNs) are computational models that can simulate aspects of the brain's spatial and functional organization. However, it is unclear whether and how different types of topographic regularization shape robustness, representational structure, and functional organization during end-to-end training. We address this question by comparing TCNNs trained with two local spatial losses applied to a penultimate-layer topographic grid: i) Weight Similarity (WS), whose objective penalizes differences between neighboring units' incoming weight vectors, and ii) Activation Similarity (AS), whose objective penalizes differences between neighboring units' activation patterns over stimuli. We evaluate the trained models on classification accuracy, robustness to weight perturbations and input degradation, the spatial organization of learned representations, and development of category-selective "expert units" in the penultimate layer. Both losses changed inter-unit correlation structure, but in qualitatively different ways. WS produced smooth topographies, with correlated neighborhoods. In contrast, AS produced a bimodal inter-unit correlation structure that lacked spatial smoothness. AS and WS training increased robustness relative to control (non-topographic) models: AS improved robustness to image degradation on CIFAR-10, WS did so on MNIST, and both improved robustness to weight perturbations. WS was also associated with greater input sensitivity at the unit level and stronger functional localization. In addition, as compared to control models, both AS and WS produced differences in orientation tuning, symmetry sensitivity, and eccentricity profiles of units. Together, these results show that local topographic regularization can improve robustness during end-to-end training while systematically reshaping representational structure.
