Naturally Computed Scale Invariance in the Residual Stream of ResNet18
André Longon
TL;DR
This paper investigates how scale invariance emerges in visual object recognition by examining the residual stream of ResNet18. Using center-neuron feature visualizations and a defined scale transform $S$, the author identifies scale-invariant channels that arise when a block input's smaller-scale copy of a feature is summed element-wise with the block pre-sum output's larger-scale copy to form the block's post-sum (Post) output. Ablation experiments indicate that removing these channels disproportionately harms scale-robust recognition, providing tentative causal evidence that the residual stream contributes to scale invariance and suggesting bypass connections as a potential mechanism. The work extends mechanistic interpretability to an architecture beyond InceptionV1 and offers a bridge to neuroscience by hypothesizing how bypass-like pathways may compute invariance.
Abstract
An important capacity in visual object recognition is invariance to image-altering variables that leave the identity of objects unchanged, such as lighting, rotation, and scale. How do neural networks achieve this? Prior mechanistic interpretability research has illuminated some invariance-building circuitry in InceptionV1, but the results are limited and networks with different architectures have remained largely unexplored. This work investigates ResNet18 with a particular focus on its residual stream, an architectural component that InceptionV1 lacks. We observe that many convolutional channels in intermediate blocks exhibit scale-invariant properties, computed by the element-wise residual summation of scale-equivariant representations: the block input's smaller-scale copy with the block pre-sum output's larger-scale copy. Through subsequent ablation experiments, we attempt to causally link these neural properties with scale-robust object recognition behavior. Our tentative findings suggest how the residual stream computes scale invariance and its possible role in behavior. Code is available at: https://github.com/cest-andre/residual-stream-interp
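
The summation mechanism described above can be illustrated with a toy sketch. It is not taken from the paper's code: the Gaussian tuning curve in log-scale, the preferred scales (1.0 and 2.0), the tuning width, and every function name are hypothetical choices made for illustration only. The sketch models one channel's center-neuron response as a function of stimulus scale, once for the block input (tuned to the smaller scale) and once for the block pre-sum output (tuned to the larger scale), and then forms the post-sum response as the ReLU of their element-wise sum, as in a residual basic block.

```python
import math


def tuning(scale, preferred, width=0.3):
    """Hypothetical center-neuron response: a Gaussian in log-scale."""
    d = math.log(scale) - math.log(preferred)
    return math.exp(-d * d / (2 * width * width))


def block_input(scale):
    # Assumed: the input channel prefers the smaller-scale copy of the feature.
    return tuning(scale, preferred=1.0)


def block_pre_sum(scale):
    # Assumed: the pre-sum channel prefers the same feature at twice the scale.
    return tuning(scale, preferred=2.0)


def block_post(scale):
    # Residual summation followed by ReLU.
    return max(0.0, block_input(scale) + block_pre_sum(scale))


def invariance_ratio(response, scales=(1.0, 2.0)):
    """Weakest over strongest response across scales (1.0 = fully invariant)."""
    values = [response(s) for s in scales]
    return min(values) / max(values)


if __name__ == "__main__":
    for name, fn in [("input", block_input),
                     ("pre-sum", block_pre_sum),
                     ("post", block_post)]:
        print(f"{name:8s} invariance ratio: {invariance_ratio(fn):.3f}")
```

In this toy setting each summand responds strongly at only one of the two scales, while their sum responds about equally at both, which is the sense in which summing scale-equivariant copies can yield a scale-invariant channel. Real channels need not have symmetric or Gaussian tuning, so the ratio is only a rough stand-in for whatever invariance measure the paper uses.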
