Interpreting the Residual Stream of ResNet18
André Longon
TL;DR
Problem: Understanding how DNNs compute visual representations remains incomplete; this work targets the residual stream of ResNet18 to gain mechanistic insight. Approach: It uses feature visualizations to examine center-neuron activations, defines a mix ratio $M_c = O_c(I)/O_c(B)$, and applies scale-invariance criteria built from scale operators $S$ and $S^{-1}$, plus a scale metric $SM_c$, to identify scale-invariant residual features. Findings: The residual stream shows a skip-overwrite-mix spectrum; some early- to mid-block channels exhibit scale-invariant behavior; weight and correlation analyses suggest bypass by lower-magnitude block outputs and mixed evidence for inhibitory interactions. Significance: Demonstrates a concrete mechanistic interpretation of residual streams, supporting scale-equivariance ideas and suggesting potential universality across architectures and even biological visual systems.
Abstract
A mechanistic understanding of the computations learned by deep neural networks (DNNs) is far from complete. In the domain of visual object recognition, prior research has illuminated inner workings of InceptionV1, but DNNs with different architectures have remained largely unexplored. This work investigates ResNet18 with a particular focus on its residual stream, an architectural mechanism which InceptionV1 lacks. We observe that for a given block, channel features of the stream are updated along a spectrum: either the input feature skips to the output, the block feature overwrites the output, or the output is some mixture between the input and block features. Furthermore, we show that many residual stream channels compute scale invariant representations through a mixture of the input's smaller-scale feature with the block's larger-scale feature. This not only mounts evidence for the universality of scale equivariance, but also presents how the residual stream further implements scale invariance. Collectively, our results begin an interpretation of the residual stream in visual object recognition, finding it to be a flexible feature manager and a medium to build scale invariant representations.
