Bottleneck Structure in Learned Features: Low-Dimension vs Regularity Tradeoff
Arthur Jacot
TL;DR
This work analyzes how deep, $L_{2}$-regularized neural networks bias learned mappings toward low-dimensional representations via the Bottleneck rank, and it develops finite-depth corrections to the infinite-depth picture. By expanding the representation cost as $R(f;oldsymbol{ ext{Ω}},L)=L R^{(0)}(f;oldsymbol{ ext{Ω}})+R^{(1)}(f;oldsymbol{ ext{Ω}})+rac{1}{L}R^{(2)}(f;oldsymbol{ ext{Ω}})+O(L^{-2})$, it introduces a regularity term $R^{(1)}$ that upper-bounds $2\,igl|\log |Jf(x)|_{+}igr|$ and is subadditive under composition and addition, and a second correction $R^{(2)}$ that controls convergence properties in linear and, partially, nonlinear settings. The authors prove a Bottleneck structure in large-depth networks: for balanced parameters with rank $k$ at a given input, most layers have weight matrices whose top $k$ singular values are near $1$ and the remaining singular values shrink like $O(L^{-1/2})$, implying that hidden representations are typically $k$-dimensional across most layers, provided the NTK scales as $O(L)$. They also show that rank-underestimating minima tend to be narrow due to NTK blow-up, offering an explanation for why gradient-based training tends to recover the BN-rank rather than underspecify it. A numerical symmetry-learning experiment illustrates how networks acquire low BN-rank representations by capturing task symmetries in a two-dimensional latent space.
Abstract
Previous work has shown that DNNs with large depth $L$ and $L_{2}$-regularization are biased towards learning low-dimensional representations of the inputs, which can be interpreted as minimizing a notion of rank $R^{(0)}(f)$ of the learned function $f$, conjectured to be the Bottleneck rank. We compute finite depth corrections to this result, revealing a measure $R^{(1)}$ of regularity which bounds the pseudo-determinant of the Jacobian $\left|Jf(x)\right|_{+}$ and is subadditive under composition and addition. This formalizes a balance between learning low-dimensional representations and minimizing complexity/irregularity in the feature maps, allowing the network to learn the `right' inner dimension. Finally, we prove the conjectured bottleneck structure in the learned features as $L\to\infty$: for large depths, almost all hidden representations are approximately $R^{(0)}(f)$-dimensional, and almost all weight matrices $W_{\ell}$ have $R^{(0)}(f)$ singular values close to 1 while the others are $O(L^{-\frac{1}{2}})$. Interestingly, the use of large learning rates is required to guarantee an order $O(L)$ NTK which in turns guarantees infinite depth convergence of the representations of almost all layers.
