A unified theory of feature learning in RNNs and DNNs
Jan P. Bauer, Kirsten Fischer, Moritz Helias, Agostina Palmigiano
TL;DR
This work tackles why RNNs and DNNs, despite structural similarities, exhibit different functional properties. It develops a unified mean-field kernel theory in the feature-learning (μP) regime, framing training as Bayesian inference over sequences and patterns and deriving a kernel-based description that treats RNNs and DNNs on equal footing. A key finding is a phase transition in endpoint-supervised tasks: below a critical learning signal the kernels of RNNs and DNNs coincide, but above it RNNs develop temporal coherence across timesteps due to weight sharing, with an outlier in the weight spectrum; in sequential tasks, weight sharing provides an inductive bias that enables sample-efficient generalization by interpolating across unsupervised time steps. The framework connects architectural structure to functional biases, offering a principled lens to understand and leverage temporal feature learning in networks.
Abstract
Recurrent and deep neural networks (RNNs/DNNs) are cornerstone architectures in machine learning. Remarkably, RNNs differ from DNNs only by weight sharing, as can be shown through unrolling in time. How does this structural similarity fit with the distinct functional properties these networks exhibit? To address this question, we here develop a unified mean-field theory for RNNs and DNNs in terms of representational kernels, describing fully trained networks in the feature learning ($μ$P) regime. This theory casts training as Bayesian inference over sequences and patterns, directly revealing the functional implications induced by the RNNs' weight sharing. In DNN-typical tasks, we identify a phase transition when the learning signal overcomes the noise due to randomness in the weights: below this threshold, RNNs and DNNs behave identically; above it, only RNNs develop correlated representations across timesteps. For sequential tasks, the RNNs' weight sharing furthermore induces an inductive bias that aids generalization by interpolating unsupervised time steps. Overall, our theory offers a way to connect architectural structure to functional biases.
