Information-Theoretic Progress Measures reveal Grokking is an Emergent Phase Transition
Kenzo Clauw, Sebastiano Stramaglia, Daniele Marinazzo
TL;DR
This work tackles grokking, the abrupt generalization of neural networks after extended memorization, by applying higher-order information theory to quantify inter-neuronal interactions. Using the multivariate measure $\Omega_{n}$, defined as $\Omega_{n}(\mathbf{Z}) = (n - 2)H(\mathbf{Z}) + \sum_{j=1}^{n} [H(Z_{j}) - H(\mathbf{Z} \backslash Z_{j})]$, the authors distinguish synergy and redundancy among neuron activations and show that grokking corresponds to an emergent phase transition with three phases: Feature Learning, Emergence, and Decoupling. They demonstrate that weight decay and initialization modulate the emergence, with early synergy peaks offering predictive power for grokking. The study highlights a shift toward emergentism in interpretability and points to limitations due to the toy setup and computational costs of higher-order information estimates, outlining directions for scaling and stronger causal validation. Overall, the paper provides a framework to diagnose and predict grokking through higher-order interactions and contributes to understanding how collective neuronal dynamics drive generalization.
Abstract
This paper studies emergent phenomena in neural networks by focusing on grokking where models suddenly generalize after delayed memorization. To understand this phase transition, we utilize higher-order mutual information to analyze the collective behavior (synergy) and shared properties (redundancy) between neurons during training. We identify distinct phases before grokking allowing us to anticipate when it occurs. We attribute grokking to an emergent phase transition caused by the synergistic interactions between neurons as a whole. We show that weight decay and weight initialization can enhance the emergent phase.
