GenEFT: Understanding Statics and Dynamics of Model Generalization via Effective Theory
David D. Baek, Ziming Liu, Max Tegmark
TL;DR
GenEFT addresses the statics and dynamics of neural-network generalization by combining an information-theoretic bound on required data with a dynamic Interacting Repon Theory that ties encoder/decoder learning-rate balance to memorization-generalization phase transitions. The static component centers on a description-length bound $b=\log_2 \frac{n!}{|\operatorname{Aut}(G)|}$ and a crude critical-data fraction $p_c$, while the dynamic component introduces repons as interacting representations, yielding Theorems 1–3 and a practical bound on generalizable learning $p_r$. The framework is validated on knowledge-graph autoencoding tasks with $n=30$ across multiple relations, revealing a Goldilocks decoder regime and phase diagrams that align with theory. By linking data geometry and learning dynamics through a physics-inspired lens, GenEFT offers a principled pathway to bridge theory and practice in model generalization and informs data and learning-rate decisions in graph-based settings.
Abstract
We present GenEFT: an effective theory framework for shedding light on the statics and dynamics of neural network generalization, and illustrate it with graph learning examples. We first investigate the generalization phase transition as data size increases, comparing experimental results with information-theory-based approximations. We find generalization in a Goldilocks zone where the decoder is neither too weak nor too powerful. We then introduce an effective theory for the dynamics of representation learning, where latent-space representations are modeled as interacting particles (repons), and find that it explains our experimentally observed phase transition between generalization and overfitting as encoder and decoder learning rates are scanned. This highlights the power of physics-inspired effective theories for bridging the gap between theoretical predictions and practice in machine learning.
