Emergent weight morphologies in deep neural networks
Pascal de Jong, Felix Meigel, Steffen Rulands
TL;DR
The paper investigates whether deep neural networks develop emergent weight morphologies during training that do not depend on the training data. By formulating a path-activity framework inspired by condensed matter physics, it derives that the homogeneous weight state is unstable and gives rise to periodic channel structures, which are then corroborated by numerical experiments across synthetic and real datasets. The study further shows that channel amplitudes exhibit oscillatory modulation across layers and that embedding dimensions correlate with these morphologies, suggesting functional links to representation learning. Altogether, the work reveals universal, data-independent weight morphologies that constrain and potentially aid learning, with implications for pruning, representation, and security considerations in increasingly capable AI systems.
Abstract
Whether deep neural networks can exhibit emergent behaviour is not only relevant for understanding how deep learning works, it is also pivotal for estimating potential security risks of increasingly capable artificial intelligence systems. Here, we show that training deep neural networks gives rise to emergent weight morphologies independent of the training data. Specifically, in analogy to condensed matter physics, we derive a theory that predict that the homogeneous state of deep neural networks is unstable in a way that leads to the emergence of periodic channel structures. We verified these structures by performing numerical experiments on a variety of data sets. Our work demonstrates emergence in the training of deep neural networks, which impacts the achievable performance of deep neural networks.
