Revisiting Deep Information Propagation: Fractal Frontier and Finite-size Effects
Giuseppe Alessio D'Inverno, Zhiyuan Hu, Leo Davy, Michael Unser, Gianluigi Rozza, Jonathan Dong
TL;DR
Problem: how information propagates in finite-width deep networks and how the order-to-chaos boundary behaves away from the mean-field limit. Approach: extend mean-field analysis to finite width across MLPs, CNNs, and Fourier-based structured transforms, define forward divergence $L^{(d)}$ and backpropagation metric $L'$ and study the fractal frontier via box counting. Contributions: show a fractal boundary between stable and chaotic propagation in forward and backward passes, quantify fractal dimensions around 1.6–1.9 across architectures, and demonstrate a clear finite-depth separation-robustness tradeoff. Significance: reveals intrinsic dynamical complexity of architectures independent of data and optimization, informing initialization and depth choices for robust information flow in practice.
Abstract
Information propagation characterizes how input correlations evolve across layers in deep neural networks. This framework has been well studied using mean-field theory, which assumes infinitely wide networks. However, these assumptions break down for practical, finite-size networks. In this work, we study information propagation in randomly initialized neural networks with finite width and reveal that the boundary between ordered and chaotic regimes exhibits a fractal structure. This shows the fundamental complexity of neural network dynamics, in a setting that is independent of input data and optimization. To extend this analysis beyond multilayer perceptrons, we leverage recently introduced Fourier-based structured transforms, and show that information propagation in convolutional neural networks also follow the same behavior. In practice, our investigation highlights the importance of finite network depth with respect to the tradeoff between separation and robustness.
