Computing frustration and near-monotonicity in deep neural networks
Joel Wendin, Erik G. Larsson, Claudio Altafini
TL;DR
The paper investigates whether pretrained deep CNNs encode an intrinsic order by viewing their architectures as signed DAGs and measuring their structural balance through a frustration index. By linking structural balance to monotonicity, it demonstrates that CNNs tend to be less frustrated than null models and therefore exhibit near-monotone input-output behavior, suggesting a form of implicit regularization emerging from training. The authors introduce active-subnetwork analysis, a direct IO-monotonicity test, and a concrete heuristic for estimating frustration, applying them to multiple popular CNNs and null models. The findings reveal a robust, input-sensitive partial order that persists under perturbations and may contribute to the networks' generalization and stability properties. This framework bridges spin-glass inspired disorder analysis with monotone system theory to provide a novel lens on DNN organization and regularization.
Abstract
For the signed graph associated to a deep neural network, one can compute the frustration level, i.e., test how close or distant the graph is to structural balance. For all the pretrained deep convolutional neural networks we consider, we find that the frustration is always less than expected from null models. From a statistical physics point of view, and in particular in reference to an Ising spin glass model, the reduced frustration indicates that the amount of disorder encoded in the network is less than in the null models. From a functional point of view, low frustration (i.e., proximity to structural balance) means that the function representing the network behaves near-monotonically, i.e., more similarly to a monotone function than in the null models. Evidence of near-monotonic behavior along the partial order determined by frustration is observed for all networks we consider. This confirms that the class of deep convolutional neural networks tends to have a more ordered behavior than expected from null models, and suggests a novel form of implicit regularization.
