Fixed points of nonnegative neural networks
Tomasz J. Piotrowski, Renato L. G. Cavalcante, Mateusz Gabor
TL;DR
The paper develops a nonlinear Perron-Frobenius framework for fixed-point analysis of nonnegative neural networks, showing that such networks with nonnegative weights and biases induce monotone, (weakly) scalable mappings on the nonnegative cone. It introduces the asymptotic mapping $T_ extinfty$ and the nonlinear spectral radius $ ho(T_ extinfty)$ to establish existence and, under additional primitivity, uniqueness of fixed points; it also characterizes the shape of fixed-point sets (often intervals) and provides numerically accessible certificates. The results generalize to broader monotone nonnegative networks and have practical implications for autoencoders and deep equilibrium models, offering weaker, PF-based conditions than convex-analytic approaches. The work also outlines future directions on approximate fixed points, slow convergence as a design feature, Lipschitz analyses, and extensions beyond nonnegativity, highlighting the method's potential impact on implicit models and inverse problems.
Abstract
We use fixed point theory to analyze nonnegative neural networks, which we define as neural networks that map nonnegative vectors to nonnegative vectors. We first show that nonnegative neural networks with nonnegative weights and biases can be recognized as monotonic and (weakly) scalable mappings within the framework of nonlinear Perron-Frobenius theory. This fact enables us to provide conditions for the existence of fixed points of nonnegative neural networks having inputs and outputs of the same dimension, and these conditions are weaker than those recently obtained using arguments in convex analysis. Furthermore, we prove that the shape of the fixed point set of nonnegative neural networks with nonnegative weights and biases is an interval, which under mild conditions degenerates to a point. These results are then used to obtain the existence of fixed points of more general nonnegative neural networks. From a practical perspective, our results contribute to the understanding of the behavior of autoencoders, and we also offer valuable mathematical machinery for future developments in deep equilibrium models.
