Size and depth of monotone neural networks: interpolation and approximation
Dan Mikulincer, Daniel Reichman
TL;DR
This work studies monotone neural networks with nonnegative weights, focusing on their expressive power and interpolation abilities for monotone functions on $[0,1]^d$. It proves that depth-4 monotone threshold networks universally approximate monotone functions, improving over prior depth bounds for $d>3$, and shows a near-tight depth-based interpolation construction with a wide intermediate layer. The paper also establishes an exponential separation between the size required by monotone networks and general threshold networks, via harmonic extension and circuit reductions, leveraging the Tardos function to connect to monotone circuit lower bounds. These results illuminate fundamental trade-offs between monotonicity constraints and representation efficiency, with implications for designing monotone models and for understanding connections between neural networks and circuit complexity.
Abstract
We study monotone neural networks with threshold gates where all the weights (other than the biases) are non-negative. We focus on the expressive power and efficiency of representation of such networks. Our first result establishes that every monotone function over $[0,1]^d$ can be approximated within arbitrarily small additive error by a depth-4 monotone network. When $d > 3$, we improve upon the previous best-known construction which has depth $d+1$. Our proof goes by solving the monotone interpolation problem for monotone datasets using a depth-4 monotone threshold network. In our second main result we compare size bounds between monotone and arbitrary neural networks with threshold gates. We find that there are monotone real functions that can be computed efficiently by networks with no restriction on the gates whereas monotone networks approximating these functions need exponential size in the dimension.
