Activation degree thresholds and expressiveness of polynomial neural networks
Bella Finkel, Jose Israel Rodriguez, Chenxi Wu, Thomas Yahl
TL;DR
The paper develops an algebro-geometric framework to quantify the expressiveness of polynomial neural networks via neurovarieties. It proves the activation degree threshold exists for architectures with no width-one bottlenecks and provides a universal bound $\mathrm{ActThr}(\mathbf{d}) \le 6m^2-6m+1$, resolving the high-degree conjecture and showing equi-width networks have $\mathrm{ActThr}=1$. The authors establish a fiber-based approach to relate the dimension of the neurovariety to its expected dimension for large activation degrees, and they extend key bounds from univariate to multivariate polynomial settings. They also prove that equi-width networks are maximally expressive in the sense that their neurovarieties achieve the expected dimension for all $r>1$, motivating architecture design guided by algebro-geometric expressiveness metrics.
Abstract
We study the expressive power of deep polynomial neural networks through the geometry of their neurovariety. We introduce the notion of the activation degree threshold of a network architecture to express when the dimension of the neurovariety achieves its theoretical maximum. We prove the existence of the activation degree threshold for all polynomial neural networks without width-one bottlenecks and demonstrate a universal upper bound that is quadratic in the width of largest size. In doing so, we prove the high activation degree conjecture of Kileel, Trager, and Bruna. Certain structured architectures have exceptional activation degree thresholds, making them especially expressive in the sense of their neurovariety dimension. In this direction, we prove that polynomial neural networks with equi-width architectures are maximally expressive by showing their activation degree threshold is one.
