Geometry of Polynomial Neural Networks
Kaie Kubjas, Jiayi Li, Maximilian Wiesmann
TL;DR
The paper addresses the geometry of polynomial neural networks with monomial activations by framing learnable functions as neuromanifolds and their Zariski closures as neurovarieties. It develops an algebro-geometric framework that quantifies expressivity via an expected dimension-based measure and learning dynamics via the learning degree, relating these to the generic Euclidean distance degree. Concrete results are provided for several architectures, including explicit descriptions via symmetric tensors, Grassmannians, and the Hilbert–Burch theorem, plus bounds and conjectures on dimension and optimization landscapes. Through both theory and experiments, the work illuminates how network structure constrains learnability and optimization, offering insights for architecture design and potential algorithmic techniques in polynomial neural networks.
Abstract
We study the expressivity and learning process for polynomial neural networks (PNNs) with monomial activation functions. The weights of the network parametrize the neuromanifold. In this paper, we study certain neuromanifolds using tools from algebraic geometry: we give explicit descriptions as semialgebraic sets and characterize their Zariski closures, called neurovarieties. We study their dimension and associate an algebraic degree, the learning degree, to the neurovariety. The dimension serves as a geometric measure for the expressivity of the network, the learning degree is a measure for the complexity of training the network and provides upper bounds on the number of learnable functions. These theoretical results are accompanied with experiments.
