The Alexander-Hirschowitz theorem for neurovarieties

A. Massarenti; M. Mella

The Alexander-Hirschowitz theorem for neurovarieties

A. Massarenti, M. Mella

TL;DR

The paper studies the algebraic geometry of polynomial neural networks by introducing neurovarieties and deriving conditions under which they achieve their expected dimension. It develops a Terracini-type decomposition using level-wise normal spaces to reduce dimension questions to properties of Veronese secant varieties, yielding an Alexander–Hirschowitz-type theorem for the single-output case ($n_L=1$) and, under numerical assumptions, non-defectiveness and global identifiability for multi-output architectures. The results illuminate how architectural choices (widths $\mathbf n$, activation degrees $\mathbf d$) impact expressivity, identifiability, and conditioning, with concrete criteria and explicit counterexamples. Practical tools, including a Magma library, enable symbolic exploration of dimensions across architectures and support principled network design and compression guided by algebraic constraints.

Abstract

We study neurovarieties for polynomial neural networks and fully characterize when they attain the expected dimension in the single-output case. As consequences, we establish non-defectiveness and global identifiability for multi-output architectures.

The Alexander-Hirschowitz theorem for neurovarieties

TL;DR

) and, under numerical assumptions, non-defectiveness and global identifiability for multi-output architectures. The results illuminate how architectural choices (widths

, activation degrees

) impact expressivity, identifiability, and conditioning, with concrete criteria and explicit counterexamples. Practical tools, including a Magma library, enable symbolic exploration of dimensions across architectures and support principled network design and compression guided by algebraic constraints.

The Alexander-Hirschowitz theorem for neurovarieties

TL;DR

Abstract

The Alexander-Hirschowitz theorem for neurovarieties

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Theorems & Definitions (31)