Graded Neural Networks
Tony Shaska
TL;DR
Graded Neural Networks (GNNs) introduce a principled framework for hierarchical data by endowing neural computations with algebraic grading via $\lambda \star \mathbf{x}=(\lambda^{q_i} x_i)$ on $\mathcal{V}_\mathbf{q}^n$. The approach defines graded neurons, graded ReLU, exponential activations, and graded loss families that respect the underlying grade structure, and it recovers classical neural networks when $\mathbf{q}=(1,\ldots,1)$. The authors establish foundational properties, including convexity of graded losses, universal approximation for graded-homogeneous functions, exact monomial representations, Lipschitz activation behavior, and grade-aware convergence of optimization; they also address numerical stability and computational efficiency. They further demonstrate the framework’s relevance to domains such as algebraic geometry (genus-two invariants), quantum physics (supersymmetric systems), and neuromorphic/photonic hardware, arguing that grading provides a robust, scalable path toward structured deep learning with broad practical impact.
Abstract
This paper presents a novel framework for graded neural networks (GNNs) built over graded vector spaces $\V_\w^n$, extending classical neural architectures by incorporating algebraic grading. Leveraging a coordinate-wise grading structure with scalar action $λ\star \x = (λ^{q_i} x_i)$, defined by a tuple $\w = (q_0, \ldots, q_{n-1})$, we introduce graded neurons, layers, activation functions, and loss functions that adapt to feature significance. Theoretical properties of graded spaces are established, followed by a comprehensive GNN design, addressing computational challenges like numerical stability and gradient scaling. Potential applications span machine learning and photonic systems, exemplified by high-speed laser-based implementations. This work offers a foundational step toward graded computation, unifying mathematical rigor with practical potential, with avenues for future empirical and hardware exploration.
