Polynomial-Augmented Neural Networks (PANNs) with Weak Orthogonality Constraints for Enhanced Function and PDE Approximation
Madison Cooley, Shandian Zhe, Robert M. Kirby, Varun Shankar
TL;DR
Polynomial-Augmented Neural Networks (PANNs) fuse a DNN with a trainable polynomial layer to combine the strengths of neural and polynomial approximations. By enforcing weak orthogonality between the two bases, applying polynomial preconditioning, and using basis pruning, PANNs achieve superior accuracy for smooth and finitely smooth functions and improved PDE solutions when used as PI-PANNs. The framework introduces eight discrete orthogonality constraints, a preconditioning strategy, and a suite of algorithmic techniques (precomputation, custom autograd, and truncation) implemented in PyTorch C++. Across extensive experiments, PANNs outperform pure DNNs and plain polynomial methods, with PI-PANNs delivering orders-of-magnitude improvements over traditional PINNs, and provide robust performance in high-dimensional and noisy settings.
Abstract
We present polynomial-augmented neural networks (PANNs), a novel machine learning architecture that combines deep neural networks (DNNs) with a polynomial approximant. PANNs combine the strengths of DNNs (flexibility and efficiency in higher-dimensional approximation) with those of polynomial approximation (rapid convergence rates for smooth functions). To aid in both stable training and enhanced accuracy over a variety of problems, we present (1) a family of orthogonality constraints that impose mutual orthogonality between the polynomial and the DNN within a PANN; (2) a simple basis pruning approach to combat the curse of dimensionality introduced by the polynomial component; and (3) an adaptation of a polynomial preconditioning strategy to both DNNs and polynomials. We test the resulting architecture for its polynomial reproduction properties, ability to approximate both smooth functions and functions of limited smoothness, and as a method for the solution of partial differential equations (PDEs). Through these experiments, we demonstrate that PANNs offer superior approximation properties to DNNs for both regression and the numerical solution of PDEs, while also offering enhanced accuracy over both polynomial and DNN-based regression (each) when regressing functions with limited smoothness.
