PolyKAN: A Polyhedral Analysis Framework for Provable and Approximately Optimal KAN Compression
Di Zhang
TL;DR
PolyKAN addresses the challenge of compressing Kolmogorov-Arnol d Networks (KANs) with formal guarantees on size and approximation error. It introduces a complete polyhedral characterization of KANs, along with a formal theory of $\epsilon$-equivalent compression via region merging and a rigorous error propagation framework across layers. A dynamic-programming algorithm achieves provable optimal compression for univariate splines and offers approximately optimal solutions for multilayer KANs with polynomial-time guarantees. This framework provides a principled, verifiable path to deploying interpretable spline-based networks at scale, and points to future work on tighter bounds and extensions to other spline families.
Abstract
Kolmogorov-Arnold Networks (KANs) have emerged as a promising alternative to traditional Multi-Layer Perceptrons (MLPs), offering enhanced interpretability and a solid mathematical foundation. However, their parameter efficiency remains a significant challenge for practical deployment. This paper introduces PolyKAN, a novel theoretical framework for KAN compression that provides formal guarantees on both model size reduction and approximation error. By leveraging the inherent piecewise polynomial structure of KANs, we formulate the compression problem as a polyhedral region merging task. We establish a rigorous polyhedral characterization of KANs, develop a complete theory of $ε$-equivalent compression, and design a dynamic programming algorithm that achieves approximately optimal compression under specified error bounds. Our theoretical analysis demonstrates that PolyKAN achieves provably near-optimal compression while maintaining strict error control, with guaranteed global optimality for univariate spline functions. This framework provides the first formal foundation for KAN compression with mathematical guarantees, opening new directions for the efficient deployment of interpretable neural architectures.
