On the Rate of Convergence of Kolmogorov-Arnold Network Regression Estimators
Wei Liu, Eleni Chatzi, Zhilu Lai
TL;DR
The paper analyzes Kolmogorov–Arnol'd Networks (KANs) that use univariate B-spline components to model multivariate functions via additive or hybrid additive-m multiplicative structures. It proves that spline-based KAN estimators achieve the minimax convergence rate $O(n^{-\frac{2r}{2r+1}})$ for functions in Sobolev spaces $W^r([0,1]^d)$, with knot placement $k_n \asymp n^{1/(2r+1)}$. The results hold for both additive and hybrid KANs, establishing minimax optimality over the corresponding function classes and providing identifiability remarks. Simulations corroborate the theory, showing that KANs converge at the predicted rate and outperform standard MLP baselines, highlighting the value of structure and splines for interpretable, efficient nonparametric regression.
Abstract
Kolmogorov-Arnold Networks (KANs) offer a structured and interpretable framework for multivariate function approximation by composing univariate transformations through additive or multiplicative aggregation. This paper establishes theoretical convergence guarantees for KANs when the univariate components are represented by B-splines. We prove that both additive and hybrid additive-multiplicative KANs attain the minimax-optimal convergence rate $O(n^{-2r/(2r+1)})$ for functions in Sobolev spaces of smoothness $r$. We further derive guidelines for selecting the optimal number of knots in the B-splines. The theory is supported by simulation studies that confirm the predicted convergence rates. These results provide a theoretical foundation for using KANs in nonparametric regression and highlight their potential as a structured alternative to existing methods.
