Shift-Invariant Attribute Scoring for Kolmogorov-Arnold Networks via Shapley Value
Wangxuan Fan, Ching Wang, Siqi Li, Nan Liu
TL;DR
The paper tackles interpretability and pruning in Kolmogorov–Arnold Networks (KANs), where standard magnitude-based pruning is unreliable under covariate shift. It introduces ShapKAN, a Shapley-value–based pruning framework that treats layer neurons as cooperative-game players and uses permutation sampling with antithetic variance reduction to efficiently approximate neuron attributions, applied in a bottom-up, multi-layer manner. Across synthetic and real-world datasets, ShapKAN yields shift-invariant attributions, preserves ground-truth symbolic structure under covariate shift, and enables more effective compression while maintaining generalization. This approach enhances interpretability and enables deployment of KANs in resource-constrained settings, while remaining compatible with advanced KAN variants like MultKAN 2.0.
Abstract
For many real-world applications, understanding feature-outcome relationships is as crucial as achieving high predictive accuracy. While traditional neural networks excel at prediction, their black-box nature obscures underlying functional relationships. Kolmogorov--Arnold Networks (KANs) address this by employing learnable spline-based activation functions on edges, enabling recovery of symbolic representations while maintaining competitive performance. However, KAN's architecture presents unique challenges for network pruning. Conventional magnitude-based methods become unreliable due to sensitivity to input coordinate shifts. We propose \textbf{ShapKAN}, a pruning framework using Shapley value attribution to assess node importance in a shift-invariant manner. Unlike magnitude-based approaches, ShapKAN quantifies each node's actual contribution, ensuring consistent importance rankings regardless of input parameterization. Extensive experiments on synthetic and real-world datasets demonstrate that ShapKAN preserves true node importance while enabling effective network compression. Our approach improves KAN's interpretability advantages, facilitating deployment in resource-constrained environments.
