Beyond KAN: Introducing KarSein for Adaptive High-Order Feature Interaction Modeling in CTR Prediction
Yunxiao Shi, Wujiang Xu, Haimin Zhang, Qiang Wu, Min Xu
TL;DR
<3-5 sentence high-level summary> KarSein addresses the challenge of modeling high-order feature interactions in CTR prediction without incurring the prohibitive cost of explicit cross-feature enumeration. It extends Kolmogorov–Arnold Networks to vector-valued inputs, introducing a lightweight KarSein Interaction Layer with a pairwise multiplication catalyst, a single spline-based activation per feature, and a residual linear head, while jointly modeling implicit bit-wise interactions in a parallel path. The approach achieves state-of-the-art CTR accuracy across four real-world datasets with dramatically fewer parameters and demonstrates strong interpretability through symbolic regression and structural sparsity. The results establish KarSein as a practical, scalable, and interpretable alternative for high-order interaction modeling in CTR and related tasks.
Abstract
Modeling high-order feature interactions is crucial for click-through rate (CTR) prediction, yet traditional approaches typically predefine a maximum interaction order and exhaustively enumerate feature combinations up to that order. This paradigm depends heavily on prior domain knowledge to delimit the interaction space and incurs substantial computational overhead. As a result, conventional CTR models face a persistent tension between enriching representations with complex high-order interactions and keeping computation tractable. To address this dual challenge, this study introduces the Kolmogorov-Arnold Represented Sparse Efficient Interaction Network (KarSein). Drawing inspiration from the learnable activation mechanism in the Kolmogorov-Arnold Network (KAN), KarSein leverages this mechanism to adaptively transform low-order basic features into high-order feature interactions, offering a novel approach to feature interaction modeling. KarSein extends the capabilities of KAN by introducing a more efficient architecture that significantly reduces computational costs while accommodating two-dimensional embedding vectors as feature inputs. Furthermore, it overcomes the limitation of KAN's its inability to spontaneously capture multiplicative relationships among features. Extensive experiments highlight the superiority of KarSein, demonstrating its ability to surpass not only the vanilla implementation of KAN in CTR prediction tasks but also other baseline methods. Remarkably, KarSein achieves exceptional predictive accuracy while maintaining a highly compact parameter size and minimal computational overhead. Moreover, KarSein exhibits strong interpretability and structural sparsity. As the first systematic adaptation of KAN to CTR prediction, KarSein offers a practical, parameter-efficient, and interpretable alternative for modeling complex feature interactions in CTR prediction.
