CF-KAN: Kolmogorov-Arnold Network-based Collaborative Filtering to Mitigate Catastrophic Forgetting in Recommender Systems
Jin-Duk Park, Kyung-Min Kim, Won-Yong Shin
TL;DR
The paper tackles catastrophic forgetting in recommender systems by replacing fixed MLP activations with Kolmogorov-Arnold networks (KANs) that learn edge-level nonlinearities. CF-KAN builds a KAN-based autoencoder to model sparse user--item interactions, enabling robust continual learning and interpretability via pruning. The approach achieves state-of-the-art recall and NDCG on ML-1M, Yelp, and Anime, with gains up to up to 8.2% over strong baselines, while maintaining faster training times than two-tower models. By grounding activations in the Kolmogorov–Arnol’d representation $f({f x}) = \sum_{q=1}^{2n+1} \Phi_q\big( \sum_{p=1}^n \phi_{q,p}(x_p) \big)$, CF-KAN demonstrates that edge-level learning can balance plasticity and stability in dynamic recommendation scenarios and offers interpretable, sparse explanations of recommendations.
Abstract
Collaborative filtering (CF) remains essential in recommender systems, leveraging user--item interactions to provide personalized recommendations. Meanwhile, a number of CF techniques have evolved into sophisticated model architectures based on multi-layer perceptrons (MLPs). However, MLPs often suffer from catastrophic forgetting, and thus lose previously acquired knowledge when new information is learned, particularly in dynamic environments requiring continual learning. To tackle this problem, we propose CF-KAN, a new CF method utilizing Kolmogorov-Arnold networks (KANs). By learning nonlinear functions on the edge level, KANs are more robust to the catastrophic forgetting problem than MLPs. Built upon a KAN-based autoencoder, CF-KAN is designed in the sense of effectively capturing the intricacies of sparse user--item interactions and retaining information from previous data instances. Despite its simplicity, our extensive experiments demonstrate 1) CF-KAN's superiority over state-of-the-art methods in recommendation accuracy, 2) CF-KAN's resilience to catastrophic forgetting, underscoring its effectiveness in both static and dynamic recommendation scenarios, and 3) CF-KAN's edge-level interpretation facilitating the explainability of recommendations.
