KAN or MLP: A Fairer Comparison
Runpeng Yu, Weihao Yu, Xinchao Wang
TL;DR
This work performs a fair, parameter- and FLOP-controlled comparison between Kolmogorov–Arnold Networks (KAN) and MLPs across multiple domains, finding that MLP generally outperforms KAN except in symbolic formula representation where KAN has an edge. It identifies the learnable B-spline activation as the key factor behind KAN’s distinct performance, showing that equipping MLP with B-spline activations can match or exceed KAN across tasks. Additional ablations reveal the benefits of spline activations are task-dependent, while a standard class-incremental continual learning setup shows KAN forgetting more severely than MLP, challenging prior conclusions. The results offer practical guidance for future work on KAN and MAP-style MLP alternatives, highlighting when spline activations are beneficial and where they are not.
Abstract
This paper does not introduce a novel method. Instead, it offers a fairer and more comprehensive comparison of KAN and MLP models across various tasks, including machine learning, computer vision, audio processing, natural language processing, and symbolic formula representation. Specifically, we control the number of parameters and FLOPs to compare the performance of KAN and MLP. Our main observation is that, except for symbolic formula representation tasks, MLP generally outperforms KAN. We also conduct ablation studies on KAN and find that its advantage in symbolic formula representation mainly stems from its B-spline activation function. When B-spline is applied to MLP, performance in symbolic formula representation significantly improves, surpassing or matching that of KAN. However, in other tasks where MLP already excels over KAN, B-spline does not substantially enhance MLP's performance. Furthermore, we find that KAN's forgetting issue is more severe than that of MLP in a standard class-incremental continual learning setting, which differs from the findings reported in the KAN paper. We hope these results provide insights for future research on KAN and other MLP alternatives. Project link: https://github.com/yu-rp/KANbeFair
