ProKAN: Progressive Stacking of Kolmogorov-Arnold Networks for Efficient Liver Segmentation
Bhavesh Gyanchandani, Aditya Oza, Abhinav Roy
TL;DR
ProKAN addresses the need for accurate and efficient 3D liver segmentation in CT images. It introduces progressive stacking of Kolmogorov-Arnold Networks (KANs) with learnable B-spline activations to adaptively grow model capacity while preventing overfitting. The approach demonstrates state-of-the-art performance on LiTS17 with higher Dice scores and better generalization compared to MLPs and fixed KANs, along with favorable training/inference efficiency. The work suggests practical benefits for real-time clinical deployment and offers interpretable learned spline coefficients.
Abstract
The growing need for accurate and efficient 3D identification of tumors, particularly in liver segmentation, has spurred considerable research into deep learning models. While many existing architectures offer strong performance, they often face challenges such as overfitting and excessive computational costs. An adjustable and flexible architecture that strikes a balance between time efficiency and model complexity remains an unmet requirement. In this paper, we introduce proKAN, a progressive stacking methodology for Kolmogorov-Arnold Networks (KANs) designed to address these challenges. Unlike traditional architectures, proKAN dynamically adjusts its complexity by progressively adding KAN blocks during training, based on overfitting behavior. This approach allows the network to stop growing when overfitting is detected, preventing unnecessary computational overhead while maintaining high accuracy. Additionally, proKAN utilizes KAN's learnable activation functions modeled through B-splines, which provide enhanced flexibility in learning complex relationships in 3D medical data. Our proposed architecture achieves state-of-the-art performance in liver segmentation tasks, outperforming standard Multi-Layer Perceptrons (MLPs) and fixed KAN architectures. The dynamic nature of proKAN ensures efficient training times and high accuracy without the risk of overfitting. Furthermore, proKAN provides better interpretability by allowing insight into the decision-making process through its learnable coefficients. The experimental results demonstrate a significant improvement in accuracy, Dice score, and time efficiency, making proKAN a compelling solution for 3D medical image segmentation tasks.
