KAN-Mamba FusionNet: Redefining Medical Image Segmentation with Non-Linear Modeling
Akansh Agrawal, Akshan Agrawal, Shashwat Gupta, Priyanka Bagade
TL;DR
The paper tackles the challenge of accurate medical image segmentation by bridging non-linear modeling and long-range dependency handling. It introduces KAN-Mamba FusionNet, which integrates KAN-based non-linear blocks with a Mamba-inspired KAMBA block that includes BoA, SSM, and spatial attention, within a U-KAN-inspired pipeline. Key contributions include the novel KAMBA block, the BoA activation scheme, and comprehensive ablations and evaluations on BUSI, Kvasir-Seg, and GlaS. Results show consistent improvements in IoU and F1 across datasets with manageable computational costs, suggesting practical impact for clinical image analysis.
Abstract
Medical image segmentation is essential for applications like robotic surgeries, disease diagnosis, and treatment planning. Recently, various deep-learning models have been proposed to enhance medical image segmentation. One promising approach utilizes Kolmogorov-Arnold Networks (KANs), which better capture non-linearity in input data. However, they are unable to effectively capture long-range dependencies, which are required to accurately segment complex medical images and, by that, improve diagnostic accuracy in clinical settings. Neural networks such as Mamba can handle long-range dependencies. However, they have a limited ability to accurately capture non-linearities in the images as compared to KANs. Thus, we propose a novel architecture, the KAN-Mamba FusionNet, which improves segmentation accuracy by effectively capturing the non-linearities from input and handling long-range dependencies with the newly proposed KAMBA block. We evaluated the proposed KAN-Mamba FusionNet on three distinct medical image segmentation datasets: BUSI, Kvasir-Seg, and GlaS - and found it consistently outperforms state-of-the-art methods in IoU and F1 scores. Further, we examined the effects of various components and assessed their contributions to the overall model performance via ablation studies. The findings highlight the effectiveness of this methodology for reliable medical image segmentation, providing a unique approach to address intricate visual data issues in healthcare.
