SegKAN: High-Resolution Medical Image Segmentation with Long-Distance Dependencies
Shengbo Tan, Rundong Xue, Shipeng Luo, Zeyu Zhang, Xinran Wang, Lei Zhang, Daji Ergu, Zhang Yi, Yang Zhao, Ying Cai
TL;DR
SegKAN addresses the challenge of long-distance, high-resolution hepatic vessel segmentation in CT images by introducing two key innovations. The Position-Time Series Network (PTSN) converts spatial relationships among 3D patches into temporal sequences to strengthen long-range context, while Fourier-based KAN Convolution (FKAC) stabilizes embeddings to prevent gradient explosions and reduce noise. Together, these modules form a cohesive framework that, when evaluated on the MSD Hepatic Vessel dataset, achieves a Dice score of 69.02% on average and 65.90% for vessel-specific segmentation, outperforming state-of-the-art methods such as TransUNet by up to 1.78 percentage points. The results demonstrate SegKAN’s effectiveness in modeling global context and long-range dependencies for elongated structures, with potential applicability to other high-resolution medical image segmentation tasks.
Abstract
Hepatic vessels in computed tomography scans often suffer from image fragmentation and noise interference, making it difficult to maintain vessel integrity and posing significant challenges for vessel segmentation. To address this issue, we propose an innovative model: SegKAN. First, we improve the conventional embedding module by adopting a novel convolutional network structure for image embedding, which smooths out image noise and prevents issues such as gradient explosion in subsequent stages. Next, we transform the spatial relationships between Patch blocks into temporal relationships to solve the problem of capturing positional relationships between Patch blocks in traditional Vision Transformer models. We conducted experiments on a Hepatic vessel dataset, and compared to the existing state-of-the-art model, the Dice score improved by 1.78%. These results demonstrate that the proposed new structure effectively enhances the segmentation performance of high-resolution extended objects. Code will be available at https://github.com/goblin327/SegKAN
