SPIRONet: Spatial-Frequency Learning and Topological Channel Interaction Network for Vessel Segmentation
De-Xing Huang, Xiao-Hu Zhou, Xiao-Liang Xie, Shi-Qi Liu, Shuang-Yi Wang, Zhen-Qiu Feng, Mei-Jiang Gui, Hao Li, Tian-Yu Xiang, Bo-Xian Yao, Zeng-Guang Hou
TL;DR
The paper tackles real-time vessel segmentation in intraoperative imaging, where low SNR and complex vessel morphology hinder accuracy. It introduces SPIRONet, a dual-encoder network that separately learns local spatial features and global frequency features via a Fourier-based frequency encoder, integrates them with a cross-attention fusion module, and further refines multi-channel responses through a topological channel interaction (TCI) module based on graph neural networks. SPIRONet achieves state-of-the-art performance on four challenging benchmarks (CADSA, CAXF, DCA1, XCAD) with an inference speed around 21 FPS on 512×512 images, making it suitable for real-time vascular navigation systems. The work provides comprehensive ablation and visualization analyses that demonstrate the benefits of combining spatial-frequency representations and graph-based channel refinement for robust, accurate vessel segmentation in noisy intraoperative data.
Abstract
Automatic vessel segmentation is paramount for developing next-generation interventional navigation systems. However, current approaches suffer from suboptimal segmentation performances due to significant challenges in intraoperative images (i.e., low signal-to-noise ratio, small or slender vessels, and strong interference). In this paper, a novel spatial-frequency learning and topological channel interaction network (SPIRONet) is proposed to address the above issues. Specifically, dual encoders are utilized to comprehensively capture local spatial and global frequency vessel features. Then, a cross-attention fusion module is introduced to effectively fuse spatial and frequency features, thereby enhancing feature discriminability. Furthermore, a topological channel interaction module is designed to filter out task-irrelevant responses based on graph neural networks. Extensive experimental results on several challenging datasets (CADSA, CAXF, DCA1, and XCAD) demonstrate state-of-the-art performances of our method. Moreover, the inference speed of SPIRONet is 21 FPS with a 512x512 input size, surpassing clinical real-time requirements (6~12FPS). These promising outcomes indicate SPIRONet's potential for integration into vascular interventional navigation systems. Code is available at https://github.com/Dxhuang-CASIA/SPIRONet.
