DBF-UNet: A Two-Stage Framework for Carotid Artery Segmentation with Pseudo-Label Generation
Haoxuan Li, Wei Song, Aofan Liu, Peiwu Qin
TL;DR
This work tackles the challenge of sparse, spatially discontinuous annotations in 3D carotid artery segmentation by introducing a two-stage framework. It first generates high-quality pseudo-labels through centroid-guided interpolation and SAM-Med2D fine-tuning, then refines full 3D vessel segmentation with the proposed DBF-UNet, which features dense downsampling (DSDBlock), multi-scale feature learning (MLKBlock), and bidirectional feature fusion (BFFBlock). The approach achieves state-of-the-art results on the COSMOS2022 dataset, with lumen Dice of 95.22% and wall Dice of 86.08%, demonstrating strong performance in challenging regions and with limited annotations. This work highlights effective integration of foundation-model-based prompting with specialized lightweight architectures for precision, label-efficient medical image segmentation and has practical potential for clinical workflows.
Abstract
Medical image analysis faces significant challenges due to limited annotation data, particularly in three-dimensional carotid artery segmentation tasks, where existing datasets exhibit spatially discontinuous slice annotations with only a small portion of expert-labeled slices in complete 3D volumetric data. To address this challenge, we propose a two-stage segmentation framework. First, we construct continuous vessel centerlines by interpolating between annotated slice centroids and propagate labels along these centerlines to generate interpolated annotations for unlabeled slices. The slices with expert annotations are used for fine-tuning SAM-Med2D, while the interpolated labels on unlabeled slices serve as prompts to guide segmentation during inference. In the second stage, we propose a novel Dense Bidirectional Feature Fusion UNet (DBF-UNet). This lightweight architecture achieves precise segmentation of complete 3D vascular structures. The network incorporates bidirectional feature fusion in the encoder and integrates multi-scale feature aggregation with dense connectivity for effective feature reuse. Experimental validation on public datasets demonstrates that our proposed method effectively addresses the sparse annotation challenge in carotid artery segmentation while achieving superior performance compared to existing approaches. The source code is available at https://github.com/Haoxuanli-Thu/DBF-UNet.
