Deep Attentive Features for Prostate Segmentation in 3D Transrectal Ultrasound
Yi Wang, Haoran Dou, Xiaowei Hu, Lei Zhu, Xin Yang, Ming Xu, Jing Qin, Pheng-Ann Heng, Tianfu Wang, Dong Ni
TL;DR
The paper tackles automatic prostate segmentation in 3D TRUS, a task hampered by boundary ambiguity and intensity inhomogeneity. It introduces a 3D attention-guided network that generates deep attentive features (DAF) by refining per-layer features with a learned attention over multi-level features, integrated with a 3D-FPN backbone, dilated convolutions, and 3D ASPP. A hybrid loss combining Dice and binary cross-entropy with deep supervision is used, and the model is trained end-to-end in four-fold cross-validation. Results on 40 TRUS volumes show state-of-the-art Dice/Jaccard/Conformity metrics and favorable surface distances, with significant gains over baselines and fast inference (~0.3 seconds per volume). The work provides a generalizable strategy for multi-level feature refinement in 3D medical segmentation and releases code for broader use.
Abstract
Automatic prostate segmentation in transrectal ultrasound (TRUS) images is of essential importance for image-guided prostate interventions and treatment planning. However, developing such automatic solutions remains very challenging due to the missing/ambiguous boundary and inhomogeneous intensity distribution of the prostate in TRUS, as well as the large variability in prostate shapes. This paper develops a novel 3D deep neural network equipped with attention modules for better prostate segmentation in TRUS by fully exploiting the complementary information encoded in different layers of the convolutional neural network (CNN). Our attention module utilizes the attention mechanism to selectively leverage the multilevel features integrated from different layers to refine the features at each individual layer, suppressing the non-prostate noise at shallow layers of the CNN and increasing more prostate details into features at deep layers. Experimental results on challenging 3D TRUS volumes show that our method attains satisfactory segmentation performance. The proposed attention mechanism is a general strategy to aggregate multi-level deep features and has the potential to be used for other medical image segmentation tasks. The code is publicly available at https://github.com/wulalago/DAF3D.
