DEAP-3DSAM: Decoder Enhanced and Auto Prompt SAM for 3D Medical Image Segmentation
Fangda Chen, Jintao Tang, Pancheng Wang, Ting Wang, Shasha Li, Ting Deng
TL;DR
DEAP-3DSAM addresses key limitations of SAM-based 3D medical segmentation by introducing a Feature Enhanced Decoder that fuses original image spatial details with SAM features and a Dual Attention Prompter that automatically generates prompt information via Spatial and Channel Attention. The method uses a 12-layer Image Encoder with a Scale Parallel Adapter, processing pseudo-3D patches to produce multiscale features that feed the decoder at multiple depths. Across four abdominal tumor datasets, DEAP-3DSAM achieves state-of-the-art or competitive results, outperforming or matching manual prompt methods and offering notable efficiency gains through linear self-attention and parameter sharing. The approach demonstrates the practical potential of fully automated SAM-based 3D segmentation, with implications for automated lesion localization and feature extraction in medical imaging.
Abstract
The Segment Anything Model (SAM) has recently demonstrated significant potential in medical image segmentation. Although SAM is primarily trained on 2D images, attempts have been made to apply it to 3D medical image segmentation. However, the pseudo 3D processing used to adapt SAM results in spatial feature loss, limiting its performance. Additionally, most SAM-based methods still rely on manual prompts, which are challenging to implement in real-world scenarios and require extensive external expert knowledge. To address these limitations, we introduce the Decoder Enhanced and Auto Prompt SAM (DEAP-3DSAM) to tackle these limitations. Specifically, we propose a Feature Enhanced Decoder that fuses the original image features with rich and detailed spatial information to enhance spatial features. We also design a Dual Attention Prompter to automatically obtain prompt information through Spatial Attention and Channel Attention. We conduct comprehensive experiments on four public abdominal tumor segmentation datasets. The results indicate that our DEAP-3DSAM achieves state-of-the-art performance in 3D image segmentation, outperforming or matching existing manual prompt methods. Furthermore, both quantitative and qualitative ablation studies confirm the effectiveness of our proposed modules.
