SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models

Yichi Zhang; Bohao Lv; Le Xue; Wenbo Zhang; Yuchen Liu; Yu Fu; Yuan Cheng; Yuan Qi

SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models

Yichi Zhang, Bohao Lv, Le Xue, Wenbo Zhang, Yuchen Liu, Yu Fu, Yuan Cheng, Yuan Qi

TL;DR

The paper tackles the annotation bottleneck in medical image segmentation by proposing SemiSAM+, a foundation-model-driven SSL framework that fuses a trainable specialist with frozen SAM-like generalist models. The specialist generates positional prompts from its outputs to interact with the generalists, while the generalists provide pseudo-labels and uncertainty-guided supervision to the specialist through a confidence-aware regularization mechanism. Experiments on Left Atrium, BraTS 2019, and an in-house PET dataset demonstrate significant gains under extremely limited labels and show the method’s plug-and-play applicability across different specialist and generalist configurations. This work offers a practical path toward annotation-efficient medical segmentation with potential clinical impact.

Abstract

Deep learning-based medical image segmentation typically requires large amount of labeled data for training, making it less applicable in clinical settings due to high annotation cost. Semi-supervised learning (SSL) has emerged as an appealing strategy due to its less dependence on acquiring abundant annotations from experts compared to fully supervised methods. Beyond existing model-centric advancements of SSL by designing novel regularization strategies, we anticipate a paradigmatic shift due to the emergence of promptable segmentation foundation models with universal segmentation capabilities using positional prompts represented by Segment Anything Model (SAM). In this paper, we present SemiSAM+, a foundation model-driven SSL framework to efficiently learn from limited labeled data for medical image segmentation. SemiSAM+ consists of one or multiple promptable foundation models as generalist models, and a trainable task-specific segmentation model as specialist model. For a given new segmentation task, the training is based on the specialist-generalist collaborative learning procedure, where the trainable specialist model delivers positional prompts to interact with the frozen generalist models to acquire pseudo-labels, and then the generalist model output provides the specialist model with informative and efficient supervision which benefits the automatic segmentation and prompt generation in turn. Extensive experiments on two public datasets and one in-house clinical dataset demonstrate that SemiSAM+ achieves significant performance improvement, especially under extremely limited annotation scenarios, and shows strong efficiency as a plug-and-play strategy that can be easily adapted to different specialist and generalist models.

SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models

TL;DR

Abstract

SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (7)