Improving 3D Few-Shot Segmentation with Inference-Time Pseudo-Labeling
Mohammad Mozafari, Hosein Hasani, Reza Vahidimajd, Mohamadreza Fereydooni, Mahdieh Soleymani Baghshah
TL;DR
This work addresses data scarcity in 3D medical image segmentation in the $N$-way $K$-shot few-shot segmentation (FSS) setting. It proposes a three-stage inference-time strategy: Stage 1 builds prototypes from the support set and uses them to score the query; Stage 2 applies confidence-aware pseudo-labeling to harvest informative regions of the unlabeled query and add them to the support prototypes; Stage 3 segments the query with the augmented prototype set. The key contribution is that this inference-time prototype augmentation exploits unlabeled query data without additional supervision or retraining. Experiments on abdominal CT and MRI show consistent Dice improvements and support the effectiveness of windowed pseudo-label transfer and mixed prototype sets.
Abstract
In recent years, few-shot segmentation (FSS) models have emerged as a promising approach in medical image analysis, since they can adapt to segment novel classes from limited annotated data. Existing FSS approaches have often overlooked the query itself and so fail to exploit the information it contains. Treating the query as unlabeled data provides an opportunity to improve prediction accuracy. In medical imaging in particular, the volumetric structure of the query is a considerable source of information that can be used to improve the segmentation of the target slice. In this work, we present a novel strategy that efficiently leverages the intrinsic information of the query sample for the final segmentation at inference time. First, we use the support slices from a reference volume to generate an initial segmentation score for the query slices through a prototypical approach. Next, we apply a confidence-aware pseudo-labeling procedure to transfer the most informative parts of the query slices to the support set. The final prediction is then made from the expanded support set, which yields a more accurate segmentation mask for the query volume. Extensive experiments show that the proposed method boosts performance across diverse settings and datasets.
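The three steps described above can be illustrated with a minimal NumPy sketch. It is not the authors' implementation; it only shows the general shape of prototype-based scoring followed by confidence-aware pseudo-labeling. Several choices here are assumptions made for illustration: features are given as flattened per-pixel vectors, prototypes are mean-pooled, similarity is cosine, confidence is the foreground-minus-background score margin, and `conf_threshold` and all function names are hypothetical. The windowed transfer across neighboring slices is not modeled.

```python
import numpy as np

EPS = 1e-8  # numerical guard against division by zero


def masked_mean(features, weights):
    """Weighted mean of feature vectors; features is (N, D), weights is (N,)."""
    w = weights / (weights.sum() + EPS)
    return w @ features


def cosine_scores(features, prototypes):
    """Cosine similarity of each feature (N, D) to each prototype (P, D)."""
    f = features / (np.linalg.norm(features, axis=1, keepdims=True) + EPS)
    p = prototypes / (np.linalg.norm(prototypes, axis=1, keepdims=True) + EPS)
    return f @ p.T  # shape (N, P)


def score_margin(features, fg_protos, bg_protos):
    """Best foreground similarity minus best background similarity per pixel."""
    fg = cosine_scores(features, fg_protos).max(axis=1)
    bg = cosine_scores(features, bg_protos).max(axis=1)
    return fg - bg


def three_stage_inference(support_feats, support_mask, query_feats,
                          conf_threshold=0.5):
    """Return a boolean foreground mask for the query pixels.

    conf_threshold is an illustrative value, not one taken from the paper.
    """
    support_mask = np.asarray(support_mask, dtype=float)

    # Stage 1: prototypes from the labeled support, initial query scores.
    fg0 = masked_mean(support_feats, support_mask)
    bg0 = masked_mean(support_feats, 1.0 - support_mask)
    margin = score_margin(query_feats, fg0[None], bg0[None])

    # Stage 2: keep only confidently scored query pixels as pseudo-labels
    # and turn them into extra prototypes.
    fg_protos, bg_protos = [fg0], [bg0]
    confident_fg = margin > conf_threshold
    confident_bg = margin < -conf_threshold
    if confident_fg.any():
        fg_protos.append(query_feats[confident_fg].mean(axis=0))
    if confident_bg.any():
        bg_protos.append(query_feats[confident_bg].mean(axis=0))

    # Stage 3: re-score the query with the mixed (support + query) prototypes.
    final = score_margin(query_feats, np.stack(fg_protos), np.stack(bg_protos))
    return final > 0
```

If no query pixel passes the confidence test, the sketch falls back to the Stage 1 prototypes alone, so the pseudo-labeling step can only add prototypes and never removes the original support information.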
