Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models
Hedda Cohen Indelman, Elay Dahan, Angeles M. Perez-Agosto, Carmit Shiran, Doron Shaked, Nati Daniel
TL;DR
The paper tackles semantic segmentation in ultrasound under data scarcity and domain gaps by introducing a two-stage refinement that preserves a zero-shot paradigm. A coarse segmentor trained on a small subset provides a mask from which positive points are selected via k-medoids and negative points are optimized for background context, enabling a zero-shot foundation model (SonoSAM) to generate refined pathology masks without fine-tuning. Across a musculoskeletal ultrasound dataset focusing on tendon pathology, the method consistently improves Dice similarity over a baseline, with larger gains in smaller data regimes, and ablation studies highlight the importance of SonoSAM and negative prompt refinement. The approach reduces the need for large labeled ultrasound datasets and is applicable to other ultrasound tasks, though it incurs higher latency due to its two-stage nature.
Abstract
Despite the remarkable success of deep learning in medical imaging analysis, medical image segmentation remains challenging due to the scarcity of high-quality labeled images for supervision. Further, the significant domain gap between natural and medical images in general and ultrasound images in particular hinders fine-tuning models trained on natural images to the task at hand. In this work, we address the performance degradation of segmentation models in low-data regimes and propose a prompt-less segmentation method harnessing the ability of segmentation foundation models to segment abstract shapes. We do that via our novel prompt point generation algorithm which uses coarse semantic segmentation masks as input and a zero-shot prompt-able foundation model as an optimization target. We demonstrate our method on a segmentation findings task (pathologic anomalies) in ultrasound images. Our method's advantages are brought to light in varying degrees of low-data regime experiments on a small-scale musculoskeletal ultrasound images dataset, yielding a larger performance gain as the training set size decreases.
