Auto-Generating Weak Labels for Real & Synthetic Data to Improve Label-Scarce Medical Image Segmentation
Tanvi Deshpande, Eva Prakash, Elsie Gyang Ross, Curtis Langlotz, Andrew Ng, Jeya Maria Jose Valanarasu
TL;DR
This work tackles label scarcity in medical image segmentation by coupling a small gold-standard set with a coarse-label predictor to auto-prompt MedSAM and generate weak labels on unlabeled data. The generated weak labels are used to augment training data, enabling a more accurate segmentation model without extensive expert labeling. The approach is validated across ultrasound, dermoscopy, and chest X-ray tasks, and is further extended with synthetic data generated by diffusion models, demonstrating robust improvements in $DICE$ scores and demonstrating the practical potential for scalable, label-efficient medical imaging pipelines. MedSAM outperforms SAM in auto-prompting, reducing the need for manual intervention and enabling automatic label generation for real and synthetic data, with implications for human-in-the-loop annotation and deployment in label-scarce settings.
Abstract
The high cost of creating pixel-by-pixel gold-standard labels, limited expert availability, and presence of diverse tasks make it challenging to generate segmentation labels to train deep learning models for medical imaging tasks. In this work, we present a new approach to overcome the hurdle of costly medical image labeling by leveraging foundation models like Segment Anything Model (SAM) and its medical alternate MedSAM. Our pipeline has the ability to generate weak labels for any unlabeled medical image and subsequently use it to augment label-scarce datasets. We perform this by leveraging a model trained on a few gold-standard labels and using it to intelligently prompt MedSAM for weak label generation. This automation eliminates the manual prompting step in MedSAM, creating a streamlined process for generating labels for both real and synthetic images, regardless of quantity. We conduct experiments on label-scarce settings for multiple tasks pertaining to modalities ranging from ultrasound, dermatology, and X-rays to demonstrate the usefulness of our pipeline. The code is available at https://github.com/stanfordmlgroup/Auto-Generate-WLs/.
