ScribblePrompt: Fast and Flexible Interactive Segmentation for Any Biomedical Image
Hallee E. Wong, Marianne Rakic, John Guttag, Adrian V. Dalca
TL;DR
ScribblePrompt addresses the challenge of segmenting unseen biomedical images by enabling fast, interactive segmentation that supports scribbles, clicks, and bounding boxes. It combines a problem formulation that learns from iterative user interactions, a robust prompt-simulation and data-augmentation strategy, and two efficient network variants (UNet-based and SAM-adapter) to generalize to new tasks without retraining. The approach achieves superior Dice scores and lower HD95 compared with baselines on 12 unseen evaluation sets, and demonstrates practical speed, including CPU-only inference, with strong user study preferences. The work contributes a scalable training paradigm with synthetic labels, versatile interaction prompts, and open-source code, demonstrating substantial potential to reduce manual annotation workload in biomedical imaging.
Abstract
Biomedical image segmentation is a crucial part of both scientific research and clinical care. With enough labelled data, deep learning models can be trained to accurately automate specific biomedical image segmentation tasks. However, manually segmenting images to create training data is highly labor intensive and requires domain expertise. We present \emph{ScribblePrompt}, a flexible neural network based interactive segmentation tool for biomedical imaging that enables human annotators to segment previously unseen structures using scribbles, clicks, and bounding boxes. Through rigorous quantitative experiments, we demonstrate that given comparable amounts of interaction, ScribblePrompt produces more accurate segmentations than previous methods on datasets unseen during training. In a user study with domain experts, ScribblePrompt reduced annotation time by 28% while improving Dice by 15% compared to the next best method. ScribblePrompt's success rests on a set of careful design decisions. These include a training strategy that incorporates both a highly diverse set of images and tasks, novel algorithms for simulated user interactions and labels, and a network that enables fast inference. We showcase ScribblePrompt in an interactive demo, provide code, and release a dataset of scribble annotations at https://scribbleprompt.csail.mit.edu
