S4M: 4-points to Segment Anything
Adrien Meyer, Lorenzo Arboit, Giuseppe Massimiani, Shih-Min Yin, Didier Mutter, Nicolas Padoy
TL;DR
S4M addresses the annotation bottleneck in medical segmentation by replacing iterative corrections with geometry-aware prompting. It introduces 4-point prompts with role-specific embeddings (extreme or major/minor) and a Canvas auxiliary task that trains the model to reason about shape from prompts alone. Across eight ultrasound and endoscopy datasets, S4M achieves a consistent +3.42 $mIoU$ gain over a strong SAM baseline and enables faster annotation via major/minor prompts. The approach preserves compatibility with existing prompting modes, enhances robustness to complex shape boundaries, and better aligns with clinical measurement practices, paving the way for scalable medical imaging datasets.
Abstract
Purpose: The Segment Anything Model (SAM) promises to ease the annotation bottleneck in medical segmentation, but overlapping anatomy and blurred boundaries make its point prompts ambiguous, leading to cycles of manual refinement to achieve precise masks. Better prompting strategies are needed. Methods: We propose a structured prompting strategy using 4 points as a compact instance-level shape description. We study two 4-point variants: extreme points and the proposed major/minor axis endpoints, inspired by ultrasound measurement practice. SAM cannot fully exploit such structured prompts because it treats all points identically and lacks geometry-aware reasoning. To address this, we introduce S4M (4-points to Segment Anything), which augments SAM to interpret 4 points as relational cues rather than isolated clicks. S4M expands the prompt space with role-specific embeddings and adds an auxiliary "Canvas" pretext task that sketches coarse masks directly from prompts, fostering geometry-aware reasoning. Results: Across eight datasets in ultrasound and surgical endoscopy, S4M improves segmentation by +3.42 mIoU over a strong SAM baseline at equal prompt budget. An annotation study with three clinicians further shows that major/minor prompts enable faster annotation. Conclusion: S4M increases performance, reduces annotation effort, and aligns prompting with clinical practice, enabling more scalable dataset development in medical imaging.
