RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation

Nuren Zhaksylyk; Ibrahim Almakky; Jay Paranjape; S. Swaroop Vedula; Shameema Sikder; Vishal M. Patel; Mohammad Yaqub

RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation

Nuren Zhaksylyk, Ibrahim Almakky, Jay Paranjape, S. Swaroop Vedula, Shameema Sikder, Vishal M. Patel, Mohammad Yaqub

TL;DR

The paper addresses the instability of single-point prompts in SAM-based surgical instrument segmentation under limited annotated data. It introduces RP-SAM2, which incorporates a shift block to make the prompt image-aware and a compound loss to train the shift block using multiple candidate points, all while keeping the base SAM2 components frozen. On Cataract1k, RP-SAM2 achieves approximately a 2% gain in mDSC and a 21% reduction in mHD95 with lower variability, and it enables improved pseudo-mask quality for SAM2-FT on CaDIS with modest fine-tuning. The approach demonstrates practical benefits for semi-automatic labeling in medical imaging, with potential extensions to video segmentation and prompt-tracking for dynamic workflows.

Abstract

Accurate surgical instrument segmentation is essential in cataract surgery for tasks such as skill assessment and workflow optimization. However, limited annotated data makes it difficult to develop fully automatic models. Prompt-based methods like SAM2 offer flexibility yet remain highly sensitive to the point prompt placement, often leading to inconsistent segmentations. We address this issue by introducing RP-SAM2, which incorporates a novel shift block and a compound loss function to stabilize point prompts. Our approach reduces annotator reliance on precise point positioning while maintaining robust segmentation capabilities. Experiments on the Cataract1k dataset demonstrate that RP-SAM2 improves segmentation accuracy, with a 2% mDSC gain, a 21.36% reduction in mHD95, and decreased variance across random single-point prompt results compared to SAM2. Additionally, on the CaDIS dataset, pseudo masks generated by RP-SAM2 for fine-tuning SAM2's mask decoder outperformed those generated by SAM2. These results highlight RP-SAM2 as a practical, stable and reliable solution for semi-automatic instrument segmentation in data-constrained medical settings. The code is available at https://github.com/BioMedIA-MBZUAI/RP-SAM2.

RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation

TL;DR

Abstract

RP-SAM2: Refining Point Prompts for Stable Surgical Instrument Segmentation

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (4)