Amodal Segmentation for Laparoscopic Surgery Video Instruments
Ruohua Shi, Zhaochen Liu, Lingyu Duan, Tingting Jiang
TL;DR
This study addresses occlusion in laparoscopic instrument segmentation by applying amodal segmentation to predict both visible and occluded parts of surgical tools. It introduces the AIS dataset, derived from the 2017 EndoVis Robotic Instrument Segmentation Challenge, with full amodal masks for 7,084 instrument instances at 1024×1280, and benchmarks four amodal methods (SAM, AISFormer, C2F-Seg, PLUG) on this medical domain. PLUG achieves the top mean IoU of 89.25, demonstrating the feasibility and value of amodal segmentation for surgical workflows. The work enables improved intraoperative guidance, post hoc procedure analysis, and education by revealing complete instrument shapes and trajectories even under occlusion.
Abstract
Segmentation of surgical instruments is crucial for enhancing surgeon performance and ensuring patient safety. Conventional techniques such as binary, semantic, and instance segmentation share a common drawback: they do not accommodate the parts of instruments obscured by tissues or other instruments. Precisely predicting the full extent of these occluded instruments can significantly improve laparoscopic surgeries by providing critical guidance during operations and assisting in the analysis of potential surgical errors, as well as serving educational purposes. In this paper, we introduce Amodal Segmentation to the realm of surgical instruments in the medical field. This technique identifies both the visible and occluded parts of an object. To achieve this, we introduce a new Amoal Instruments Segmentation (AIS) dataset, which was developed by reannotating each instrument with its complete mask, utilizing the 2017 MICCAI EndoVis Robotic Instrument Segmentation Challenge dataset. Additionally, we evaluate several leading amodal segmentation methods to establish a benchmark for this new dataset.
