ETSM: Automating Dissection Trajectory Suggestion and Confidence Map-Based Safety Margin Prediction for Robot-assisted Endoscopic Submucosal Dissection
Mengya Xu, Wenjin Mo, Guankun Wang, Huxin Gao, An Wang, Long Bai, Chaoyang Lyu, Xiaoxiao Yang, Zhen Li, Hongliang Ren
TL;DR
This work targets safer and more efficient robot-assisted ESD by predicting both optimal dissection trajectories and confidence-based safety margins. It introduces the ETSM dataset, featuring annotated dissection trajectories and margins from dual-arm robotic ESD videos, and develops an AI planning system with a dissection trajectory predictor and a confidence-map-based margin predictor (RCMNet). RCMNet fuses a Transformer-based encoder (DINOv2) with a regression decoder (All-MLP) to produce dense 1-channel safety maps, trained with a weighted MSE that emphasizes margins. Experimental results show strong trajectory prediction with BC, competitive safety-margin prediction across resolutions, robustness to common corruptions, and reasonable out-of-domain generalization, highlighting practical potential for intraoperative guidance and future automation in ESD. The study contributes a first regression-based approach to delineating varying safety levels in dissection areas and lays groundwork for temporally-aware improvements and broader clinical deployment.
Abstract
Robot-assisted Endoscopic Submucosal Dissection (ESD) improves the surgical procedure by providing a more comprehensive view through advanced robotic instruments and bimanual operation, thereby enhancing dissection efficiency and accuracy. Accurate prediction of dissection trajectories is crucial for better decision-making, reducing intraoperative errors, and improving surgical training. Nevertheless, predicting these trajectories is challenging due to variable tumor margins and dynamic visual conditions. To address this issue, we create the ESD Trajectory and Confidence Map-based Safety Margin (ETSM) dataset with $1849$ short clips, focusing on submucosal dissection with a dual-arm robotic system. We also introduce a framework that combines optimal dissection trajectory prediction with a confidence map-based safety margin, providing a more secure and intelligent decision-making tool to minimize surgical risks for ESD procedures. Additionally, we propose the Regression-based Confidence Map Prediction Network (RCMNet), which utilizes a regression approach to predict confidence maps for dissection areas, thereby delineating various levels of safety margins. We evaluate our RCMNet using three distinct experimental setups: in-domain evaluation, robustness assessment, and out-of-domain evaluation. Experimental results show that our approach excels in the confidence map-based safety margin prediction task, achieving a mean absolute error (MAE) of only $3.18$. To the best of our knowledge, this is the first study to apply a regression approach for visual guidance concerning delineating varying safety levels of dissection areas. Our approach bridges gaps in current research by improving prediction accuracy and enhancing the safety of the dissection process, showing great clinical significance in practice.
