A Review of 3D Reconstruction Techniques for Deformable Tissues in Robotic Surgery
Mengya Xu, Ziqi Guo, An Wang, Long Bai, Hongliang Ren
TL;DR
This paper surveys and compares state-of-the-art 3D reconstruction methods for deformable tissues in robotic surgery, focusing on NeRF-based approaches (EndoNeRF, EndoSurf) and explicit Gaussian Splatting methods (LerPlane, 4D-GS). It provides a unified methodology overview, details implementation and training regimes, and evaluates real-time feasibility across three datasets (EndoNeRF, StereoMIS, C3VD), highlighting a domain gap that limits 4D-GS in surgical contexts. Key findings show NeRF-based methods achieve higher quality reconstructions at higher computational cost, LerPlane offers substantial speed advantages with competitive results on deformable scenes, and 4D-GS, despite real-time rendering potential, struggles due to surgical-domain differences. The study underscores the practical trade-offs between reconstruction fidelity, speed, and generalization, guiding future work toward domain-adaptive Gaussian representations and occlusion-aware sampling for real-time surgical scene reconstruction.
Abstract
As a crucial and intricate task in robotic minimally invasive surgery, reconstructing surgical scenes using stereo or monocular endoscopic video holds immense potential for clinical applications. NeRF-based techniques have recently garnered attention for the ability to reconstruct scenes implicitly. On the other hand, Gaussian splatting-based 3D-GS represents scenes explicitly using 3D Gaussians and projects them onto a 2D plane as a replacement for the complex volume rendering in NeRF. However, these methods face challenges regarding surgical scene reconstruction, such as slow inference, dynamic scenes, and surgical tool occlusion. This work explores and reviews state-of-the-art (SOTA) approaches, discussing their innovations and implementation principles. Furthermore, we replicate the models and conduct testing and evaluation on two datasets. The test results demonstrate that with advancements in these techniques, achieving real-time, high-quality reconstructions becomes feasible.
