ToDER: Towards Colonoscopy Depth Estimation and Reconstruction with Geometry Constraint Adaptation
Zhenhua Wu, Yanlin Jin, Liangdong Qiu, Xiaoguang Han, Xiang Wan, Guanbin Li
TL;DR
ToDER tackles the challenge of depth estimation and 3D reconstruction in optical colonoscopy by leveraging a bi-directional domain adaptation framework with a dedicated TNet to enforce geometric constraints. The approach combines style translation between synthetic and realistic domains, dual depth networks, and a geometry-aware refinement step, followed by surfel-based reconstruction. Experimental results on synthetic and realistic data demonstrate superior depth accuracy and high-quality reconstructions, with ablations confirming the effectiveness of bi-directional adaptation and TNet. The method offers a practical pathway to visualize unobserved colon regions and potentially reduce misdiagnoses in clinical practice.
Abstract
Visualizing colonoscopy is crucial for medical auxiliary diagnosis to prevent undetected polyps in areas that are not fully observed. Traditional feature-based and depth-based reconstruction approaches usually end up with undesirable results due to incorrect point matching or imprecise depth estimation in realistic colonoscopy videos. Modern deep-based methods often require a sufficient number of ground truth samples, which are generally hard to obtain in optical colonoscopy. To address this issue, self-supervised and domain adaptation methods have been explored. However, these methods neglect geometry constraints and exhibit lower accuracy in predicting detailed depth. We thus propose a novel reconstruction pipeline with a bi-directional adaptation architecture named ToDER to get precise depth estimations. Furthermore, we carefully design a TNet module in our adaptation architecture to yield geometry constraints and obtain better depth quality. Estimated depth is finally utilized to reconstruct a reliable colon model for visualization. Experimental results demonstrate that our approach can precisely predict depth maps in both realistic and synthetic colonoscopy videos compared with other self-supervised and domain adaptation methods. Our method on realistic colonoscopy also shows the great potential for visualizing unobserved regions and preventing misdiagnoses.
