SURESTEP: An Uncertainty-Aware Trajectory Optimization Framework to Enhance Visual Tool Tracking for Robust Surgical Automation
Nikhil U. Shinde, Zih-Yun Chiu, Florian Richter, Jason Lim, Yuheng Zhi, Sylvia Herbert, Michael C. Yip
TL;DR
SURESTEP addresses the challenge of robust surgical automation under perception and kinematic uncertainty by propagating motion and observation noise in a Gaussian EKF belief space and explicitly minimizing the uncertainty at the task horizon. By modeling depth-, FOV-, and orientation-based observation noise and optimizing trajectories to minimize the final covariance trace $\text{Tr}(\boldsymbol{\Sigma}_{T|T})$ (while enforcing $\mathbb{E}[\boldsymbol{x}_T]=\boldsymbol{x}^G$ and adding a pose-loss), it yields trajectories that enhance tool visibility and pose estimation reliability. Empirical results on real dVRK tasks show substantial performance gains: an $82\%$ success rate for needle regrasps versus $18\%$ for baselines, and favorable simulation ablations across multiple uncertainty components. The framework demonstrates the value of uncertainty-aware planning in surgical automation and offers avenues to extend to camera-motion optimization and differentiable collision checking, potentially broadening applicability and robustness in real-world settings.
Abstract
Inaccurate tool localization is one of the main reasons for failures in automating surgical tasks. Imprecise robot kinematics and noisy observations caused by the poor visual acuity of an endoscopic camera make tool tracking challenging. Previous works in surgical automation adopt environment-specific setups or hard-coded strategies instead of explicitly considering motion and observation uncertainty of tool tracking in their policies. In this work, we present SURESTEP, an uncertainty-aware trajectory optimization framework for robust surgical automation. We model the uncertainty of tool tracking with the components motivated by the sources of noise in typical surgical scenes. Using a Gaussian assumption to propagate our uncertainty models through a given tool trajectory, SURESTEP provides a general framework that minimizes the upper bound on the entropy of the final estimated tool distribution. We compare SURESTEP with a baseline method on a real-world suture needle regrasping task under challenging environmental conditions, such as poor lighting and a moving endoscopic camera. The results over 60 regrasps on the da Vinci Research Kit (dVRK) demonstrate that our optimized trajectories significantly outperform the un-optimized baseline.
