Safe Uncertainty-Aware Learning of Robotic Suturing
Wilbert Peter Empleo, Yitaek Kim, Hansoul Kim, Thiusius Rajeeth Savarimuthu, Iñigo Iturrate
TL;DR
Safe uncertainty-aware autonomous robotic suturing is addressed by combining an ensemble of diffusion policies with Out-of-Distribution detection to decide when to hand control to a surgeon. The approach adds a model-free Control Barrier Function to impose formal safety constraints on the robot controller. The paper demonstrates that the diffusion-policy ensemble provides meaningful epistemic uncertainty estimates and that the CBF can constrain actions to a predefined safe set even under unsafe predictions. Evaluations in a surgical simulator show robust task execution under perturbations such as needle drops, camera shifts, and phantom movements, with OOD detection enabling timely human intervention.
Abstract
Robot-Assisted Minimally Invasive Surgery is currently fully manually controlled by a trained surgeon. Automating this has great potential for alleviating issues, e.g., physical strain, highly repetitive tasks, and shortages of trained surgeons. For these reasons, recent works have utilized Artificial Intelligence methods, which show promising adaptability. Despite these advances, there is skepticism of these methods because they lack explainability and robust safety guarantees. This paper presents a framework for a safe, uncertainty-aware learning method. We train an Ensemble Model of Diffusion Policies using expert demonstrations of needle insertion. Using an Ensemble model, we can quantify the policy's epistemic uncertainty, which is used to determine Out-Of-Distribution scenarios. This allows the system to release control back to the surgeon in the event of an unsafe scenario. Additionally, we implement a model-free Control Barrier Function to place formal safety guarantees on the predicted action. We experimentally evaluate our proposed framework using a state-of-the-art robotic suturing simulator. We evaluate multiple scenarios, such as dropping the needle, moving the camera, and moving the phantom. The learned policy is robust to these perturbations, showing corrective behaviors and generalization, and it is possible to detect Out-Of-Distribution scenarios. We further demonstrate that the Control Barrier Function successfully limits the action to remain within our specified safety set in the case of unsafe predictions.
