SurgicAI: A Hierarchical Platform for Fine-Grained Surgical Policy Learning and Benchmarking
Jin Wu, Haoying Zhou, Peter Kazanzides, Adnan Munawar, Anqi Liu
TL;DR
SurgicAI addresses the challenge of automating complex robotic suturing by providing a deformable-thread, AMBF-based simulation platform compatible with the da Vinci system, paired with a standardized data pipeline and benchmark suite. Its core innovation is a hierarchical learning framework comprising High-Level and Low-Level policies to manage multi-stage suturing tasks, enabling reusable subskills and scalable policy learning. Empirical results show that pure online RL struggles under sparse rewards, while integrating expert demonstrations via imitation learning or hybrid RL-IL approaches (e.g., TD3+HER+BC) yields higher success rates and more efficient trajectories; offline methods also perform well with dense rewards. The platform’s modularity, extensive data collection capabilities, and open maintenance pipeline position SurgicAI as a practical tool to advance policy learning in surgical robotics and to bridge simulation-to-real transfer, with future plans for more realistic rendering, broader algorithms, and broader collaboration.
Abstract
Despite advancements in robotic-assisted surgery, automating complex tasks like suturing remain challenging due to the need for adaptability and precision. Learning-based approaches, particularly reinforcement learning (RL) and imitation learning (IL), require realistic simulation environments for efficient data collection. However, current platforms often include only relatively simple, non-dexterous manipulations and lack the flexibility required for effective learning and generalization. We introduce SurgicAI, a novel platform for development and benchmarking addressing these challenges by providing the flexibility to accommodate both modular subtasks and more importantly task decomposition in RL-based surgical robotics. Compatible with the da Vinci Surgical System, SurgicAI offers a standardized pipeline for collecting and utilizing expert demonstrations. It supports deployment of multiple RL and IL approaches, and the training of both singular and compositional subtasks in suturing scenarios, featuring high dexterity and modularization. Meanwhile, SurgicAI sets clear metrics and benchmarks for the assessment of learned policies. We implemented and evaluated multiple RL and IL algorithms on SurgicAI. Our detailed benchmark analysis underscores SurgicAI's potential to advance policy learning in surgical robotics. Details: https://github.com/surgical-robotics-ai/SurgicAI
