Hybrid Deep Reinforcement Learning for Radio Tracer Localisation in Robotic-assisted Radioguided Surgery
Hanyi Zhang, Kaizhong Deng, Zhaoyang Jacopo Hu, Baoru Huang, Daniel S. Elson
TL;DR
The paper addresses the challenge of autonomous radiotracer localization in robot-assisted radioguided surgery, where traditional gamma-probe guidance depends heavily on operator expertise. It introduces a two-phase hybrid method that first uses adaptive grid-based scanning to provide directional priors and then employs a PPO-based DRL agent, augmented by an Angle Prediction Module and a Corrected Solid Angle Model, to precisely navigate to the radiotracer. Key contributions include the Phase I adaptive scanning protocol with a 5x5 grid and progressively narrowed search, the Phase II DRL framework with a multi-component reward and angle estimation, and extensive validation showing improved accuracy, efficiency, and robustness in simulation (3,200 runs) and real-world tests on the dVRK (80% success). The work has practical implications for reducing operator dependency and increasing procedural consistency in radioguided surgeries, paving the way for broader adoption of automated radiotracer localization.
Abstract
Radioguided surgery, such as sentinel lymph node biopsy, relies on the precise localization of radioactive targets by non-imaging gamma/beta detectors. Manual radioactive target detection based on visual display or audible indication of gamma level is highly dependent on the ability of the surgeon to track and interpret the spatial information. This paper presents a learning-based method to realize the autonomous radiotracer detection in robot-assisted surgeries by navigating the probe to the radioactive target. We proposed novel hybrid approach that combines deep reinforcement learning (DRL) with adaptive robotic scanning. The adaptive grid-based scanning could provide initial direction estimation while the DRL-based agent could efficiently navigate to the target utilising historical data. Simulation experiments demonstrate a 95% success rate, and improved efficiency and robustness compared to conventional techniques. Real-world evaluation on the da Vinci Research Kit (dVRK) further confirms the feasibility of the approach, achieving an 80% success rate in radiotracer detection. This method has the potential to enhance consistency, reduce operator dependency, and improve procedural accuracy in radioguided surgeries.
