Autonomous Soft Robotic Guidewire Navigation via Imitation Learning
Noah Barnes, Ji Woong Kim, Lingyun Di, Hannah Qu, Anuruddha Bhattacharjee, Miroslaw Janowski, Dheeraj Gandhi, Bailey Felix, Shaopeng Jiang, Olivia Young, Mark Fuge, Ryan D. Sochol, Jeremy D. Brown, Axel Krieger
TL;DR
This work tackles autonomous navigation of soft robotic guidewires in intracranial aneurysm treatment where fluoroscopic feedback provides only partial visual information. It presents a transformer-based end-to-end imitation-learning framework with goal conditioning, relative motor outputs, and automatic contrast-dye management, formalized as $\pi_{\theta}(a_{t+1:t+K}^{motor}-a_t^{motor}, a_t^{contrast}|o_t)$. The policy leverages a CNN–Transformer architecture with a goal-distance feature map and outputs a motor-action chunk of size $K$ along with a contrast-injection decision, trained with $l_{L1}$ and $l_{BCE}$ losses (total $L=l_{L1}+0.5\,l_{BCE}$); it comprises 95.06M parameters and runs at 25 fps. Trained on 647 demonstrations across 36 modular mazes, the method generalizes to unseen geometries, achieving 83% success on novel layouts and outperforming diffusion, MLP, and centerline baselines, with ablations validating the necessity of recovery data, goal representation, action representation, and contrast strategy. These results suggest a viable path toward bench-top training translating to real-world endovascular interventions with limited additional demonstrations.
Abstract
In endovascular surgery, endovascular interventionists push a thin tube called a catheter, guided by a thin wire to a treatment site inside the patient's blood vessels to treat various conditions such as blood clots, aneurysms, and malformations. Guidewires with robotic tips can enhance maneuverability, but they present challenges in modeling and control. Automation of soft robotic guidewire navigation has the potential to overcome these challenges, increasing the precision and safety of endovascular navigation. In other surgical domains, end-to-end imitation learning has shown promising results. Thus, we develop a transformer-based imitation learning framework with goal conditioning, relative action outputs, and automatic contrast dye injections to enable generalizable soft robotic guidewire navigation in an aneurysm targeting task. We train the model on 36 different modular bifurcated geometries, generating 647 total demonstrations under simulated fluoroscopy, and evaluate it on three previously unseen vascular geometries. The model can autonomously drive the tip of the robot to the aneurysm location with a success rate of 83% on the unseen geometries, outperforming several baselines. In addition, we present ablation and baseline studies to evaluate the effectiveness of each design and data collection choice. Project website: https://softrobotnavigation.github.io/
