Table of Contents
Fetching ...

Vision-Based Reasoning with Topology-Encoded Graphs for Anatomical Path Disambiguation in Robot-Assisted Endovascular Navigation

Jiyuan Zhao, Zhengyu Shi, Wentong Tian, Tianliang Yao, Dong Liu, Tao Liu, Yizhe Wu, Peng Qi

TL;DR

A two-stage framework (SCAR-UNet-GAT) for real-time robotic path planning and the integration of multi-level attention mechanisms enhances the delineation of thin, tortuous vessels and improves robustness against imaging noise is proposed.

Abstract

Robotic-assisted percutaneous coronary intervention (PCI) is constrained by the inherent limitations of 2D Digital Subtraction Angiography (DSA). Unlike physicians, who can directly manipulate guidewires and integrate tactile feedback with their prior anatomical knowledge, teleoperated robotic systems must rely solely on 2D projections. This mode of operation, simultaneously lacking spatial context and tactile sensation, may give rise to projection-induced ambiguities at vascular bifurcations. To address this challenge, we propose a two-stage framework (SCAR-UNet-GAT) for real-time robotic path planning. In the first stage, SCAR-UNet, a spatial-coordinate-attention-regularized U-Net, is employed for accurate coronary vessel segmentation. The integration of multi-level attention mechanisms enhances the delineation of thin, tortuous vessels and improves robustness against imaging noise. From the resulting binary masks, vessel centerlines and bifurcation points are extracted, and geometric descriptors (e.g., branch diameter, intersection angles) are fused with local DSA patches to construct node features. In the second stage, a Graph Attention Network (GAT) reasons over the vessel graph to identify anatomically consistent and clinically feasible trajectories, effectively distinguishing true bifurcations from projection-induced false crossings. On a clinical DSA dataset, SCAR-UNet achieved a Dice coefficient of 93.1%. For path disambiguation, the proposed GAT-based method attained a success rate of 95.0% and a target-arrival success rate of 90.0%, substantially outperforming conventional shortest-path planning (60.0% and 55.0%) and heuristic-based planning (75.0% and 70.0%). Validation on a robotic platform further confirmed the practical feasibility and robustness of the proposed framework.

Vision-Based Reasoning with Topology-Encoded Graphs for Anatomical Path Disambiguation in Robot-Assisted Endovascular Navigation

TL;DR

A two-stage framework (SCAR-UNet-GAT) for real-time robotic path planning and the integration of multi-level attention mechanisms enhances the delineation of thin, tortuous vessels and improves robustness against imaging noise is proposed.

Abstract

Robotic-assisted percutaneous coronary intervention (PCI) is constrained by the inherent limitations of 2D Digital Subtraction Angiography (DSA). Unlike physicians, who can directly manipulate guidewires and integrate tactile feedback with their prior anatomical knowledge, teleoperated robotic systems must rely solely on 2D projections. This mode of operation, simultaneously lacking spatial context and tactile sensation, may give rise to projection-induced ambiguities at vascular bifurcations. To address this challenge, we propose a two-stage framework (SCAR-UNet-GAT) for real-time robotic path planning. In the first stage, SCAR-UNet, a spatial-coordinate-attention-regularized U-Net, is employed for accurate coronary vessel segmentation. The integration of multi-level attention mechanisms enhances the delineation of thin, tortuous vessels and improves robustness against imaging noise. From the resulting binary masks, vessel centerlines and bifurcation points are extracted, and geometric descriptors (e.g., branch diameter, intersection angles) are fused with local DSA patches to construct node features. In the second stage, a Graph Attention Network (GAT) reasons over the vessel graph to identify anatomically consistent and clinically feasible trajectories, effectively distinguishing true bifurcations from projection-induced false crossings. On a clinical DSA dataset, SCAR-UNet achieved a Dice coefficient of 93.1%. For path disambiguation, the proposed GAT-based method attained a success rate of 95.0% and a target-arrival success rate of 90.0%, substantially outperforming conventional shortest-path planning (60.0% and 55.0%) and heuristic-based planning (75.0% and 70.0%). Validation on a robotic platform further confirmed the practical feasibility and robustness of the proposed framework.
Paper Structure (14 sections, 8 equations, 3 figures, 3 tables, 1 algorithm)

This paper contains 14 sections, 8 equations, 3 figures, 3 tables, 1 algorithm.

Figures (3)

  • Figure 1: The robotic intervention procedure relies on real-time 2D DSA images acquired by a C-arm X-ray system for path planning, providing precise navigation guidance for the robotic operation. (a) The intraoperative setup shows the endovascular robot operating under DSA guidance. (b) An apparent crossing point appears in the 2D projection when vessels at different depths visually overlap, which may be misinterpreted as a true bifurcation. (c) Correct path planning avoids such projection-induced crossings, while (d) invalid planning erroneously treats them as real intersections, potentially leading to navigation errors. The schematic was created using BioRender (https://biorender.com).
  • Figure 2: Overview of the proposed SCAR-UNet-GAT framework for robotic vascular path planning. (a) The pipeline combines SCAR-UNet-based segmentation of DSA image sequences with topology-aware GAT-based path planning, and is deployed on a robotic interventional system to provide navigation guidance for intervention procedures. (b) The SCAR-UNet segmentation module uses an encoder-decoder architecture with spatial-coordinate attention, integrated with multiple attention mechanisms and ASPP, enabling accurate extraction of vascular masks from DSA images. (c) The topology-aware path planning module extracts vessel centerlines and crossing features from segmentation results, fuses image patch features at intersections, constructs a vessel graph, predicts path probabilities using GAT reasoning, and then undergoes post-processing to determine the planned route.
  • Figure 3: Validation of Robotic Vascular Path Planning on Clinical Data and Robotic Systems. (a) Results on clinical DSA images, illustrating the details of apparent crossing points and the visual comparison among three methods: shortest-path, heuristic-based, and the proposed approach. (b) The robotic intervention platform, consisting of a robotic arm, gripper, monocular camera, pulse pump, and vascular phantom. The monocular camera provides DSA-like vascular images as inputs for the path planning system. (c) Results on vascular phantom images, further demonstrating the disambiguation of apparent crossing points and the comparative visualization of the three methods. (d) Validation on robotic systems, where YOLOv5 was employed for real-time guidewire tip detection and PID-based control was used to regulate advancement and rotation of the gripper, enabling the guidewire to follow the planned trajectory and successfully reach the target point.