UltraDP: Generalizable Carotid Ultrasound Scanning with Force-Aware Diffusion Policy
Ruoqu Chen, Xiangjie Yan, Kangchen Lv, Gao Huang, Zheng Li, Xiang Li
TL;DR
This work addresses the generalization bottleneck in autonomous carotid ultrasound scanning by introducing UltraDP, a diffusion-policy framework that fuses multi-modal inputs—ultrasound images $U$, wrist-camera data $I$, contact wrench $w$, and probe pose $x$—to predict safe, smooth navigation actions. A specialized guidance module centers the carotid artery in the image, while a high-frequency hybrid force-impedance controller ensures safe physical interaction with subjects. The authors construct a large real-world dataset (210 scans, 460k samples) and demonstrate that UltraDP generalizes well to unseen subjects, outperforming rule-based and BC baselines in real-world trials. The approach advances practical ultrasound robotics by improving generalization, safety, and efficiency in human-in-the-loop scanning, with potential to scale to broader anatomical regions and imaging tasks.
Abstract
Ultrasound scanning is a critical imaging technique for real-time, non-invasive diagnostics. However, variations in patient anatomy and complex human-in-the-loop interactions pose significant challenges for autonomous robotic scanning. Existing ultrasound scanning robots are commonly limited to relatively low generalization and inefficient data utilization. To overcome these limitations, we present UltraDP, a Diffusion-Policy-based method that receives multi-sensory inputs (ultrasound images, wrist camera images, contact wrench, and probe pose) and generates actions that are fit for multi-modal action distributions in autonomous ultrasound scanning of carotid artery. We propose a specialized guidance module to enable the policy to output actions that center the artery in ultrasound images. To ensure stable contact and safe interaction between the robot and the human subject, a hybrid force-impedance controller is utilized to drive the robot to track such trajectories. Also, we have built a large-scale training dataset for carotid scanning comprising 210 scans with 460k sample pairs from 21 volunteers of both genders. By exploring our guidance module and DP's strong generalization ability, UltraDP achieves a 95% success rate in transverse scanning on previously unseen subjects, demonstrating its effectiveness.
