MoSS: Monocular Shape Sensing for Continuum Robots
Chengnan Shentu, Enxu Li, Chaojun Chen, Puspita Triana Dewi, David B. Lindell, Jessica Burgner-Kahrs
TL;DR
MoSSNet tackles the challenge of real-time 3D shape sensing for continuum robots using a single RGB camera. It introduces an encoder–decoder network with three decoders (centerline, arclength, and importance) and a weighted curve-fitting step to recover a 3D centerline from monocular imagery, achieving 0.91 mm mean shape error at 70 Hz on real data. The method is validated on both real hardware and simulation, showing strong sim-to-real transfer and robustness to varying camera configurations, without fiducial markers or camera calibration. The work also provides large MoSS-Real and MoSS-Sim datasets and demonstrates significant potential for low-cost, real-time continuum-robot sensing in medical and industrial applications.
Abstract
Continuum robots are promising candidates for interactive tasks in medical and industrial applications due to their unique shape, compliance, and miniaturization capability. Accurate and real-time shape sensing is essential for such tasks yet remains a challenge. Embedded shape sensing has high hardware complexity and cost, while vision-based methods require stereo setup and struggle to achieve real-time performance. This paper proposes the first eye-to-hand monocular approach to continuum robot shape sensing. Utilizing a deep encoder-decoder network, our method, MoSSNet, eliminates the computation cost of stereo matching and reduces requirements on sensing hardware. In particular, MoSSNet comprises an encoder and three parallel decoders to uncover spatial, length, and contour information from a single RGB image, and then obtains the 3D shape through curve fitting. A two-segment tendon-driven continuum robot is used for data collection and testing, demonstrating accurate (mean shape error of 0.91 mm, or 0.36% of robot length) and real-time (70 fps) shape sensing on real-world data. Additionally, the method is optimized end-to-end and does not require fiducial markers, manual segmentation, or camera calibration. Code and datasets will be made available at https://github.com/ContinuumRoboticsLab/MoSSNet.
