Vision-Based Online Key Point Estimation of Deformable Robots
Hehui Zheng, Sebastian Pinzello, Barnabas Gavin Cangan, Thomas Buchner, Robert K. Katzschmann
TL;DR
This work tackles the challenge of estimating the shape of highly deformable soft robots with infinite degrees of freedom by introducing VOKE, a two-view, vision-based CNN regression framework. VOKE outputs either a point-based set of key points or a piecewise constant curvature (PCC) model from paired grayscale images, enabling online, marker-less 3D shape estimation without prior shape knowledge. Across wax-cast, SoPrA, and a soft robotic fish, VOKE demonstrates competitive accuracy, robustness to lighting and noise, and real-time performance, outperforming existing marker-less baselines by up to 4.5% in tip estimation error. The approach lays groundwork for closed-loop control of soft robots by providing reliable, online shape estimates with a calibration-free camera alignment strategy.
Abstract
The precise control of soft and continuum robots requires knowledge of their shape, which has, in contrast to classical rigid robots, infinite degrees of freedom. To partially reconstruct the shape, proprioceptive techniques use built-in sensors resulting in inaccurate results and increased fabrication complexity. Exteroceptive methods so far rely on expensive tracking systems with reflective markers placed on all components, which are infeasible for deformable robots interacting with the environment due to marker occlusion and damage. Here, a regression approach is presented for 3D key point estimation using a convolutional neural network. The proposed approach takes advantage of data-driven supervised learning and is capable of online marker-less estimation during inference. Two images of a robotic system are taken simultaneously at 25 Hz from two different perspectives, and are fed to the network, which returns for each pair the parameterized key point or PCC shape representations. The proposed approach outperforms marker-less state-of-the-art methods by a maximum of 4.5% in estimation accuracy while at the same time being more robust and requiring no prior knowledge of the shape. Online evaluations on two types of soft robotic arms and a soft robotic fish demonstrate our method's accuracy and versatility on highly deformable systems.
