Table of Contents
Fetching ...

Vision-Based Force Estimation for Minimally Invasive Telesurgery Through Contact Detection and Local Stiffness Models

Shuyuan Yang, My H. Le, Kyle R. Golobish, Juan C. Beaver, Zonghe Chua

TL;DR

A novel vision-based contact-conditional force estimation for sensory substitution haptic feedback and tissue handling skill evaluation in clinical settings is presented and potential usefulness of contact-conditional force estimation for sensory substitution haptic feedback and tissue handling skill evaluation in clinical settings is suggested.

Abstract

In minimally invasive telesurgery, obtaining accurate force information is difficult due to the complexities of in-vivo end effector force sensing. This constrains development and implementation of haptic feedback and force-based automated performance metrics, respectively. Vision-based force sensing approaches using deep learning are a promising alternative to intrinsic end effector force sensing. However, they have limited ability to generalize to novel scenarios, and require learning on high-quality force sensor training data that can be difficult to obtain. To address these challenges, this paper presents a novel vision-based contact-conditional approach for force estimation in telesurgical environments. Our method leverages supervised learning with human labels and end effector position data to train deep neural networks. Predictions from these trained models are optionally combined with robot joint torque information to estimate forces indirectly from visual data. We benchmark our method against ground truth force sensor data and demonstrate generality by fine-tuning to novel surgical scenarios in a data-efficient manner. Our methods demonstrated greater than 90% accuracy on contact detection and less than 10% force prediction error. These results suggest potential usefulness of contact-conditional force estimation for sensory substitution haptic feedback and tissue handling skill evaluation in clinical settings.

Vision-Based Force Estimation for Minimally Invasive Telesurgery Through Contact Detection and Local Stiffness Models

TL;DR

A novel vision-based contact-conditional force estimation for sensory substitution haptic feedback and tissue handling skill evaluation in clinical settings is presented and potential usefulness of contact-conditional force estimation for sensory substitution haptic feedback and tissue handling skill evaluation in clinical settings is suggested.

Abstract

In minimally invasive telesurgery, obtaining accurate force information is difficult due to the complexities of in-vivo end effector force sensing. This constrains development and implementation of haptic feedback and force-based automated performance metrics, respectively. Vision-based force sensing approaches using deep learning are a promising alternative to intrinsic end effector force sensing. However, they have limited ability to generalize to novel scenarios, and require learning on high-quality force sensor training data that can be difficult to obtain. To address these challenges, this paper presents a novel vision-based contact-conditional approach for force estimation in telesurgical environments. Our method leverages supervised learning with human labels and end effector position data to train deep neural networks. Predictions from these trained models are optionally combined with robot joint torque information to estimate forces indirectly from visual data. We benchmark our method against ground truth force sensor data and demonstrate generality by fine-tuning to novel surgical scenarios in a data-efficient manner. Our methods demonstrated greater than 90% accuracy on contact detection and less than 10% force prediction error. These results suggest potential usefulness of contact-conditional force estimation for sensory substitution haptic feedback and tissue handling skill evaluation in clinical settings.
Paper Structure (23 sections, 6 equations, 6 figures, 7 tables)

This paper contains 23 sections, 6 equations, 6 figures, 7 tables.

Figures (6)

  • Figure 1: Example force predictions for force estimation approaches that require (a) robot state, and (b) no robot state information, for one demonstration from the silicone dataset.
  • Figure 2: Predicted contact probabilities on one demonstration from the realistic dataset for EfficientNetB3 models trained using (a) ground truth contact labels from force sensor measurements, and (b) human contact labels.
  • Figure 3: Scaled position predictions using the graph neural network (GNN) and fully connected network (FCN) compared to the joint encoder-based position estimated of the end effector on the realistic dataset.
  • Figure 4: Example force predictions for force estimation approaches that require (a) robot state, and (b) no robot state information, for one demonstration from the realistic dataset.
  • Figure 5: Box plot showing the accuracy of contact predictions using the EfficientNet-based visual contact detector trained on different numbers of human-labeled examples from the realistic dataset. The model was pre-trained on the silicone dataset. Connected dots represent the mean.
  • ...and 1 more figures