SurgeMOD: Translating image-space tissue motions into vision-based surgical forces

Mikel De Iturrate Reyzabal, Dionysios Malas, Shuai Wang, Sebastien Ourselin, Hongbin Liu

TL;DR

This work tackles vision-based force estimation in minimally invasive robotic surgery by deriving an image-space motion basis from natural organ motions. It introduces a frequency-domain modal framework where motion textures are transformed into mode shapes via FFT and restricted to $K=20$ low-frequency components, enabling a dynamic-constraint formulation to infer forces directly from video. The method is validated on silicone phantom and ex-vivo porcine tissue, achieving strong alignment with force sensor readings and producing interpretable force textures in the image space. It provides a principled vision-based baseline for surgical force estimation and points to practical extensions, including depth cues and automated segmentation, to enhance haptic feedback in real-time procedures.

Abstract

We present a new approach for vision-based force estimation in Minimally Invasive Robotic Surgery based on a frequency-domain basis of organ motion derived directly from video. Using internal movements generated by natural processes such as breathing or the cardiac cycle, we infer the image-space basis of the motion in the frequency domain. Working in this representation, we discretize the problem to a limited number of low frequencies to build an image-space mechanical model of the environment. We use this pre-built model to pose force estimation as a dynamic-constraint problem. We demonstrate that the method reliably estimates point contact forces in silicone phantom and ex-vivo experiments, matching readings from a force sensor. In addition, we perform qualitative experiments in which we synthesize coherent force textures from surgical videos over a region of interest selected by the user. Our method performs well in both quantitative and qualitative analyses, providing a solid starting point for a purely vision-based method of surgical force estimation.
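To make the frequency-domain basis concrete, here is a minimal sketch of the kind of modal decomposition the abstract describes. It assumes the tissue motion is already available as a dense per-pixel displacement field (e.g. from optical flow); the function name `modal_basis_from_flow`, the array shapes, and the synthetic example are our own illustrative choices, not the paper's implementation.

```python
import numpy as np

def modal_basis_from_flow(flow, K=20):
    """Extract an image-space motion basis from a video of displacements.

    flow : (T, H, W, 2) array of per-pixel displacements over T frames.
    Returns the K lowest positive temporal frequencies and the complex
    mode shapes (K, H, W, 2) at those frequencies.
    """
    T = flow.shape[0]
    spectrum = np.fft.rfft(flow, axis=0)   # positive-frequency half of the FFT
    freqs = np.fft.rfftfreq(T)             # normalized frequencies in [0, 0.5]
    idx = np.arange(1, K + 1)              # skip DC, keep the K lowest frequencies
    return freqs[idx], spectrum[idx]

# Synthetic example: motion oscillating at a single temporal frequency
T, H, W = 64, 8, 8
t = np.arange(T)
flow = np.zeros((T, H, W, 2))
flow[..., 0] = np.sin(2 * np.pi * 4 * t / T)[:, None, None]  # 4 cycles over T frames
freqs, modes = modal_basis_from_flow(flow, K=20)
print(modes.shape)  # (20, 8, 8, 2)
```

Restricting the basis to the first K frequency bins mirrors the paper's observation that natural cyclic motions (breathing, heartbeat) concentrate their energy in the low-frequency part of the spectrum.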
Paper Structure

This paper contains 16 sections, 8 equations, 6 figures, 3 tables.

Figures (6)

  • Figure 1: The graph presents the mean amplitude of the frequency analysis of tissue motion on camera for every frequency over the time period $T$. We analyse all $T/2$ positive frequencies of the spectrum. The selected $K$ frequencies are marked with pink triangular markers. From the graph, we can observe that the majority of the information from the motion generated by the natural cycles is concentrated in the first $15$-$20$ frequencies.
  • Figure 2: Graphical diagram of our experimental setup. Our testing platform is either: silicone phantom or an ex-vivo porcine cardiac tissue. This is connected to a pressure regulator that controls the cyclic cardiac motion of the heart. We place the camera close enough to obtain a clear recording of the motion and a light source to create consistent lighting to reduce its effect on the optical flow prediction.
  • Figure 3: Diagram representation of our proposed force estimation algorithm. The force estimator runs in two steps: calculation of the mode shapes (left) and force estimation (right). The optical flow postprocessing helps produce a more uniform distribution of the power in the frequency domain, as can be observed in the power spectrum (Figure 1).
  • Figure 4: Representation of the different spaces in which forces are represented. On the left we have the 3D Cartesian space representation of the force. This is the force recorded by the sensor for our testing data. On the right, we have the projection of the 3D space into the camera coordinates u-v. The force is also projected into this image space and it is related to the Cartesian space through the intrinsic parameters of the camera.
  • Figure 5: Results of the force comparison between the predicted and measured contact force. This example presents a classic poking scenario that we can divide into four steps based on the force response: 1) rest, there is no contact; 2) push, the contact starts and increases in intensity; 3) lock, the moment of maximum force, at which we hold the tool for a short period of time; and 4) release, the contact between the tool and the tissue is relaxed.
  • ...and 1 more figure
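The relation between Cartesian and image-space forces described in Figure 4 can be sketched with the Jacobian of the standard pinhole projection $u = f_x X/Z$, $v = f_y Y/Z$ evaluated at the contact point. This is a generic illustration of that projection, not the paper's exact formulation; `project_force_to_image` and the focal-length values are assumptions for the example.

```python
import numpy as np

def project_force_to_image(f_xyz, p_xyz, fx, fy):
    """Map a 3D Cartesian force into image (u, v) coordinates.

    f_xyz : 3D force vector at the contact point.
    p_xyz : contact point (X, Y, Z) in camera coordinates, Z > 0.
    fx, fy : camera focal lengths in pixels (from the intrinsics).
    """
    X, Y, Z = p_xyz
    # Jacobian of the pinhole projection at the contact point
    J = np.array([
        [fx / Z, 0.0,    -fx * X / Z**2],
        [0.0,    fy / Z, -fy * Y / Z**2],
    ])
    return J @ np.asarray(f_xyz)

# A force along the optical axis at the image centre has no image-space component
f_uv = project_force_to_image([0.0, 0.0, 1.0], [0.0, 0.0, 0.1], fx=800, fy=800)
print(f_uv)  # [0. 0.]
```

This also illustrates the limitation the figure hints at: the out-of-plane force component is only weakly observable in the image, which is why depth cues are listed as a practical extension.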