TransForce: Transferable Force Prediction for Vision-based Tactile Sensors with Sequential Image Translation
Zhuo Chen, Ni Ou, Xuyang Zhang, Shan Luo
TL;DR
This work tackles the problem of transferring force prediction across vision-based tactile sensors (VBTSs) despite domain gaps in illumination and marker patterns. It introduces TransForce, a two-stage framework that first translates tactile images from a source sensor to a target sensor style using a CycleGAN-like translator, then trains a recurrent force predictor on the generated sequences to estimate forces for unseen sensors. The approach leverages sequential visual cues to better capture elastomer deformation and achieves high accuracy in both normal and shear directions, with marker-based modalities excelling in shear and RGB information aiding normal-force estimation. By enabling reuse of existing image-force data across sensors, TransForce offers a practical pathway to fast, low-cost force calibration and scalable tactile sensing for versatile robot manipulation.
Abstract
Vision-based tactile sensors (VBTSs) provide high-resolution tactile images crucial for robot in-hand manipulation. However, force sensing in VBTSs is underutilized due to the costly and time-intensive process of acquiring paired tactile images and force labels. In this study, we introduce a transferable force prediction model, TransForce, designed to leverage collected image-force paired data for new sensors under varying illumination colors and marker patterns while improving the accuracy of predicted forces, especially in the shear direction. Our model effectively achieves translation of tactile images from the source domain to the target domain, ensuring that the generated tactile images reflect the illumination colors and marker patterns of the new sensors while accurately aligning the elastomer deformation observed in existing sensors, which is beneficial to force prediction of new sensors. As such, a recurrent force prediction model trained with generated sequential tactile images and existing force labels is employed to estimate higher-accuracy forces for new sensors with lowest average errors of 0.69N (5.8\% in full work range) in $x$-axis, 0.70N (5.8\%) in $y$-axis, and 1.11N (6.9\%) in $z$-axis compared with models trained with single images. The experimental results also reveal that pure marker modality is more helpful than the RGB modality in improving the accuracy of force in the shear direction, while the RGB modality show better performance in the normal direction.
