Imagine2touch: Predictive Tactile Sensing for Robotic Manipulation using Efficient Low-Dimensional Signals
Abdallah Ayad, Adrian Röfer, Nick Heppert, Abhinav Valada
TL;DR
Imagine2touch introduces a cross-modal framework that predicts tactile readings from shallow depth-image patches to endow robots with predictive touch capabilities. The approach uses a compact neural architecture to map $z_d \in \mathbb{R}^{48\times48}$ to $\tilde{\tau} \in \mathbb{R}^{15}$, trained on a small, low-cost ReSkin dataset of 1630 tactile–vision pairs, and evaluated via an ensemble-based object recognition pipeline. Results show that the predictive tactile signal supports object recognition after multiple touches, outperforming a proprioceptive baseline and generalizing to out-of-distribution objects. This work highlights the practical potential of inexpensive tactile sensors for visuo-tactile perception and points to future directions such as reconstructing depth from tactile signals for full 3D tactile understanding.
Abstract
Humans seemingly incorporate potential touch signals in their perception. Our goal is to equip robots with a similar capability, which we term Imagine2touch. Imagine2touch aims to predict the expected touch signal based on a visual patch representing the area to be touched. We use ReSkin, an inexpensive and compact touch sensor to collect the required dataset through random touching of five basic geometric shapes, and one tool. We train Imagine2touch on two out of those shapes and validate it on the ood. tool. We demonstrate the efficacy of Imagine2touch through its application to the downstream task of object recognition. In this task, we evaluate Imagine2touch performance in two experiments, together comprising 5 out of training distribution objects. Imagine2touch achieves an object recognition accuracy of 58% after ten touches per object, surpassing a proprioception baseline.
