A multi-modal tactile fingertip design for robotic hands to enhance dexterous manipulation
Zhuowei Xu, Zilin Si, Kevin Zhang, Oliver Kroemer, Zeynep Temel
TL;DR
This work introduces a compact, low-cost multi-modal tactile fingertip that embeds strain gauges for static force sensing and a contact microphone for vibrotactile sensing inside the fingertip. The design emphasizes manufacturability, durability, and ease of integration, achieving a footprint of $1.9\,\mathrm{cm}\times1.9\,\mathrm{cm}\times2.7\,\mathrm{cm}$ and a cost below $100. Through extensive characterization, the fingertip provides accurate 2D force estimates in the $0$–$5$ N range and high material-discrimination accuracy ($\approx$ $95.49\%$) from vibrotactile signals. Evaluations on three manipulation tasks show that tactile sensing can either supplement or replace vision under occlusion, delivering perfect performance on fragile-object pinching and cup counting, and strong performance in hidden-object material detection ($94.64\%$), highlighting the practical impact of compact, interpretable tactile sensing for dexterous robotic manipulation.
Abstract
Tactile sensing holds great promise for enhancing manipulation precision and versatility, but its adoption in robotic hands remains limited due to high sensor costs, manufacturing and integration challenges, and difficulties in extracting expressive and reliable information from signals. In this work, we present a low-cost, easy-to-make, adaptable, and compact fingertip design for robotic hands that integrates multi-modal tactile sensors. We use strain gauge sensors to capture static forces and a contact microphone sensor to measure high-frequency vibrations during contact. These tactile sensors are integrated into a compact design with a minimal sensor footprint, and all sensors are internal to the fingertip and therefore not susceptible to direct wear and tear from interactions. From sensor characterization, we show that strain gauge sensors provide repeatable 2D planar force measurements in the 0-5 N range and the contact microphone sensor has the capability to distinguish contact material properties. We apply our design to three dexterous manipulation tasks that range from zero to full visual occlusion. Given the expressiveness and reliability of tactile sensor readings, we show that different tactile sensing modalities can be used flexibly in different stages of manipulation, solely or together with visual observations to achieve improved task performance. For instance, we can precisely count and unstack a desired number of paper cups from a stack with 100\% success rate which is hard to achieve with vision only.
