Towards Human Haptic Gesture Interpretation for Robotic Systems
Bibit Bianchini, Prateek Verma, Kenneth Salisbury
TL;DR
The paper addresses the lack of standardized tactile sensing for natural human-robot interactions by proposing a four-gesture dictionary (Tap, Touch, Grab, Slip) and collecting a UR5e wrist force-torque dataset to benchmark multiple feature sets and classifiers. It compares six feature representations, including two autoencoder-based bottlenecks and three manual feature sets, across three classifier families, concluding that neural networks trained on raw force data achieve the best test accuracy (81%), with competitive results aligning with prior literature when gestures are mapped appropriately. The work demonstrates that simple, common force-torque sensing can rival more complex tactile sensors for gesture interpretation, delivering a reproducible benchmark and actionable guidance for future pHRI research. Overall, the approach offers a practical, data-efficient path toward robust haptic gesture understanding in robotic systems and highlights avenues for robustness and active exploration in dynamic environments.
Abstract
Physical human-robot interactions (pHRI) are less efficient and communicative than human-human interactions, and a key reason is a lack of informative sense of touch in robotic systems. Interpreting human touch gestures is a nuanced, challenging task with extreme gaps between human and robot capability. Among prior works that demonstrate human touch recognition capability, differences in sensors, gesture classes, feature sets, and classification algorithms yield a conglomerate of non-transferable results and a glaring lack of a standard. To address this gap, this work presents 1) four proposed touch gesture classes that cover an important subset of the gesture characteristics identified in the literature, 2) the collection of an extensive force dataset on a common pHRI robotic arm with only its internal wrist force-torque sensor, and 3) an exhaustive performance comparison of combinations of feature sets and classification algorithms on this dataset. We demonstrate high classification accuracies among our proposed gesture definitions on a test set, emphasizing that neural net-work classifiers on the raw data outperform other combinations of feature sets and algorithms. The accompanying video is here: https://youtu.be/gJPVImNKU68
