Towards Intuitive Human-Robot Interaction through Embodied Gesture-Driven Control with Woven Tactile Skins
ChunPing Lam, Xiangjia Chen, Chenming Wu, Hao Chen, Binzhi Sun, Guoxin Fang, Charlie C. L. Wang, Chengkai Dai, Yeung Yam
TL;DR
This work introduces an embodied HRI framework that uses a capacitance-based woven tactile skin to enable intuitive gesture-driven control on curved robot surfaces. A 14-gesture vocabulary maps to task-space translations, rotations, and auxiliary commands, while a lightweight CNN-stem and temporal transformer recognizer delivers real-time, near-100% gesture accuracy. Validation on a 6-DOF robot during vision-free pick-and-place and pouring tasks shows substantial gains in efficiency (up to 57% faster) and high reliability, highlighting the approach's potential to narrow the gap between human intent and robot action. The study demonstrates a practical pathway toward more natural, efficient embodied HRI and points to future work on broader gesture sets, larger sensing areas, and multimodal integration.
Abstract
This paper presents a novel human-robot interaction (HRI) framework that enables intuitive gesture-driven control through a capacitance-based woven tactile skin. Unlike conventional interfaces that rely on panels or handheld devices, the woven tactile skin integrates seamlessly with curved robot surfaces, enabling embodied interaction and narrowing the gap between human intent and robot response. Its woven design combines fabric-like flexibility with structural stability and dense multi-channel sensing through the interlaced conductive threads. Building on this capability, we define a gesture-action mapping of 14 single- and multi-touch gestures that cover representative robot commands, including task-space motion and auxiliary functions. A lightweight convolution-transformer model designed for gesture recognition in real time achieves an accuracy of near-100%, outperforming prior baseline approaches. Experiments on robot arm tasks, including pick-and-place and pouring, demonstrate that our system reduces task completion time by up to 57% compared with keyboard panels and teach pendants. Overall, our proposed framework demonstrates a practical pathway toward more natural and efficient embodied HRI.
