MimicTouch: Leveraging Multi-modal Human Tactile Demonstrations for Contact-rich Manipulation
Kelin Yu, Yunhai Han, Qixian Wang, Vaibhav Saxena, Danfei Xu, Ye Zhao
TL;DR
MimicTouch tackles contact-rich manipulation by learning tactile-guided policies from demonstrations performed with human hands, eliminating the sensing-collection gap created by vision-driven teleoperation. It combines four components: collecting multi-modal tactile demonstrations, self-supervised tactile-audio representation learning, non-parametric imitation to derive an offline policy, and online residual RL to bridge the human-robot embodiment gap. Empirical results show superior data collection throughput, better offline policy performance than teleoperation-based baselines, and dramatic online improvements with strong zero-shot generalization across diverse insertion and assembly tasks. The approach enables efficient, tactile-centric policy learning with practical implications for real-world manipulation under occlusion and cluttered environments.
Abstract
Tactile sensing is critical to fine-grained, contact-rich manipulation tasks, such as insertion and assembly. Prior research has shown the possibility of learning tactile-guided policy from teleoperated demonstration data. However, to provide the demonstration, human users often rely on visual feedback to control the robot. This creates a gap between the sensing modality used for controlling the robot (visual) and the modality of interest (tactile). To bridge this gap, we introduce "MimicTouch", a novel framework for learning policies directly from demonstrations provided by human users with their hands. The key innovations are i) a human tactile data collection system which collects multi-modal tactile dataset for learning human's tactile-guided control strategy, ii) an imitation learning-based framework for learning human's tactile-guided control strategy through such data, and iii) an online residual RL framework to bridge the embodiment gap between the human hand and the robot gripper. Through comprehensive experiments, we highlight the efficacy of utilizing human's tactile-guided control strategy to resolve contact-rich manipulation tasks. The project website is at https://sites.google.com/view/MimicTouch.
