DexSkills: Skill Segmentation Using Haptic Data for Learning Autonomous Long-Horizon Robotic Manipulation Tasks
Xiaofeng Mao, Gabriele Giudici, Claudio Coppola, Kaspar Althoefer, Ildar Farkhatdinov, Zhibin Li, Lorenzo Jamone
TL;DR
DexSkills tackles the challenge of learning long-horizon dexterous manipulation by decomposing tasks into a reusable library of primitive skills learned from human demonstrations using only proprioceptive and tactile data. It combines learning features, primitive skill policies, and supervised representation learning with an auto-regressive autoencoder and a label decoder to segment unseen demonstrations into skill sequences and execute them autonomously. The method defines 20 primitive skills and demonstrates autonomous execution of long-horizon tasks with high segmentation accuracy, supported by ablations and a public teleoperation dataset. The work advances data-efficient, hardware-centric imitation learning for real-world dexterous manipulation and provides a practical benchmark for tactile-haptic segmentation and one-shot task composition.
Abstract
Effective execution of long-horizon tasks with dexterous robotic hands remains a significant challenge in real-world problems. While learning from human demonstrations have shown encouraging results, they require extensive data collection for training. Hence, decomposing long-horizon tasks into reusable primitive skills is a more efficient approach. To achieve so, we developed DexSkills, a novel supervised learning framework that addresses long-horizon dexterous manipulation tasks using primitive skills. DexSkills is trained to recognize and replicate a select set of skills using human demonstration data, which can then segment a demonstrated long-horizon dexterous manipulation task into a sequence of primitive skills to achieve one-shot execution by the robot directly. Significantly, DexSkills operates solely on proprioceptive and tactile data, i.e., haptic data. Our real-world robotic experiments show that DexSkills can accurately segment skills, thereby enabling autonomous robot execution of a diverse range of tasks.
