Leveraging Haptic Feedback to Improve Data Quality and Quantity for Deep Imitation Learning Models
Catie Cuan, Allison Okamura, Mohi Khansari
TL;DR
The paper investigates whether real-time haptic feedback to human demonstrators during teleoperation improves data quality and quantity for deep visual imitation learning and whether these improvements translate to better autonomous policies. Using a two-phase study on latch-door opening, it shows that haptic feedback increases data throughput and the proportion of high-quality demonstrations, and that policies trained on haptic data achieve an overall 11% performance gain on real doors, with larger gains on more challenging left-swing tasks. The approach combines real-world data collection with phase-appropriate data curation, a ResNet-18 visual encoder, and simulation-to-real evaluation via RetinaGAN, without altering the model architecture. The findings highlight the practical value of low-cost haptic augmentation for improving imitation-learning pipelines in real-world robotic manipulation tasks.
Abstract
Learning from demonstration is a proven technique to teach robots new skills. Data quality and quantity play a critical role in the performance of models trained using data collected from human demonstrations. In this paper we enhance an existing teleoperation data collection system with real-time haptic feedback to the human demonstrators; we observe improvements in the collected data throughput and in the performance of autonomous policies using models trained with the data. Our experimental testbed was a mobile manipulator robot that opened doors with latch handles. Evaluation of teleoperated data collection on eight real conference room doors found that adding haptic feedback improved data throughput by 6%. We additionally used the collected data to train six image-based deep imitation learning models, three with haptic feedback and three without it. These models were used to implement autonomous door-opening with the same type of robot used during data collection. A policy from a imitation learning model trained with data collected while the human demonstrators received haptic feedback performed on average 11% better than its counterpart trained with data collected without haptic feedback, indicating that haptic feedback provided during data collection resulted in improved autonomous policies.
