ARMADA: Augmented Reality for Robot Manipulation and Robot-Free Data Acquisition
Nataliya Nechyporenko, Ryan Hoque, Christopher Webb, Mouli Sivapurapu, Jian Zhang
TL;DR
This work addresses the data bottleneck in imitation learning by enabling robot-compatible demonstrations without a physical robot, using ARMADA: an Apple Vision Pro–based AR system that overlays a real-time robot digital twin and provides live feedback. The approach maps user hand poses to robot IK within a Drake-based solver, streaming data at $30\ \,\mathrm{Hz}$ and replaying trajectories on hardware. In a study with 15 participants across 3 tasks and 3 feedback conditions, live AR feedback raised average replayability from $1.3\%$ to $71.1\%$, with post-feedback still lagging behind live visualization. This suggests scalable data collection for IL is feasible without robot teleoperation, with future work toward learning control policies from such data and extending IK retargeting for more complex tasks.
Abstract
Teleoperation for robot imitation learning is bottlenecked by hardware availability. Can high-quality robot data be collected without a physical robot? We present a system for augmenting Apple Vision Pro with real-time virtual robot feedback. By providing users with an intuitive understanding of how their actions translate to robot motions, we enable the collection of natural barehanded human data that is compatible with the limitations of physical robot hardware. We conducted a user study with 15 participants demonstrating 3 different tasks each under 3 different feedback conditions and directly replayed the collected trajectories on physical robot hardware. Results suggest live robot feedback dramatically improves the quality of the collected data, suggesting a new avenue for scalable human data collection without access to robot hardware. Videos and more are available at https://nataliya.dev/armada.
