Ubiquitous Robot Control Through Multimodal Motion Capture Using Smartwatch and Smartphone Data
Fabian C Weigend, Neelesh Kumar, Oya Aran, Heni Ben Amor
TL;DR
The paper addresses the challenge of enabling robot control in ubiquitous settings using consumer wearables by introducing WearMoCap, an open-source library with three multimodal pose-estimation modes. The approach combines $\mathbf{q}_{la}$ and $\mathbf{q}_{ua}$ estimates from LSTM-based pipelines (Watch Only and Upper Arm) and a Differentiable Ensemble Kalman Filter leveraging $\mathbf{q}_{hi}$ for the Pocket mode, with forward kinematics to produce end-effector positions. Real-robot validation on UR5 across Handover and Intervention tasks shows sub-$2\,\mathrm{cm}$ accuracy for the Upper Arm mode and clear trade-offs among modes, highlighting the balance between precision and ubiquity. The work demonstrates practical, open-source multimodal motion capture for robot teleoperation in everyday environments and informs mode selection for specific tasks, with the repository available at $www.github.com/wearable-motion-capture$.
Abstract
We present an open-source library for seamless robot control through motion capture using smartphones and smartwatches. Our library features three modes: Watch Only Mode, enabling control with a single smartwatch; Upper Arm Mode, offering heightened accuracy by incorporating the smartphone attached to the upper arm; and Pocket Mode, determining body orientation via the smartphone placed in any pocket. These modes are applied in two real-robot tasks, showcasing placement accuracy within 2 cm compared to a gold-standard motion capture system. WearMoCap stands as a suitable alternative to conventional motion capture systems, particularly in environments where ubiquity is essential. The library is available at: www.github.com/wearable-motion-capture.
