A High-Fidelity Robotic Manipulator Teleoperation Framework for Human-Centered Augmented Reality Evaluation
Harsh Chhajed, Tian Guo
TL;DR
ARBot tackles the reproducibility gap in AR evaluation by providing a deterministic physical proxy that captures natural human hand motion via multimodal interfaces (ARPose mobile app and CV+IMU) and replay it on robotic manipulators through a proactively-safe control stack. The core methods combine a Newton-Raphson-based IK solver with a Convex Quadratic Programming safety filter and a position-output integration to ensure sub-centimeter accuracy and low latency ($\approx$ $5$–$20$ mm and $\approx$ $20$ ms for ARPose, up to $\sim$90 ms for CV+IMU). The paper demonstrates through an IRB-approved study (N=11) and a 132-trajectory dataset that ARBot achieves high fidelity and repeatability, outperforming human motion variability and enabling standardized benchmarking for AR tracking and interaction. The work contributes an open-source end-to-end platform, a rich dataset, and methodological groundwork for reproducible, human-centered AR evaluation with practical impact on AR system development and benchmarking.
Abstract
Validating Augmented Reality (AR) tracking and interaction models requires precise, repeatable ground-truth motion. However, human users cannot reliably perform consistent motion due to biomechanical variability. Robotic manipulators are promising to act as human motion proxies if they can mimic human movements. In this work, we design and implement ARBot, a real-time teleoperation platform that can effectively capture natural human motion and accurately replay the movements via robotic manipulators. ARBot includes two capture models: stable wrist motion capture via a custom CV and IMU pipeline, and natural 6-DOF control via a mobile application. We design a proactively-safe QP controller to ensure smooth, jitter-free execution of the robotic manipulator, enabling it to function as a high-fidelity record and replay physical proxy. We open-source ARBot and release a benchmark dataset of 132 human and synthetic trajectories captured using ARBot to support controllable and scalable AR evaluation.
