5G Virtual Reality Manipulator Teleoperation using a Mobile Phone
Alexander Werner, William Melek
TL;DR
This work introduces a portable teleoperation pipeline that uses a consumer mobile phone as the leader device to drive a robot manipulator in $SE(3)$ by fusing the phone's IMU and camera data. It delivers immersive, multi-view visual feedback via VR-rendered point clouds and conveys interaction forces through the phone's haptic actuator, all over wireless networks such as 5G or Wi‑Fi. The system combines visual-inertial pose estimation, a compliant Cartesian impedance controller, and a momentum-based contact observer, with careful handling of bandwidth and latency through WebRTC and dedicated codecs. The results show the approach is viable for temporary or remote manipulation tasks, with 5G offering more consistent control than Wi‑Fi and visual feedback identified as a key bottleneck to further improvement.
Abstract
This paper presents an approach to teleoperate a manipulator using a mobile phone as a leader device. Using its IMU and camera, the phone estimates its Cartesian pose which is then used to to control the Cartesian pose of the robot's tool. The user receives visual feedback in the form of multi-view video - a point cloud rendered in a virtual reality environment. This enables the user to observe the scene from any position. To increase immersion, the robot's estimate of external forces is relayed using the phone's haptic actuator. Leader and follower are connected through wireless networks such as 5G or Wi-Fi. The paper describes the setup and analyzes its performance.
