Immersive Teleoperation Framework for Locomanipulation Tasks
Takuya Boehringer, Jonathan Embley-Riches, Karim Hammoud, Valerio Modugno, Dimitrios Kanoulas
TL;DR
This work tackles precise loco-manipulation in remote or hazardous environments by integrating immersive VR with Gaussian splatting to build a photorealistic, occlusion-aware representation of the scene for teleoperation. The authors propose a two-stage framework: first navigate via a conventional base-control interface, then perform manipulation in VR using a Gaussian splat of the environment aligned to the robot, enabling intuitive control and improved spatial awareness. A user study demonstrates substantial usability and efficiency gains over a baseline joystick-camera setup, with most participants preferring the VR interface. Real-world experiments across two tasks validate the framework's versatility, highlighting its potential to enhance precision manipulation in cluttered or occluded scenarios.
Abstract
Recent advancements in robotic loco-manipulation have leveraged Virtual Reality (VR) to enhance the precision and immersiveness of teleoperation systems, significantly outperforming traditional methods reliant on 2D camera feeds and joystick controls. Despite these advancements, challenges remain, particularly concerning user experience across different setups. This paper introduces a novel VR-based teleoperation framework designed for a robotic manipulator integrated onto a mobile platform. Central to our approach is the application of Gaussian splatting, a technique that abstracts the manipulable scene into a VR environment, thereby enabling more intuitive and immersive interactions. Users can navigate and manipulate within the virtual scene as if interacting with a real robot, enhancing both the engagement and efficacy of teleoperation tasks. An extensive user study validates our approach, demonstrating significant usability and efficiency improvements. Two-thirds (66%) of participants completed tasks faster, achieving an average time reduction of 43%. Additionally, 93% preferred the Gaussian Splat interface overall, with unanimous (100%) recommendations for future use, highlighting improvements in precision, responsiveness, and situational awareness. Finally, we demonstrate the effectiveness of our framework through real-world experiments in two distinct application scenarios, showcasing the practical capabilities and versatility of the Splat-based VR interface.
