A Vision-Based Shared-Control Teleoperation Scheme for Controlling the Robotic Arm of a Four-Legged Robot

Murilo Vinicius da Silva; Matheus Hipolito Carvalho; Juliano Negri; Thiago Segreto; Gustavo J. G. Lahr; Ricardo V. Godoy; Marcelo Becker

A Vision-Based Shared-Control Teleoperation Scheme for Controlling the Robotic Arm of a Four-Legged Robot

Murilo Vinicius da Silva, Matheus Hipolito Carvalho, Juliano Negri, Thiago Segreto, Gustavo J. G. Lahr, Ricardo V. Godoy, Marcelo Becker

TL;DR

This work tackles the challenge of teleoperating a quadruped robot with a manipulation arm in hazardous environments by introducing a vision-based, shared-control interface that maps the operator's wrist pose and hand orientation to the robot end-effector. It combines depth-camera perception, ArUco-based calibration, MediaPipe tracking, and gesture recognition to enable manual and semi-autonomous grasping modes, while a collision-aware planner ensures safe operation. The approach is implemented on a Boston Dynamics Spot platform and validated through both simulation and real-time experiments, achieving a mean wrist-to-end-effector error of approximately 0.07 m and successful pick-and-place tasks by two users. The results suggest a practical, low-cost alternative for high-risk industrial applications, with future work focusing on path planning and enhanced shared-control strategies.

Abstract

In hazardous and remote environments, robotic systems perform critical tasks demanding improved safety and efficiency. Among these, quadruped robots with manipulator arms offer mobility and versatility for complex operations. However, teleoperating quadruped robots is challenging due to the lack of integrated obstacle detection and intuitive control methods for the robotic arm, increasing collision risks in confined or dynamically changing workspaces. Teleoperation via joysticks or pads can be non-intuitive and demands a high level of expertise due to its complexity, culminating in a high cognitive load on the operator. To address this challenge, a teleoperation approach that directly maps human arm movements to the robotic manipulator offers a simpler and more accessible solution. This work proposes an intuitive remote control by leveraging a vision-based pose estimation pipeline that utilizes an external camera with a machine learning-based model to detect the operator's wrist position. The system maps these wrist movements into robotic arm commands to control the robot's arm in real-time. A trajectory planner ensures safe teleoperation by detecting and preventing collisions with both obstacles and the robotic arm itself. The system was validated on the real robot, demonstrating robust performance in real-time control. This teleoperation approach provides a cost-effective solution for industrial applications where safety, precision, and ease of use are paramount, ensuring reliable and intuitive robotic control in high-risk environments.

A Vision-Based Shared-Control Teleoperation Scheme for Controlling the Robotic Arm of a Four-Legged Robot

TL;DR

Abstract

A Vision-Based Shared-Control Teleoperation Scheme for Controlling the Robotic Arm of a Four-Legged Robot

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (6)