GazeGrasp: DNN-Driven Robotic Grasping with Wearable Eye-Gaze Interface
Issatay Tokmurziyev, Miguel Altamirano Cabrera, Luis Moreno, Muhammad Haris Khan, Dzmitry Tsetserukou
TL;DR
GazeGrasp addresses the need for hands-free control of collaborative robots by enabling object manipulation via gaze alone. It integrates an ESP32-CAM eye tracker, MediaPipe gaze estimation, and YOLOv8 object detection with a UR10 arm, employing polynomial calibration with $d=3$, Kalman filtering for smooth gaze, and a magnetic-snapping mechanism to enhance precision. The system maps gaze-driven selections to real-world robot coordinates through camera-to-base transformations and allows picking and placing with 3-second gaze commands, demonstrated on a range of objects. Experimental results from 13 participants show a significant 31% reduction in gaze-alignment time when magnetic snapping is enabled, underscoring the approach's potential to elevate accessibility and autonomy in assistive robotics; the work lays a foundation for broader adoption in rehabilitation, healthcare, and industrial automation, with future work focusing on long-term usability and obstacle-rich environments.
Abstract
We present GazeGrasp, a gaze-based manipulation system enabling individuals with motor impairments to control collaborative robots using eye-gaze. The system employs an ESP32 CAM for eye tracking, MediaPipe for gaze detection, and YOLOv8 for object localization, integrated with a Universal Robot UR10 for manipulation tasks. After user-specific calibration, the system allows intuitive object selection with a magnetic snapping effect and robot control via eye gestures. Experimental evaluation involving 13 participants demonstrated that the magnetic snapping effect significantly reduced gaze alignment time, improving task efficiency by 31%. GazeGrasp provides a robust, hands-free interface for assistive robotics, enhancing accessibility and autonomy for users.
