Floor extraction and door detection for visually impaired guidance
Bruno Berenguel-Baeta, Manuel Guerrero-Viu, Alejandro de Nova, Jesus Bermudez-Cameo, Alejandro Perez-Yus, Jose J. Guerrero
TL;DR
This work addresses safe navigation in unknown environments for visually impaired users by a wearable perception system that fuses an RGB-D camera with a wide-field fish-eye camera. It introduces a two-stage floor recovery (depth-based floor plane extraction followed by color-based floor expansion) and a door-detection pipeline that uses line hypotheses and Cross-Ratio verification within the fused data, with extrinsic calibration between the cameras via Gauss-Newton optimization and Scaramuzza's omnidirectional model. The key contributions include a floor area expansion up to about $20\times$ the RGB-D field of view and a conservative door detector that minimizes false positives, validated across indoor and outdoor scenes. The results demonstrate robust real-time performance and potential for guiding users through safe trajectories, highlighting practical impact for visually impaired mobility and autonomous navigation.
Abstract
Finding obstacle-free paths in unknown environments is a big navigation issue for visually impaired people and autonomous robots. Previous works focus on obstacle avoidance, however they do not have a general view of the environment they are moving in. New devices based on computer vision systems can help impaired people to overcome the difficulties of navigating in unknown environments in safe conditions. In this work it is proposed a combination of sensors and algorithms that can lead to the building of a navigation system for visually impaired people. Based on traditional systems that use RGB-D cameras for obstacle avoidance, it is included and combined the information of a fish-eye camera, which will give a better understanding of the user's surroundings. The combination gives robustness and reliability to the system as well as a wide field of view that allows to obtain many information from the environment. This combination of sensors is inspired by human vision where the center of the retina (fovea) provides more accurate information than the periphery, where humans have a wider field of view. The proposed system is mounted on a wearable device that provides the obstacle-free zones of the scene, allowing the planning of trajectories for people guidance.
