Table of Contents
Fetching ...

Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People

Dharmateja Adapa, Virendra Singh Shekhawat, Avinash Gautam, Sudeept Mohan

TL;DR

This work proposes a Visual Assist (VA) system for the indoor navigation of BVI individuals using visual Fiducial markers for localization using a Pan-Tilt turret-mounted camera which enhances the field of view to 360{\deg} for enhanced marker tracking.

Abstract

Large indoor spaces have complex layouts making them difficult to navigate. Indoor spaces in hospitals, universities, shopping complexes, etc., carry multi-modal information in the form of text and symbols. Hence, it is difficult for Blind and Visually Impaired (BVI) people to independently navigate such spaces. Indoor environments are usually GPS-denied; therefore, Bluetooth-based, WiFi-based, or Range-based methods are used for localization. These methods have high setup costs, lesser accuracy, and sometimes need special sensing equipment. We propose a Visual Assist (VA) system for the indoor navigation of BVI individuals using visual Fiducial markers for localization. State-of-the-art (SOTA) approaches for visual localization using Fiducial markers use fixed cameras having a narrow field of view. These approaches stop tracking the markers when they are out of sight. We employ a Pan-Tilt turret-mounted camera which enhances the field of view to 360° for enhanced marker tracking. We, therefore, need fewer markers for mapping and navigation. The efficacy of the proposed VA system is measured on three metrics, i.e., RMSE (Root Mean Square Error), ADNN (Average Distance to Nearest Neighbours), and ATE (Absolute Trajectory Error). Our system outperforms Hector-SLAM, ORB-SLAM3, and UcoSLAM. The proposed system achieves localization accuracy within $\pm8cm$ compared to $\pm12cm$ and $\pm10cm$ for ORB-SLAM3 and UcoSLAM, respectively.

Autonomous Mapping and Navigation using Fiducial Markers and Pan-Tilt Camera for Assisting Indoor Mobility of Blind and Visually Impaired People

TL;DR

This work proposes a Visual Assist (VA) system for the indoor navigation of BVI individuals using visual Fiducial markers for localization using a Pan-Tilt turret-mounted camera which enhances the field of view to 360{\deg} for enhanced marker tracking.

Abstract

Large indoor spaces have complex layouts making them difficult to navigate. Indoor spaces in hospitals, universities, shopping complexes, etc., carry multi-modal information in the form of text and symbols. Hence, it is difficult for Blind and Visually Impaired (BVI) people to independently navigate such spaces. Indoor environments are usually GPS-denied; therefore, Bluetooth-based, WiFi-based, or Range-based methods are used for localization. These methods have high setup costs, lesser accuracy, and sometimes need special sensing equipment. We propose a Visual Assist (VA) system for the indoor navigation of BVI individuals using visual Fiducial markers for localization. State-of-the-art (SOTA) approaches for visual localization using Fiducial markers use fixed cameras having a narrow field of view. These approaches stop tracking the markers when they are out of sight. We employ a Pan-Tilt turret-mounted camera which enhances the field of view to 360° for enhanced marker tracking. We, therefore, need fewer markers for mapping and navigation. The efficacy of the proposed VA system is measured on three metrics, i.e., RMSE (Root Mean Square Error), ADNN (Average Distance to Nearest Neighbours), and ATE (Absolute Trajectory Error). Our system outperforms Hector-SLAM, ORB-SLAM3, and UcoSLAM. The proposed system achieves localization accuracy within compared to and for ORB-SLAM3 and UcoSLAM, respectively.
Paper Structure (41 sections, 7 equations, 24 figures, 4 tables)

This paper contains 41 sections, 7 equations, 24 figures, 4 tables.

Figures (24)

  • Figure 1: Cuboid with the marker in different orientations on 4 sides. Green, red, and blue arrows represent the x,y, and z axis, respectively
  • Figure 2: Conceptual diagram of the proposed VA System for BVI people. Green-colored blocks indicate the components described in this paper. Black arrows indicate control flow, red arrows indicate continuous data flow, and green arrows indicate intermittent data flow. $P_{ROB}$ conveys the pose of the robot derived from the PT camera module. $M$ represents the compact map representation. $F$ represents the features identified in the environment during mapping. $F_{M}$ indicates the detected features embedded in the map $M$
  • Figure 3: High-level block diagram of the mapping system. $P_{ROB}$ conveys the pose of the robot derived from the PT camera module. $M_{O}$ represents the occupancy grid map representation. $M$ represents the compact map representation. $M_{R}$ indicates the marker positions derived after the reduced marker placement. $F$ represents the features identified in the environment during mapping
  • Figure 4: Mobile robot configuration for environment mapping (left) and transform tree (right)
  • Figure 6: Voronoi partitioning applied on corridor environment
  • ...and 19 more figures