Fully Onboard Low-Power Localization with Semantic Sensor Fusion on a Nano-UAV using Floor Plans
Nicky Zimmerman, Hanna Müller, Michele Magno, Luca Benini
TL;DR
This work tackles indoor localization for GPS-denied nano-UAVs under severe power and compute constraints by fusing geometric information from miniaturized ToF sensors with semantic cues obtained from an onboard object detector. The authors deploy a fully onboard, memory-efficient semantic map and Monte Carlo Localization on a GAP9 multi-core RISC-V processor, achieving real-time performance without external infrastructure. Key contributions include a quantized YOLOv5p object detector running at 20 fps with minimal memory and energy overhead, a compact 16-bit semantic map format, and a novel fusion model that improves global localization in floor-plan environments. The method demonstrates 90% localization success with an average error of 0.32 m in real-world office settings and is released as open source, enabling broader adoption for autonomous nano-UAVs.
Abstract
Nano-sized unmanned aerial vehicles (UAVs) are well-fit for indoor applications and for close proximity to humans. To enable autonomy, the nano-UAV must be able to self-localize in its operating environment. This is a particularly-challenging task due to the limited sensing and compute resources on board. This work presents an online and onboard approach for localization in floor plans annotated with semantic information. Unlike sensor-based maps, floor plans are readily-available, and do not increase the cost and time of deployment. To overcome the difficulty of localizing in sparse maps, the proposed approach fuses geometric information from miniaturized time-of-flight sensors and semantic cues. The semantic information is extracted from images by deploying a state-of-the-art object detection model on a high-performance multi-core microcontroller onboard the drone, consuming only 2.5mJ per frame and executing in 38ms. In our evaluation, we globally localize in a real-world office environment, achieving 90% success rate. We also release an open-source implementation of our work.
