Vision-Based Autonomous UAV Navigation and Landing for Urban Search and Rescue
Mayank Mittal, Rohit Mohan, Wolfram Burgard, Abhinav Valada
TL;DR
This work tackles autonomous UAV navigation and landing for urban search-and-rescue in unknown post-disaster environments where pre-existing maps are unreliable. It introduces a pixel-level landing-site detection pipeline that fuses hazard costmaps for depth confidence ($J_{DE}$), flatness ($J_{FL}$), steepness ($J_N$), and energy ($J_{EC}$) into a final score $J = c_1 J_{DE} + c_2 J_{FL} + c_3 J_N + c_4 J_{EC}$ with $c_i \in [0,1]$ and $\sum c_i = 1$, followed by global clustering to produce sparse, safe landing regions; a minimum-jerk trajectory ensures smooth landings. The system integrates state estimation (ROVIO+EKF), online mapping with Octomap and Voxblox TSDF, and a next-best-view planner to explore unknown debris-filled environments, all running on a low-power onboard computer. The authors also introduce AutoLand, a first-of-its-kind synthetic dataset with over $1.2$ million RGB images plus groundtruth depth, normals, semantics, and camera poses, and validate the approach in both hyperrealistic simulation and real-world collapsed buildings, showing robust landing-site detection and practical, time-critical performance for USAR operations.
Abstract
Unmanned Aerial Vehicles (UAVs) equipped with bioradars are a life-saving technology that can enable identification of survivors under collapsed buildings in the aftermath of natural disasters such as earthquakes or gas explosions. However, these UAVs have to be able to autonomously navigate in disaster struck environments and land on debris piles in order to accurately locate the survivors. This problem is extremely challenging as pre-existing maps cannot be leveraged for navigation due to structural changes that may have occurred. Furthermore, existing landing site detection algorithms are not suitable to identify safe landing regions on debris piles. In this work, we present a computationally efficient system for autonomous UAV navigation and landing that does not require any prior knowledge about the environment. We propose a novel landing site detection algorithm that computes costmaps based on several hazard factors including terrain flatness, steepness, depth accuracy, and energy consumption information. We also introduce a first-of-a-kind synthetic dataset of over 1.2 million images of collapsed buildings with groundtruth depth, surface normals, semantics and camera pose information. We demonstrate the efficacy of our system using experiments from a city scale hyperrealistic simulation environment and in real-world scenarios with collapsed buildings.
