NEUSIS: A Compositional Neuro-Symbolic Framework for Autonomous Perception, Reasoning, and Planning in Complex UAV Search Missions
Zhixi Cai, Cristian Rojas Cardenas, Kevin Leo, Chenyuan Zhang, Kal Backman, Hanbing Li, Boying Li, Mahsa Ghorbanali, Stavya Datta, Lizhen Qu, Julian Gutierrez Santiago, Alexey Ignatiev, Yuan-Fang Li, Mor Vered, Peter J Stuckey, Maria Garcia de la Banda, Hamid Rezatofighi
TL;DR
NEUSIS addresses autonomous UAV search missions by locating EOIs under time constraints in hazard-prone environments. It introduces a compositional neuro-symbolic pipeline (GRiD for $3$D perception-grounding-reasoning, a Probabilistic World Model, and SNaC for hierarchical planning) that maintains a persistent belief about the environment. Experiments in HAMERITT AirSim simulations show NEUSIS outperforms a state-of-the-art vision-language baseline and a state-of-the-art planning baseline in success rate, navigation efficiency, and 3D localization. The work demonstrates the value of explicit visual reasoning, probabilistic world modeling, and multi-level planning for robust UAV search under KOZs and time limits, with practical potential for search-and-rescue and related hazardous-domain applications.
Abstract
This paper addresses the problem of autonomous UAV search missions, where a UAV must locate specific Entities of Interest (EOIs) within a time limit, based on brief descriptions in large, hazard-prone environments with keep-out zones. The UAV must perceive, reason, and make decisions with limited and uncertain information. We propose NEUSIS, a compositional neuro-symbolic system designed for interpretable UAV search and navigation in realistic scenarios. NEUSIS integrates neuro-symbolic visual perception, reasoning, and grounding (GRiD) to process raw sensory inputs, maintains a probabilistic world model for environment representation, and uses a hierarchical planning component (SNaC) for efficient path planning. Experimental results from simulated urban search missions using AirSim and Unreal Engine show that NEUSIS outperforms a state-of-the-art (SOTA) vision-language model and a SOTA search planning model in success rate, search efficiency, and 3D localization. These results demonstrate the effectiveness of our compositional neuro-symbolic approach in handling complex, real-world scenarios, making it a promising solution for autonomous UAV systems in search missions.
