Probabilistically Informed Robot Object Search with Multiple Regions
Matthew Collins, Jared J. Beard, Nicholas Ohi, Yu Gu
TL;DR
We address autonomous search under uncertainty in large, unstructured environments by formulating the problem as a belief MDP with options (BMDP-O) to enable Monte Carlo Tree Search to scale via multi-step region moves and configurable sensor fields of view. The approach includes a lite MDP-O variant that avoids belief updates for real-time operation, and a segmentation of the map into ROI to focus planning on high-probability regions. Experimental results in a 200×200 environment show that ROI-enabled PUCT planners outperform baselines, with Regions Lite offering substantial compute reductions at the cost of more search steps, and multi-cell FOVs improving efficiency overall. The method is extendable to arbitrary FOVs and sensor types, enabling scalable, real-time search in hazardous settings and providing a pathway toward more adaptive, sensor-agnostic robotic search systems.
Abstract
The increasing use of autonomous robot systems in hazardous environments underscores the need for efficient search and rescue operations. Despite significant advancements, existing literature on object search often falls short in overcoming the difficulty of long planning horizons and dealing with sensor limitations, such as noise. This study introduces a novel approach that formulates the search problem as a belief Markov decision processes with options (BMDP-O) to make Monte Carlo tree search (MCTS) a viable tool for overcoming these challenges in large scale environments. The proposed formulation incorporates sequences of actions (options) to move between regions of interest, enabling the algorithm to efficiently scale to large environments. This approach also enables the use of customizable fields of view, for use with multiple types of sensors. Experimental results demonstrate the superiority of this approach in large environments when compared to the problem without options and alternative tools such as receding horizon planners. Given compute time for the proposed formulation is relatively high, a further approximated "lite" formulation is proposed. The lite formulation finds objects in a comparable number of steps with faster computation.
