IG-MCTS: Human-in-the-Loop Cooperative Navigation under Incomplete Information
Shenghui Chen, Ruihan Zhao, Sandeep Chinchali, Ufuk Topcu
TL;DR
This work addresses human–robot cooperative navigation under incomplete information by introducing CoNav-Maze and a planning framework called IG-MCTS, guided by a Neural Human Perception Model (NHPM). IG-MCTS balances task progress with informative communication by treating information sharing as an actionable decision, using a learned NHPM to predict human belief updates and an augmentation of the reward with an information term. Empirical results show IG-MCTS reduces communication by up to two orders of magnitude and lowers cognitive load (eye-tracking metrics) while maintaining competitive task performance, with a scalable continuous-space extension via Voronoi-graph planning. The approach demonstrates strong potential for scalable, low-bandwidth human–robot collaboration in real-world navigation, rescue, and exploration tasks.
Abstract
Human-robot cooperative navigation is challenging under incomplete information. We introduce CoNav-Maze, a simulated environment where a robot navigates with local perception while a human operator provides guidance based on an inaccurate map. The robot can share its onboard camera views to help the operator refine their understanding of the environment. To enable efficient cooperation, we propose Information Gain Monte Carlo Tree Search (IG-MCTS), an online planning algorithm that jointly optimizes autonomous movement and informative communication. IG-MCTS leverages a learned Neural Human Perception Model (NHPM) -- trained on a crowdsourced mapping dataset -- to predict how the human's internal map evolves as new observations are shared. User studies show that IG-MCTS significantly reduces communication demands and yields eye-tracking metrics indicative of lower cognitive load, while maintaining task performance comparable to teleoperation and instruction-following baselines. Finally, we illustrate generalization beyond discrete mazes through a continuous-space waterway navigation setting, in which NHPM benefits from deeper encoder-decoder architectures and IG-MCTS leverages a dynamically constructed Voronoi-partitioned traversability graph.
