Inferring Belief States in Partially-Observable Human-Robot Teams
Jack Kolb, Karen M. Feigh
TL;DR
This work tackles real-time estimation of human situation awareness in partially observable human-robot teams by modeling three belief states: $\beta^{true}$, $\beta^{robot}$, and $\beta^{pred}$. It compares a handcrafted logical predicates approach with a Large Language Model in an Overcooked-AI domain extended for partial observability, leveraging SAGAT-based ground-truth human responses. Results show the logical predicates baseline remains robust across visibility levels, while the LLM can match or exceed it in zero-shot settings, suggesting both approaches are viable for predicting user beliefs in dynamic HRI tasks. The study provides a public dataset, demo, and code to advance theory-of-m mind and decision-support in human-robot teaming under partial observability.
Abstract
We investigate the real-time estimation of human situation awareness using observations from a robot teammate with limited visibility. In human factors and human-autonomy teaming, it is recognized that individuals navigate their environments using an internal mental simulation, or mental model. The mental model informs cognitive processes including situation awareness, contextual reasoning, and task planning. In teaming domains, the mental model includes a team model of each teammate's beliefs and capabilities, enabling fluent teamwork without the need for explicit communication. However, little work has applied team models to human-robot teaming. In this work we compare the performance of two models, logical predicates and large language models, at estimating user situation awareness over varying visibility conditions. Our results indicate that the methods are largely resilient to low-visibility conditions in our domain, however opportunities exist to improve their overall performance.
