Partially Observable Task and Motion Planning with Uncertainty and Risk Awareness
Aidan Curtis, George Matheos, Nishad Gothoskar, Vikash Mansinghka, Joshua Tenenbaum, Tomás Lozano-Pérez, Leslie Pack Kaelbling
TL;DR
TAMPURA addresses long-horizon robotic planning under partial observability and outcome uncertainty by combining belief-space task planning with a learning-driven, sparse abstract MDP. It builds on belief-state MDPs and belief-state controller MDPs, introduces belief state propositions and an abstract belief space, and grounds operators with uncertain effects via operator schemata. The core innovation is Bayes-optimistic model learning that guides deterministic planning to efficiently explore task-relevant transitions and then refines to a probabilistic sparse MDP solvable with LAO*. This approach shows superior performance on simulated long-horizon tasks and robust real-world demonstrations, enabling information gathering and safe operation under uncertainty.
Abstract
Integrated task and motion planning (TAMP) has proven to be a valuable approach to generalizable long-horizon robotic manipulation and navigation problems. However, the typical TAMP problem formulation assumes full observability and deterministic action effects. These assumptions limit the ability of the planner to gather information and make decisions that are risk-aware. We propose a strategy for TAMP with Uncertainty and Risk Awareness (TAMPURA) that is capable of efficiently solving long-horizon planning problems with initial-state and action outcome uncertainty, including problems that require information gathering and avoiding undesirable and irreversible outcomes. Our planner reasons under uncertainty at both the abstract task level and continuous controller level. Given a set of closed-loop goal-conditioned controllers operating in the primitive action space and a description of their preconditions and potential capabilities, we learn a high-level abstraction that can be solved efficiently and then refined to continuous actions for execution. We demonstrate our approach on several robotics problems where uncertainty is a crucial factor and show that reasoning under uncertainty in these problems outperforms previously proposed determinized planning, direct search, and reinforcement learning strategies. Lastly, we demonstrate our planner on two real-world robotics problems using recent advancements in probabilistic perception.
