A POMDP-based hierarchical planning framework for manipulation under pose uncertainty
Muhammad Suhail Saleem, Rishi Veerapaneni, Maxim Likhachev
TL;DR
The paper tackles manipulation under large pose uncertainty in unstructured environments by using binary contact signals within a POMDP. It introduces a hierarchical belief representation that first reasons in a 3D volumetric space to shrink the potentially occupied region, then transitions to a refined particle-space view for final pose estimation, all solved by a closed-loop, anytime planner with a 1-second budget. Key contributions include the two-phase hierarchical belief framework, a modified RTDP-Bel based solver that focuses on the most probable outcomes, and the demonstration of real-time, high-precision plug insertion with substantial improvements over greedy baselines (97%+ success in simulation and 93% real-world in the reported tasks). The work enables robust, real-time manipulation under large pose uncertainties in domestic-style settings, offering a scalable approach to contact-based localization when visual feedback is limited.
Abstract
Robots often face challenges in domestic environments where visual feedback is ineffective, such as retrieving objects obstructed by occlusions or finding a light switch in the dark. In these cases, utilizing contacts to localize the target object can be effective. We propose an online planning framework using binary contact signals for manipulation tasks with pose uncertainty, formulated as a Partially Observable Markov Decision Process (POMDP). Naively representing the belief as a particle set makes planning infeasible due to the large uncertainties in domestic settings, as identifying the best sequence of actions requires rolling out thousands of actions across millions of particles, taking significant compute time. To address this, we propose a hierarchical belief representation. Initially, we represent the uncertainty coarsely in a 3D volumetric space. Policies that refine uncertainty in this space are computed and executed, and once uncertainty is sufficiently reduced, the problem is translated back into the particle space for further refinement before task completion. We utilize a closed-loop planning and execution framework with a heuristic-search-based anytime solver that computes partial policies within a limited time budget. The performance of the framework is demonstrated both in real world and in simulation on the high-precision task of inserting a plug into a port using a UR10e manipulator, resolving positional uncertainties up to 50 centimeters and angular uncertainties close to $2π$. Experimental results highlight the framework's effectiveness, achieving a 93\% success rate in the real world and over 50\% improvement in solution quality compared to greedy baselines, significantly accelerating planning and enabling real-time solutions for complex problems.
