Embodied Uncertainty-Aware Object Segmentation
Xiaolin Fang, Leslie Pack Kaelbling, Tomás Lozano-Pérez
TL;DR
The paper tackles ambiguity in object segmentation for embodied robotics by introducing UncOS, which generates a distribution over region-level segmentation hypotheses via repeated prompting of large pre-trained models. Building on this, EOS creates a 3D belief state and a belief-guided action planner that selects robot perturbations to reduce segmentation uncertainty in a closed loop with observations. Empirical results show that UncOS achieves state-of-the-art-like performance on unseen objects in single-image settings, and that the embodied extension EOS enables more efficient disambiguation through targeted interactions in real robots. Overall, the work advances embodied perception by coupling region-level uncertainty with action-driven disambiguation, enabling more reliable manipulation in cluttered, unknown-object scenes.
Abstract
We introduce uncertainty-aware object instance segmentation (UncOS) and demonstrate its usefulness for embodied interactive segmentation. To deal with uncertainty in robot perception, we propose a method for generating a hypothesis distribution of object segmentation. We obtain a set of region-factored segmentation hypotheses together with confidence estimates by making multiple queries of large pre-trained models. This process can produce segmentation results that achieve state-of-the-art performance on unseen object segmentation problems. The output can also serve as input to a belief-driven process for selecting robot actions to perturb the scene to reduce ambiguity. We demonstrate the effectiveness of this method in real-robot experiments. Website: https://sites.google.com/view/embodied-uncertain-seg
