Learning to Double Guess: An Active Perception Approach for Estimating the Center of Mass of Arbitrary Objects
Shengmiao Jin, Yuchen Mo, Wenzhen Yuan
TL;DR
This work tackles the problem of estimating the center of mass for arbitrary objects in unstructured environments by introducing U-GRAPH, a framework that couples a Bayesian Neural Network for uncertainty quantification with an ActiveNet that selects informative second orientations through grid search. The two-measurement strategy uses an initial CoM estimate from a fixed grasp, followed by a second orientation to gain additional information, and fuses both measurements into a Gaussian posterior for a robust final estimate. Empirical evaluation on customized training objects and unseen real-world objects shows that U-GRAPH achieves an average error of about $1.47\,\text{cm}$ and $7.6\%$ relative error on unseen data, outperforming analytical baselines and simple two-measurement schemes. The method demonstrates strong generalization across objects with varying contact geometry, surface friction, and shapes, and outlines future improvements such as slip-detection with GelSight sensors to further enhance robustness.
Abstract
Manipulating arbitrary objects in unstructured environments is a significant challenge in robotics, primarily due to difficulties in determining an object's center of mass. This paper introduces U-GRAPH: Uncertainty-Guided Rotational Active Perception with Haptics, a novel framework to enhance the center of mass estimation using active perception. Traditional methods often rely on single interaction and are limited by the inherent inaccuracies of Force-Torque (F/T) sensors. Our approach circumvents these limitations by integrating a Bayesian Neural Network (BNN) to quantify uncertainty and guide the robotic system through multiple, information-rich interactions via grid search and a neural network that scores each action. We demonstrate the remarkable generalizability and transferability of our method with training on a small dataset with limited variation yet still perform well on unseen complex real-world objects.
