Improving 6D Object Pose Estimation of metallic Household and Industry Objects
Thomas Pöllabauer, Michael Gasser, Tristan Wirth, Sarah Berkei, Volker Knauthe, Arjan Kuijper
TL;DR
This work addresses the reduced accuracy of 6D pose estimation for metallic objects caused by reflections and specular highlights. It introduces a new BOP-compatible metallic dataset rendered with physically-based rendering to mirror industrial lighting and backgrounds, and extends the GDRNPP framework with two new heads: keypoint heatmap prediction and material properties estimation. The key contributions also include leveraging Bottleneck Attention Modules to fuse geometric and appearance cues, and demonstrating substantial performance gains on metallic objects across standard 6D pose metrics. The findings show that explicit geometric keypoints and material-aware predictions can significantly improve pose estimation in challenging metallic scenarios, advancing applicability in robotics and automation; the dataset is publicly available for further research.
Abstract
6D object pose estimation suffers from reduced accuracy when applied to metallic objects. We set out to improve the state-of-the-art by addressing challenges such as reflections and specular highlights in industrial applications. Our novel BOP-compatible dataset, featuring a diverse set of metallic objects (cans, household, and industrial items) under various lighting and background conditions, provides additional geometric and visual cues. We demonstrate that these cues can be effectively leveraged to enhance overall performance. To illustrate the usefulness of the additional features, we improve upon the GDRNPP algorithm by introducing an additional keypoint prediction and material estimator head in order to improve spatial scene understanding. Evaluations on the new dataset show improved accuracy for metallic objects, supporting the hypothesis that additional geometric and visual cues can improve learning.
