Shape-biased Texture Agnostic Representations for Improved Textureless and Metallic Object Detection and 6D Pose Estimation
Peter Hönig, Stefan Thalhammer, Jean-Baptiste Weibel, Matthias Hirschmanner, Markus Vincze
TL;DR
This work tackles detection and 6D pose estimation for textureless and metallic objects by inducing a shape bias through randomized UV-mapped textures during synthetic data rendering, achieving geometry-focused representations with negligible overhead. The proposed approach, implemented in a BlenderProc-based pipeline with $n=1226$ textures, outperforms texture-based baselines and a style-transfer baseline across three detectors and two pose estimators, especially under varying illumination and noise. It also demonstrates improved robustness to common perturbations and provides insight via ablations on texture count and mesh origin. The method offers a practical route to reduce reliance on online augmentations and enhances generalization to textureless/metallic materials in real-world robotics scenarios.
Abstract
Recent advances in machine learning have greatly benefited object detection and 6D pose estimation. However, textureless and metallic objects still pose a significant challenge because they offer few visual cues and because CNNs are biased toward texture. To address this issue, we propose a strategy for inducing a shape bias during CNN training. In particular, by randomizing the textures applied to object surfaces during data rendering, we create training data without consistent textural cues. This methodology integrates seamlessly into existing data rendering engines and adds negligible computational overhead to data rendering and network training. Our findings demonstrate that the shape bias we induce via randomized texturing improves over existing approaches using style transfer. We evaluate with three detectors and two pose estimators. For the most recent object detector and for pose estimation in general, estimation accuracy improves for textureless and metallic objects. Additionally, we show that our approach increases pose estimation accuracy in the presence of image noise and strong illumination changes. Code and datasets are publicly available at github.com/hoenigpeter/randomized_texturing.
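
The core idea of randomized texturing can be illustrated with a minimal sketch: for every rendered scene, each object draws a texture independently from a pool, so no object keeps a consistent appearance across the training set. The sketch below is not the authors' pipeline. The function name `assign_random_textures`, the texture file names, and the object identifiers are hypothetical, and the actual rendering step (applying the texture through the object's UV map in the renderer) is left out. The pool size of 1226 follows the number of textures stated in the summary above.

```python
import random


def assign_random_textures(object_ids, texture_pool, num_scenes, seed=0):
    """Return one dict per scene mapping each object id to a sampled texture.

    Hypothetical helper for illustration only; it plans texture
    assignments and does not render anything.
    """
    rng = random.Random(seed)  # seeded so the plan is reproducible
    plan = []
    for _ in range(num_scenes):
        # Sample independently per object and per scene, so texture
        # carries no consistent information about object identity.
        plan.append({obj: rng.choice(texture_pool) for obj in object_ids})
    return plan


# Assumed placeholder names; a real pipeline would list texture files on disk.
texture_pool = [f"texture_{i:04d}.png" for i in range(1226)]
object_ids = ["obj_01", "obj_02", "obj_03"]

plan = assign_random_textures(object_ids, texture_pool, num_scenes=5)
for scene_index, scene in enumerate(plan):
    print(scene_index, scene)
```

In a rendering pipeline, each entry of `plan` would be consumed when a scene is set up, before the frame is rendered. Because the sampling happens once at data generation time, it adds no cost to network training.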
