Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection
Tessa Pulli, Peter Hönig, Stefan Thalhammer, Matthias Hirschmanner, Markus Vincze
TL;DR
This work tackles RGB-only 6D pose estimation of transparent objects by introducing edge-detection as a preprocessing step to amplify boundary cues. The authors integrate edge representations with a GDR-Net–based pose estimator and a YOLOX detector, evaluating four configurations on the synthetic Trans6D-32K dataset. Results show that Canny edge preprocessing can improve mean $ADD$-S scores and pose accuracy for several objects, though edge augmentation does not consistently benefit object detection. The study highlights the potential of edge representations to aid challenging transparent-object perception and points to future work on broader datasets and textured objects to generalize the findings.
Abstract
Object pose estimation of transparent objects remains a challenging task in the field of robot vision due to the immense influence of lighting, background, and reflections. However, the edges of clear objects have the highest contrast, which leads to stable and prominent features. We propose a novel approach by incorporating edge detection in a pre-processing step for the tasks of object detection and object pose estimation. We conducted experiments to investigate the effect of edge detectors on transparent objects. We examine the performance of the state-of-the-art 6D object pose estimation pipeline GDR-Net and the object detector YOLOX when applying different edge detectors as pre-processing steps (i.e., Canny edge detection with and without color information, and holistically-nested edges (HED)). We evaluate the physically-based rendered dataset Trans6D-32 K of transparent objects with parameters proposed by the BOP Challenge. Our results indicate that applying edge detection as a pre-processing enhances performance for certain objects.
