EfficientPose 6D: Scalable and Efficient 6D Object Pose Estimation
Zixuan Fang, Thomas Pöllabauer, Tristan Wirth, Sarah Berkei, Volker Knauthe, Arjan Kuijper
TL;DR
This work targets real-time 6D object pose estimation by enlarging the GDRNPP-based family with 40 candidate architectures and introducing the Adaptive Margin-Dependent Iterative Selection (AMIS) algorithm to automatically balance inference time and pose accuracy across diverse datasets. AMIS identifies robust sweet spots in the inference time–accuracy space, enabling model selection tailored to application constraints. Empirical results show that AMIS-selected models can deliver up to 35% faster inference with minimal accuracy loss, or about 25% accuracy gains with modest time increases, across LM-O, YCB-V, T-LESS, and ITODD. The approach offers a practical path to fast, scalable 6D pose estimation suitable for industrial real-time feedback and robotic manipulation, with avenues for further optimization in temporal fusion and end-to-end lightweight designs.
Abstract
In industrial applications requiring real-time feedback, such as quality control and robotic manipulation, the demand for high-speed and accurate pose estimation remains critical. Despite advances improving speed and accuracy in pose estimation, finding a balance between computational efficiency and accuracy poses significant challenges in dynamic environments. Most current algorithms lack scalability in estimation time, especially for diverse datasets, and the state-of-the-art (SOTA) methods are often too slow. This study focuses on developing a fast and scalable set of pose estimators based on GDRNPP to meet or exceed current benchmarks in accuracy and robustness, particularly addressing the efficiency-accuracy trade-off essential in real-time scenarios. We propose the AMIS algorithm to tailor the utilized model according to an application-specific trade-off between inference time and accuracy. We further show the effectiveness of the AMIS-based model choice on four prominent benchmark datasets (LM-O, YCB-V, T-LESS, and ITODD).
