Domain Generalization for 6D Pose Estimation Through NeRF-based Image Synthesis
Antoine Legrand, Renaud Detry, Christophe De Vleeschouwer
TL;DR
The paper tackles domain shift in 6D pose estimation for spacecraft by proposing NeRF-based data augmentation. It trains an in-the-wild NeRF on synthetic data to generate $S_{nerf}$ with diverse viewpoints, illumination via appearance embeddings, and texture through color perturbations, forming $S_{train}=S_{synth}\cup S_{nerf}$. Experiments on SPEED+ show substantial improvements in target-domain pose accuracy, with reductions of up to $55\%$ and $45\%$ in angular and translation errors on Lightbox and Sunlamp, respectively, and ablations confirm the value of appearance extrapolation and texture randomization. The approach demonstrates that NeRF-synthesized data can enable robust pose estimation even when real data or CAD models are limited, offering a scalable path for domain-generalizable 6D pose estimation in space missions.
Abstract
This work introduces a novel augmentation method that increases the diversity of a train set to improve the generalization abilities of a 6D pose estimation network. For this purpose, a Neural Radiance Field is trained from synthetic images and exploited to generate an augmented set. Our method enriches the initial set by enabling the synthesis of images with (i) unseen viewpoints, (ii) rich illumination conditions through appearance extrapolation, and (iii) randomized textures. We validate our augmentation method on the challenging use-case of spacecraft pose estimation and show that it significantly improves the pose estimation generalization capabilities. On the SPEED+ dataset, our method reduces the error on the pose by 50% on both target domains.
