SAOR: Single-View Articulated Object Reconstruction
Mehmet Aygün, Oisin Mac Aodha
TL;DR
SAOR tackles single-view articulated object reconstruction without category-specific 3D templates or skeleton priors by adopting a skeleton-free, part-based deformation model trained with a cross-instance swap consistency loss and silhouette-based viewpoint sampling. The method predicts shape, texture, and camera pose from a single image in a single forward pass, using differentiable rendering to enforce self-supervised consistency across multiple categories. Key innovations include a part-based articulation mechanism with learned skinning weights and a streamlined swap loss that reduces degeneracy for articulated objects, enabling category-agnostic generalization to over 100 animal categories. The results demonstrate improved 2D keypoint transfer and 3D Chamfer metrics compared to non-3D-supervised baselines, with efficient inference suitable for practical use, though texture realism and extreme viewpoints remain challenging.
Abstract
We introduce SAOR, a novel approach for estimating the 3D shape, texture, and viewpoint of an articulated object from a single image captured in the wild. Unlike prior approaches that rely on pre-defined category-specific 3D templates or tailored 3D skeletons, SAOR learns to articulate shapes from single-view image collections with a skeleton-free part-based model without requiring any 3D object shape priors. To prevent ill-posed solutions, we propose a cross-instance consistency loss that exploits disentangled object shape deformation and articulation. This is helped by a new silhouette-based sampling mechanism to enhance viewpoint diversity during training. Our method only requires estimated object silhouettes and relative depth maps from off-the-shelf pre-trained networks during training. At inference time, given a single-view image, it efficiently outputs an explicit mesh representation. We obtain improved qualitative and quantitative results on challenging quadruped animals compared to relevant existing work.
