Pose-free 3D Gaussian splatting via shape-ray estimation
Youngju Na, Taeyeon Kim, Jumin Lee, Kyu Beom Han, Woo Jae Kim, Sung-eui Yoon
TL;DR
SHARE tackles the challenge of pose-free, generalizable 3D Gaussian splatting by introducing a pose-aware canonical volume that fuses multi-view features without explicit 3D pose alignment. It jointly estimates relative camera rays and 3D Gaussians, and uses an anchor-based coarse-to-fine scheme to refine local geometry around coarse anchors. The two main components—Ray-guided Multi-view Fusion and Anchor-aligned Gaussian Prediction—enable robust, feed-forward pose-free novel view synthesis across diverse real-world datasets and demonstrate strong cross-dataset generalization. This approach offers an efficient, scalable alternative for pose-free 3D reconstruction in sparse-view scenarios, with practical implications for real-world rendering and scene understanding.
Abstract
While generalizable 3D Gaussian splatting enables efficient, high-quality rendering of unseen scenes, it heavily depends on precise camera poses for accurate geometry. In real-world scenarios, obtaining accurate poses is challenging, leading to noisy pose estimates and geometric misalignments. To address this, we introduce SHARE, a pose-free, feed-forward Gaussian splatting framework that overcomes these ambiguities by joint shape and camera rays estimation. Instead of relying on explicit 3D transformations, SHARE builds a pose-aware canonical volume representation that seamlessly integrates multi-view information, reducing misalignment caused by inaccurate pose estimates. Additionally, anchor-aligned Gaussian prediction enhances scene reconstruction by refining local geometry around coarse anchors, allowing for more precise Gaussian placement. Extensive experiments on diverse real-world datasets show that our method achieves robust performance in pose-free generalizable Gaussian splatting. Code is avilable at https://github.com/youngju-na/SHARE
