PanoDreamer: Optimization-Based Single Image to 360 3D Scene With Diffusion
Avinash Paliwal, Xilong Zhou, Andrii Tsarov, Nima Khademi Kalantari
TL;DR
PanoDreamer tackles the challenge of producing coherent 360° 3D scenes from a single image by decoupling panorama generation from depth estimation and solving both as coupled optimization problems via alternating minimization. The pipeline then inpaints occluded areas and reconstructs the scene with a 3D Gaussian splatting representation, using a four-layer LDI for depth-aware texture completion. Key contributions include a two-stage panorama-depth optimization (MultiConDiffusion) and a patch-wise panorama depth fusion (PanoDepthFusion), followed by an end-to-end 3DGS optimization with depth-guided losses. The method achieves superior global coherence and detail in wide-view renderings, with practical impact for VR/AR and immersive visualization, while acknowledging horizon-bounded inputs and occasional edge-blur artifacts as avenues for future refinement.
Abstract
In this paper, we present PanoDreamer, a novel method for producing a coherent 360° 3D scene from a single input image. Unlike existing methods that generate the scene sequentially, we frame the problem as single-image panorama and depth estimation. Once the coherent panoramic image and its corresponding depth are obtained, the scene can be reconstructed by inpainting the small occluded regions and projecting them into 3D space. Our key contribution is formulating single-image panorama and depth estimation as two optimization tasks and introducing alternating minimization strategies to effectively solve their objectives. We demonstrate that our approach outperforms existing techniques in single-image 360° 3D scene reconstruction in terms of consistency and overall quality.
