PanSplat: 4K Panorama Synthesis with Feed-Forward Gaussian Splatting
Cheng Zhang, Haofei Xu, Qianyi Wu, Camilo Cruz Gambardella, Dinh Phung, Jianfei Cai
TL;DR
PanSplat tackles high-resolution panorama synthesis by extending feed-forward 3D Gaussian splatting to spherical panorama geometry. It introduces a spherical 3D Gaussian pyramid placed on a Fibonacci lattice, a hierarchical spherical cost volume with transformer-backed feature extraction, and Gaussian heads that predict multi-scale Gaussian parameters, rendered via a cubemap pipeline. A two-step deferred backpropagation strategy and deferred blending address memory constraints, enabling 4K ($2048 \times 4096$) synthesis on a single A100 GPU and achieving state-of-the-art results with substantial speedups over prior methods. The approach yields sharp, high-frequency textures and improved geometry, demonstrating strong generalization to real-world data and broad VR-relevant applications, though dynamic scenes remain future work.
Abstract
With the advent of portable 360° cameras, panorama has gained significant attention in applications like virtual reality (VR), virtual tours, robotics, and autonomous driving. As a result, wide-baseline panorama view synthesis has emerged as a vital task, where high resolution, fast inference, and memory efficiency are essential. Nevertheless, existing methods are typically constrained to lower resolutions (512 $\times$ 1024) due to demanding memory and computational requirements. In this paper, we present PanSplat, a generalizable, feed-forward approach that efficiently supports resolution up to 4K (2048 $\times$ 4096). Our approach features a tailored spherical 3D Gaussian pyramid with a Fibonacci lattice arrangement, enhancing image quality while reducing information redundancy. To accommodate the demands of high resolution, we propose a pipeline that integrates a hierarchical spherical cost volume and Gaussian heads with local operations, enabling two-step deferred backpropagation for memory-efficient training on a single A100 GPU. Experiments demonstrate that PanSplat achieves state-of-the-art results with superior efficiency and image quality across both synthetic and real-world datasets. Code is available at https://github.com/chengzhag/PanSplat.
