BillBoard Splatting (BBSplat): Learnable Textured Primitives for Novel View Synthesis
David Svitov, Pietro Morerio, Lourdes Agapito, Alessio Del Bue
TL;DR
BillBoard Splatting (BBSplat) tackles the efficiency-accuracy gap in novel view synthesis by representing scenes with learnable textured planar primitives (billboards) that replace 3D Gaussians in Gaussian Splatting pipelines. By equipping each billboard with an RGB texture and an alpha map, BBSplat permits arbitrary shapes, high-frequency detail, and accurate mesh extraction, while enabling ray-tracing-like rasterization effects. The approach introduces a texture-based regularization and a compression pipeline that yields substantial storage reductions (up to ~7x on average and up to x17x vs 3DGS) without sacrificing rendering quality, achieving state-of-the-art PSNR on DTU and competitive results on Tanks&Temples and Mip-NeRF-360. Overall, BBSplat provides a scalable, photorealistic NVS framework with explicit geometry suitable for mesh extraction and rendering, improving practicality for real-world applications and downstream tasks.
Abstract
We present billboard Splatting (BBSplat) - a novel approach for novel view synthesis based on textured geometric primitives. BBSplat represents the scene as a set of optimizable textured planar primitives with learnable RGB textures and alpha-maps to control their shape. BBSplat primitives can be used in any Gaussian Splatting pipeline as drop-in replacements for Gaussians. The proposed primitives close the rendering quality gap between 2D and 3D Gaussian Splatting (GS), enabling the accurate extraction of 3D mesh as in the 2DGS framework. Additionally, the explicit nature of planar primitives enables the use of the ray-tracing effects in rasterization. Our novel regularization term encourages textures to have a sparser structure, enabling an efficient compression that leads to a reduction in the storage space of the model up to x17 times compared to 3DGS. Our experiments show the efficiency of BBSplat on standard datasets of real indoor and outdoor scenes such as Tanks&Temples, DTU, and Mip-NeRF-360. Namely, we achieve a state-of-the-art PSNR of 29.72 for DTU at Full HD resolution.
