SAGE: Semantic-Driven Adaptive Gaussian Splatting in Extended Reality
Chiara Schiavo, Elena Camuffo, Leonardo Badia, Simone Milani
TL;DR
XR rendering under tight memory and compute budgets is addressed by SAGE, which fuses semantic segmentation with 3D Gaussian Splatting to perform per-semantic-category level-of-detail optimization guided by a target $SSIM_t$. The method maps 2D semantic labels onto a Structure-from-Motion point cloud and solves a constrained optimization $ \min_{i} \sum_l N_l(i) \;\text{s.t.}\; \text{SSIM}_{l,i}(d_{min,l}) \ge \text{SSIM}_t$, using a distance-aware, piecewise exponential model for $\text{SSIM}_{l,i}(d_{min,l})$ and fitting parameters per label. Evaluations on the Mip-NeRF360 dataset show that SAGE substantially reduces Gaussians and memory while maintaining comparable visual quality, with transferable per-label iterations across scenes and robust cross-view performance. The approach enables scalable, real-time XR rendering by prioritizing resources where semantic importance and viewpoint proximity demand higher fidelity.
Abstract
3D Gaussian Splatting (3DGS) has significantly improved the efficiency and realism of three-dimensional scene visualization in several applications, ranging from robotics to eXtended Reality (XR). This work presents SAGE (Semantic-Driven Adaptive Gaussian Splatting in Extended Reality), a novel framework designed to enhance the user experience by dynamically adapting the Level of Detail (LOD) of different 3DGS objects identified via a semantic segmentation. Experimental results demonstrate how SAGE effectively reduces memory and computational overhead while keeping a desired target visual quality, thus providing a powerful optimization for interactive XR applications.
