SeeLe: A Unified Acceleration Framework for Real-Time Gaussian Splatting
Xiaotong Huang, He Zhu, Zihan Liu, Weikai Lin, Xiaohong Liu, Zhezhi He, Jingwen Leng, Minyi Guo, Yu Feng
TL;DR
This work tackles the challenge of real-time Gaussian Splatting (3DGS) on resource-limited mobile devices by identifying three bottlenecks: computational intensity, rendering inefficiency, and memory budget. It introduces Seele, a unified framework with two GPU-oriented techniques: Hybrid Preprocessing (HP), which uses a view-dependent scene representation and offline clustering with online filtering to load only relevant Gaussians, and Contribution-Aware Rasterization (CR), which prioritizes high-contribution Gaussians and skips low-contribution ones to boost rasterization efficiency. An integrated fine-tuning step further preserves rendering quality while improving view consistency. Empirically, Seele achieves up to 6.3x speedups and substantial runtime model reductions (up to ~39%), with better rendering quality across multiple datasets and hardware configurations, demonstrating strong practical value for mobile real-time rendering and guiding future GPU-aware acceleration strategies for 3DGS.
Abstract
3D Gaussian Splatting (3DGS) has become a crucial rendering technique for many real-time applications. However, the limited hardware resources on today's mobile platforms hinder these applications, as they struggle to achieve real-time performance. In this paper, we propose SeeLe, a general framework designed to accelerate the 3DGS pipeline for resource-constrained mobile devices. Specifically, we propose two GPU-oriented techniques: hybrid preprocessing and contribution-aware rasterization. Hybrid preprocessing alleviates the GPU compute and memory pressure by reducing the number of irrelevant Gaussians during rendering. The key is to combine our view-dependent scene representation with online filtering. Meanwhile, contribution-aware rasterization improves the GPU utilization at the rasterization stage by prioritizing Gaussians with high contributions while reducing computations for those with low contributions. Both techniques can be seamlessly integrated into existing 3DGS pipelines with minimal fine-tuning. Collectively, our framework achieves 2.6$\times$ speedup and 32.3\% model reduction while achieving superior rendering quality compared to existing methods.
