City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web
Kaiwen Song, Xiaoyi Zeng, Chenqu Ren, Juyong Zhang
TL;DR
City-on-Web tackles real-time neural rendering of large-scale scenes on the web by combining a block-based radiance-field representation with an explicit Level-of-Detail scheme and dynamic loading. It enables per-block shaders to render large environments while preserving 3D occlusion through depth-sorted alpha blending, and proves equivalence to traditional volume rendering under Lambertian assumptions. The approach yields real-time rendering at about $32$ FPS at 1080p on a consumer RTX 3060 with memory usage around $1100$ MB, while maintaining high perceptual quality (SSIM/LPIPS) and competitive PSNR on urban datasets, via training with a finest-LOD model and progressive LOD baking. The work demonstrates practical web deployment of large-scale neural rendering, offering significant memory savings, scalable rendering, and a thorough experimental study, while acknowledging limitations related to lighting variation and non-Lambertian effects that motivate future improvements.
Abstract
Existing neural radiance field-based methods can achieve real-time rendering of small scenes on the web platform. However, extending these methods to large-scale scenes still poses significant challenges due to limited resources in computation, memory, and bandwidth. In this paper, we propose City-on-Web, the first method for real-time rendering of large-scale scenes on the web. We propose a block-based volume rendering method to guarantee 3D consistency and correct occlusion between blocks, and introduce a Level-of-Detail strategy combined with dynamic loading/unloading of resources to significantly reduce memory demands. Our system achieves real-time rendering of large-scale scenes at approximately 32FPS with RTX 3060 GPU on the web and maintains rendering quality comparable to the current state-of-the-art novel view synthesis methods.
