DAV-GSWT: Diffusion-Active-View Sampling for Data-Efficient Gaussian Splatting Wang Tiles
Rong Fu, Jiekai Wu, Haiyun Wei, Yee Tan Jia, Wenxin Zhang, Yang Li, Xiaowen Ma, Wangyu Wu, Simon Fong
TL;DR
This work tackles data efficiency for large-scale 3D terrain synthesis by marrying diffusion priors with active view sampling in Gaussian Splatting Wang Tiles. The DAV-GSWT framework selects informative viewpoints via image- and latent-space uncertainty, refines geometry and textures with diffusion-based priors, and stitches tiles with semantic-aware seam optimization and adaptive LOD rendering. Key contributions include a probabilistic uncertainty-driven view acquisition loop, a diffusion-refinement pipeline for tile boundaries, and a real-time renderer with uncertainty-guided caching that maintains interactivity under tight data budgets. Experiments on synthetic and real terrains demonstrate substantial reductions in required views while preserving high visual fidelity and interactive performance, enabling scalable, infinite-terrain production for immersive applications.
Abstract
The emergence of 3D Gaussian Splatting has fundamentally redefined the capabilities of photorealistic neural rendering by enabling high-throughput synthesis of complex environments. While procedural methods like Wang Tiles have recently been integrated to facilitate the generation of expansive landscapes, these systems typically remain constrained by a reliance on densely sampled exemplar reconstructions. We present DAV-GSWT, a data-efficient framework that leverages diffusion priors and active view sampling to synthesize high-fidelity Gaussian Splatting Wang Tiles from minimal input observations. By integrating a hierarchical uncertainty quantification mechanism with generative diffusion models, our approach autonomously identifies the most informative viewpoints while hallucinating missing structural details to ensure seamless tile transitions. Experimental results indicate that our system significantly reduces the required data volume while maintaining the visual integrity and interactive performance necessary for large-scale virtual environments.
