Neural Texture Splatting: Expressive 3D Gaussian Splatting for View Synthesis, Geometry, and Dynamic Reconstruction
Yiming Wang, Shaofei Wang, Marko Mihajlovic, Siyu Tang
TL;DR
This work addresses the limited expressiveness of 3D Gaussian Splatting (3DGS) by introducing Neural Texture Splatting (NTS), which attaches a global neural field to predict per-primitive RGBA texture fields for each splat. The global tri-plane representation and lightweight decoders enable view- and time-dependent local textures while maintaining efficiency through shared information and CP decomposition. Across dense-view and sparse-view tasks, including static and dynamic scenes, NTS yields state-of-the-art improvements in novel view synthesis, geometry, and dynamic reconstruction, while reducing per-primitive overfitting and improving generalization. Although NTS incurs additional computation from ray–Gaussian intersections and neural decoding, it remains a plug-and-play enhancement to existing 3DGS pipelines with clear practical impact for high-fidelity 3D scene reconstruction and rendering.
Abstract
3D Gaussian Splatting (3DGS) has emerged as a leading approach for high-quality novel view synthesis, with numerous variants extending its applicability to a broad spectrum of 3D and 4D scene reconstruction tasks. Despite its success, the representational capacity of 3DGS remains limited by the use of 3D Gaussian kernels to model local variations. Recent works have proposed to augment 3DGS with additional per-primitive capacity, such as per-splat textures, to enhance its expressiveness. However, these per-splat texture approaches primarily target dense novel view synthesis with a reduced number of Gaussian primitives, and their effectiveness tends to diminish when applied to more general reconstruction scenarios. In this paper, we aim to achieve concrete performance improvement over state-of-the-art 3DGS variants across a wide range of reconstruction tasks, including novel view synthesis, geometry and dynamic reconstruction, under both sparse and dense input settings. To this end, we introduce Neural Texture Splatting (NTS). At the core of our approach is a global neural field (represented as a hybrid of a tri-plane and a neural decoder) that predicts local appearance and geometric fields for each primitive. By leveraging this shared global representation that models local texture fields across primitives, we significantly reduce model size and facilitate efficient global information exchange, demonstrating strong generalization across tasks. Furthermore, our neural modeling of local texture fields introduces expressive view- and time-dependent effects, a critical aspect that existing methods fail to account for. Extensive experiments show that Neural Texture Splatting consistently improves models and achieves state-of-the-art results across multiple benchmarks.
