Feature Splatting for Better Novel View Synthesis with Low Overlap
T. Berriel Martins, Javier Civera
TL;DR
FeatSplat replaces spherical-harmonic color encodings in 3D Gaussian Splatting with learnable per-Gaussian feature vectors, enabling richer textures and better generalization for novel view synthesis, especially at low view overlap. By alpha-blending Gaussian features to form per-pixel features and decoding them with a compact MLP conditioned on a camera embedding, the method supports both high-quality RGB rendering and per-pixel semantic segmentation. Empirical results on Mip-360, Tanks and Temples, Deep Blending, and ScanNet++ show FeatSplat achieving superior or competitive PSNR, SSIM, and LPIPS, while reducing memory and enabling real-time rendering; 32-dim features often outperform 16-dim variants. The approach also supports lighting manipulation at inference and extends to semantic segmentation with modest additional capacity, marking a practical, flexible improvement over SH-based 3DGS with meaningful implications for robotics, AR/VR, and open-vocabulary tasks.
Abstract
3D Gaussian Splatting has emerged as a very promising scene representation, achieving state-of-the-art quality in novel view synthesis significantly faster than competing alternatives. However, its use of spherical harmonics to represent scene colors limits the expressivity of 3D Gaussians and, as a consequence, the capability of the representation to generalize as we move away from the training views. In this paper, we propose to encode the color information of 3D Gaussians into per-Gaussian feature vectors, which we denote as Feature Splatting (FeatSplat). To synthesize a novel view, Gaussians are first "splatted" into the image plane, then the corresponding feature vectors are alpha-blended, and finally the blended vector is decoded by a small MLP to render the RGB pixel values. To further inform the model, we concatenate a camera embedding to the blended feature vector, to condition the decoding also on the viewpoint information. Our experiments show that these novel model for encoding the radiance considerably improves novel view synthesis for low overlap views that are distant from the training views. Finally, we also show the capacity and convenience of our feature vector representation, demonstrating its capability not only to generate RGB values for novel views, but also their per-pixel semantic labels. Code available at https://github.com/tberriel/FeatSplat . Keywords: Gaussian Splatting, Novel View Synthesis, Feature Splatting
