Efficient Semantic Splatting for Remote Sensing Multi-view Segmentation
Zipeng Qi, Hao Chen, Haotian Zhang, Zhengxia Zou, Zhenwei Shi
TL;DR
The paper targets efficient, accurate multi-view semantic segmentation for remote sensing under sparse labeling by extending Gaussian Splatting to semantic attributes. It combines explicit point-cloud splatting with a one-time rendering pipeline, SAM2-based boundary pseudo-labels, and 2D/3D aggregation losses to enhance view consistency and spatial continuity. Empirical results on CARLA-based synthetic data and Google Maps real data show superior accuracy and dramatically lower latency compared with training- and optimization-based baselines, validating practical applicability in real-world remote sensing pipelines. The approach offers a scalable, label-efficient path toward high-quality multi-view scene understanding and downstream 3D reconstructions.
Abstract
In this paper, we propose a novel semantic splatting approach based on Gaussian Splatting to achieve efficient and low-latency. Our method projects the RGB attributes and semantic features of point clouds onto the image plane, simultaneously rendering RGB images and semantic segmentation results. Leveraging the explicit structure of point clouds and a one-time rendering strategy, our approach significantly enhances efficiency during optimization and rendering. Additionally, we employ SAM2 to generate pseudo-labels for boundary regions, which often lack sufficient supervision, and introduce two-level aggregation losses at the 2D feature map and 3D spatial levels to improve the view-consistent and spatial continuity.
