HybridGS: Decoupling Transients and Statics with 2D and 3D Gaussian Splatting

Jingyu Lin; Jiaqi Gu; Lubin Fan; Bojian Wu; Yujing Lou; Renjie Chen; Ligang Liu; Jieping Ye

TL;DR

HybridGS introduces a hybrid representation that decouples transient objects from static scene content, using per-image 2D Gaussians for transients and multi-view-consistent 3D Gaussians for statics. A multi-view regulated supervision scheme guides the 3D Gaussians using co-visible regions, and a three-stage training strategy first alternates between the two components and then optimizes them jointly. The approach yields state-of-the-art novel view synthesis on challenging indoor and outdoor datasets with distractors, while reducing storage and computation compared to traditional 3DGS. It handles transient content in casually captured scenes without relying on semantic priors, and could be extended to illumination variability and appearance modeling.

Abstract

Rendering high-quality novel views with 3D Gaussian Splatting (3DGS) is challenging in scenes that contain transient objects. We propose a hybrid representation, termed HybridGS, that uses per-image 2D Gaussians for transient objects and retains traditional 3D Gaussians for the static scene. 3DGS is best suited to static scenes, which satisfy multi-view consistency; transient objects appear only occasionally and violate that assumption, so we model them as planar objects seen from a single view and represent them with 2D Gaussians. The representation thus decomposes the scene according to a fundamental criterion, viewpoint consistency. We additionally present a multi-view regulated supervision method for 3DGS that leverages information from co-visible regions and further sharpens the distinction between transients and statics. Finally, we propose a straightforward yet effective multi-stage training strategy that ensures robust training and high-quality view synthesis across various settings. Experiments on benchmark datasets show state-of-the-art novel view synthesis in both indoor and outdoor scenes, even in the presence of distractors.
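
To make the hybrid idea concrete, the following is a minimal sketch of how a per-image set of 2D Gaussians might be splatted into a transient layer and blended over a static render. It is not the authors' implementation: the abstract does not specify the blending rule, so the standard "over" compositing used here is an assumption, as are the isotropic Gaussians, the dictionary layout, and the function names `render_transients` and `composite`. The static image stands in for the output of a 3DGS renderer, which is not reproduced here.

```python
import math


def render_transients(gaussians, height, width):
    """Splat isotropic 2D Gaussians for ONE image, front to back.

    Hypothetical layout (an assumption, not from the paper): each Gaussian is
    a dict with 'mean' (x, y), 'sigma', 'opacity' in [0, 1] and 'color'
    (r, g, b). The list order is the blend order.
    Returns a premultiplied color image and an accumulated alpha mask.
    """
    color = [[[0.0, 0.0, 0.0] for _ in range(width)] for _ in range(height)]
    mask = [[0.0] * width for _ in range(height)]
    for y in range(height):
        for x in range(width):
            transmittance = 1.0  # fraction of light not yet absorbed
            for g in gaussians:
                dx, dy = x - g["mean"][0], y - g["mean"][1]
                falloff = math.exp(-(dx * dx + dy * dy) / (2.0 * g["sigma"] ** 2))
                alpha = g["opacity"] * falloff
                weight = transmittance * alpha
                for c in range(3):
                    color[y][x][c] += weight * g["color"][c]
                transmittance *= 1.0 - alpha
            mask[y][x] = 1.0 - transmittance
    return color, mask


def composite(static_img, transient_img, mask):
    """Assumed 'over' blend: premultiplied transient + (1 - mask) * static."""
    return [
        [
            [t + (1.0 - m) * s for t, s in zip(t_px, s_px)]
            for t_px, s_px, m in zip(t_row, s_row, m_row)
        ]
        for t_row, s_row, m_row in zip(transient_img, static_img, mask)
    ]


# Usage: a blue static render with one opaque red transient blob in the center.
static = [[[0.0, 0.0, 1.0] for _ in range(5)] for _ in range(5)]
blob = {"mean": (2, 2), "sigma": 1.0, "opacity": 1.0, "color": (1.0, 0.0, 0.0)}
transient, mask = render_transients([blob], 5, 5)
image = composite(static, transient, mask)
```

Because the 2D Gaussians live in image space and belong to a single view, nothing in this sketch ties them to other cameras; only the static layer is expected to be multi-view consistent. The accumulated mask also gives a per-pixel estimate of where the transient layer covers the static one.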
