Divide-and-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting

Zhiying Yan; Yiyuan Liang; Shilv Cai; Tao Zhang; Sheng Zhong; Luxin Yan; Xu Zou

Divide-and-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting

Zhiying Yan, Yiyuan Liang, Shilv Cai, Tao Zhang, Sheng Zhong, Luxin Yan, Xu Zou

TL;DR

The paper tackles the challenge of dynamic scene understanding with semantic 4D Gaussians by introducing Dual-Hierarchical Optimization (DHO), which separates static background from dynamic foreground via Hierarchical Gaussian Flow and provides semantic-guided rendering through Hierarchical Gaussian Guidance. It augments 4D Gaussian Splatting with semantic features bound to Gaussians, compressed CLIP semantics, and a deformation-aware rendering pipeline, enabling higher-fidelity rendering and improved segmentation in complex scenes. Empirical results on synthetic and real datasets show consistent gains in PSNR, SSIM, LPIPS, and mIoU, with ablations confirming the essential roles of HGF and HGG. The approach is memory-efficient and adaptable to existing models, offering robust semantic reasoning and downstream editability for dynamic 4D scenes.

Abstract

Semantic 4D Gaussians can be used for reconstructing and understanding dynamic scenes, with temporal variations than static scenes. Directly applying static methods to understand dynamic scenes will fail to capture the temporal features. Few works focus on dynamic scene understanding based on Gaussian Splatting, since once the same update strategy is employed for both dynamic and static parts, regardless of the distinction and interaction between Gaussians, significant artifacts and noise appear. We propose Dual-Hierarchical Optimization (DHO), which consists of Hierarchical Gaussian Flow and Hierarchical Gaussian Guidance in a divide-and-conquer manner. The former implements effective division of static and dynamic rendering and features. The latter helps to mitigate the issue of dynamic foreground rendering distortion in textured complex scenes. Extensive experiments show that our method consistently outperforms the baselines on both synthetic and real-world datasets, and supports various downstream tasks. Project Page: https://sweety-yan.github.io/DHO.

Divide-and-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting

TL;DR

Abstract

Divide-and-Conquer: Dual-Hierarchical Optimization for Semantic 4D Gaussian Spatting

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (13)