RecurGS: Interactive Scene Modeling via Discrete-State Recurrent Gaussian Fusion
Wenhao Hu, Haonan Zhou, Zesheng Li, Liu Liu, Jiacheng Dong, Zhizhong Su, Gaoang Wang
TL;DR
RecurGS tackles interactive scene modeling under discrete state changes by maintaining a single evolving Gaussian scene and updating it through cross-state consistency, SE(3) pose refinement, and replay-guided recurrent optimization. The approach enables novel-state synthesis by transforming foreground objects while preserving background, and uses visibility-guided refinement and region completion to efficiently fuse only observable regions. Experiments on synthetic and real data show competitive reconstruction quality with substantially improved update efficiency and scalability compared to all-at-once baselines like IGFuse. The method demonstrates strong potential for long-horizon, object-level interactive Gaussian worlds in vision and robotics applications.
Abstract
Recent advances in 3D scene representations have enabled high-fidelity novel view synthesis, yet adapting to discrete scene changes and constructing interactive 3D environments remain open challenges in vision and robotics. Existing approaches focus solely on updating a single scene without supporting novel-state synthesis. Others rely on diffusion-based object-background decoupling that works on one state at a time and cannot fuse information across multiple observations. To address these limitations, we introduce RecurGS, a recurrent fusion framework that incrementally integrates discrete Gaussian scene states into a single evolving representation capable of interaction. RecurGS detects object-level changes across consecutive states, aligns their geometric motion using semantic correspondence and Lie-algebra based SE(3) refinement, and performs recurrent updates that preserve historical structures through replay supervision. A voxelized, visibility-aware fusion module selectively incorporates newly observed regions while keeping stable areas fixed, mitigating catastrophic forgetting and enabling efficient long-horizon updates. RecurGS supports object-level manipulation, synthesizes novel scene states without requiring additional scans, and maintains photorealistic fidelity across evolving environments. Extensive experiments across synthetic and real-world datasets demonstrate that our framework delivers high-quality reconstructions with substantially improved update efficiency, providing a scalable step toward continuously interactive Gaussian worlds.
