Dynamic Scene Reconstruction: Recent Advance in Real-time Rendering and Streaming
Jiaxuan Zhu, Hao Tang
TL;DR
This survey provides a comprehensive overview of dynamic scene reconstruction and rendering from 2D imagery, emphasizing Neural Radiance Field (NeRF)–based and 3D Gaussian Splatting (3D-GS) approaches. It categorizes dynamic NeRFs into time-augmented, deformation-based, and hybrid representations, and reviews dynamic 3D-GS through deformation-field, 4D-primitive, and per-frame training strategies, highlighting efficiency and quality trade-offs. The article also covers volumetric video representations and streaming, detailing compression, rate-distortion optimization, and streaming pipelines, supported by extensive dataset and benchmark comparisons. By synthesizing 170+ papers, it identifies key challenges—data sparsity, temporal coherence, and scalability—and outlines practical future directions for real-time, wide-scale dynamic scene capture and transmission.
Abstract
Representing and rendering dynamic scenes from 2D images is a fundamental yet challenging problem in computer vision and graphics. This survey provides a comprehensive review of the evolution and advancements in dynamic scene representation and rendering, with a particular emphasis on recent progress in Neural Radiance Fields based and 3D Gaussian Splatting based reconstruction methods. We systematically summarize existing approaches, categorize them according to their core principles, compile relevant datasets, compare the performance of various methods on these benchmarks, and explore the challenges and future research directions in this rapidly evolving field. In total, we review over 170 relevant papers, offering a broad perspective on the state of the art in this domain.
