A Fast Volumetric Capture and Reconstruction Pipeline for Dynamic Point Clouds and Gaussian Splats
Athanasios Charisoudis, Simone Croci, Lam Kit Yung, Pascal Frossard, Aljosa Smolic
TL;DR
The paper presents a fast, modular volumetric capture system that accepts RGB-D or RGB-only input and outputs both point clouds and Gaussian splats. A unified, GPU-accelerated preprocessing stage feeds two parallel reconstruction backends. For the splat backend, the authors introduce a world-frame re-parameterization of Gaussian rotations and a targeted fine-tuning of GPS-Gaussian to improve robustness across camera configurations. Reconstructions can be exported as PLY, MPEG V-PCC, or SPLAT files and viewed in a web-based viewer or through Unity/Unreal plugins, with on-site previews at 5–10 FPS. The work emphasizes deployability, an open-source release, and practical use in unconstrained environments, offering a path toward real-time volumetric reconstruction on commodity hardware.
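As an illustration of what expressing a Gaussian's rotation in the world frame can mean, the sketch below composes a camera-to-world rotation with a rotation predicted in the camera frame. This is only the generic change-of-frame identity, not necessarily the re-parameterization used in the paper. It assumes unit quaternions in (w, x, y, z) order; the function names `quat_mul` and `gaussian_rotation_to_world` are hypothetical.

```python
import math

def quat_mul(a, b):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return (
        aw * bw - ax * bx - ay * by - az * bz,
        aw * bx + ax * bw + ay * bz - az * by,
        aw * by - ax * bz + ay * bw + az * bx,
        aw * bz + ax * by - ay * bx + az * bw,
    )

def gaussian_rotation_to_world(q_cam_to_world, q_gaussian_cam):
    """Express a camera-frame Gaussian rotation in the world frame.

    Assumption (not from the paper): the world-frame rotation is the
    camera-to-world rotation applied after the camera-frame rotation.
    """
    q = quat_mul(q_cam_to_world, q_gaussian_cam)
    norm = math.sqrt(sum(c * c for c in q))
    # Re-normalize to guard against drift from floating-point error.
    return tuple(c / norm for c in q)
```

With this convention, a Gaussian whose camera-frame rotation is the identity `(1, 0, 0, 0)` simply inherits the camera's orientation in the world frame.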
Abstract
We present a fast and efficient volumetric capture and reconstruction system that processes either RGB-D or RGB-only input to generate 3D representations in the form of point clouds and Gaussian splats. For Gaussian splat reconstruction, we build on and improve the GPS-Gaussian regressor, enabling high-quality reconstructions with minimal overhead. The system is designed for easy setup and deployment: it supports in-the-wild operation under uncontrolled illumination and arbitrary backgrounds, as well as flexible camera configurations, including sparse setups and arbitrary numbers of cameras and baselines. Captured data can be exported in standard formats such as PLY, MPEG V-PCC, and SPLAT, and visualized through a web-based viewer or Unity/Unreal plugins. A live on-location preview of both input and reconstruction is available at 5–10 FPS. We present qualitative findings focused on deployability, together with targeted ablations. The complete framework is open-source, facilitating reproducibility and further research.
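To make the PLY export concrete, here is a minimal sketch of writing a colored point cloud as an ASCII PLY file. It is not the system's exporter: the function name `write_ply_ascii` and the choice of per-vertex properties (float XYZ, 8-bit RGB) are assumptions made for illustration.

```python
def write_ply_ascii(path, points, colors):
    """Write XYZ points and 8-bit RGB colors to an ASCII PLY file.

    points: sequence of (x, y, z) floats
    colors: sequence of (r, g, b) integers in 0..255, same length as points
    """
    if len(points) != len(colors):
        raise ValueError("points and colors must have the same length")
    header = [
        "ply",
        "format ascii 1.0",
        f"element vertex {len(points)}",
        "property float x",
        "property float y",
        "property float z",
        "property uchar red",
        "property uchar green",
        "property uchar blue",
        "end_header",
    ]
    with open(path, "w", encoding="ascii", newline="\n") as f:
        f.write("\n".join(header) + "\n")
        # One vertex per line: position followed by color.
        for (x, y, z), (r, g, b) in zip(points, colors):
            f.write(f"{x:.6f} {y:.6f} {z:.6f} {int(r)} {int(g)} {int(b)}\n")
```

For example, `write_ply_ascii("cloud.ply", [(0.0, 0.0, 0.0)], [(255, 0, 0)])` writes a single red point that common PLY viewers should be able to open. A production exporter would more likely use the binary PLY variant for size and speed.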
