GPN: Generative Point-based NeRF
Haipeng Wang
TL;DR
The paper tackles the problem of incomplete scenes captured by real-world scanners by introducing Generative Point-based NeRF (GPN), which jointly reconstructs and repairs point clouds through a hypernetwork-based VAE that outputs NeRF weights conditioned on colored point clouds. It presents two frameworks: Generation, which reconstructs a high-resolution complete cloud, and Completion, which repairs missing regions while maintaining multi-view consistency via per-scene fine-tuning. The approach leverages voxel-based sampling with NeRFAcc and TriVol-inspired feature extraction to render and optimize at high quality with modest memory, achieving competitive results on ShapeNet for reconstruction, repair, and completion. The work enables color-consistent, high-fidelity implicit representations from sparse inputs and supports flexible completion and interpolation across object parts, with practical implications for industrial 3D reconstruction and editing.
Abstract
Scanning real-life scenes with modern registration devices typically gives incomplete point cloud representations, primarily due to the limitations of partial scanning, 3D occlusions, and dynamic light conditions. Recent works on processing incomplete point clouds have always focused on point cloud completion. However, these approaches do not ensure consistency between the completed point cloud and the captured images regarding color and geometry. We propose using Generative Point-based NeRF (GPN) to reconstruct and repair a partial cloud by fully utilizing the scanning images and the corresponding reconstructed cloud. The repaired point cloud can achieve multi-view consistency with the captured images at high spatial resolution. For the finetunes of a single scene, we optimize the global latent condition by incorporating an Auto-Decoder architecture while retaining multi-view consistency. As a result, the generated point clouds are smooth, plausible, and geometrically consistent with the partial scanning images. Extensive experiments on ShapeNet demonstrate that our works achieve competitive performances to the other state-of-the-art point cloud-based neural scene rendering and editing performances.
