Points2NeRF: Generating Neural Radiance Fields from 3D point cloud
Dominik Zimny, Joanna Waczyńska, Tomasz Trzciński, Przemysław Spurek
TL;DR
Points2NeRF introduces a hypernetwork-based autoencoder that converts 3D colored point clouds into NeRF weights, enabling conditioned and generative Neural Radiance Field representations without relying solely on 2D images. The approach uses a PointNet-like encoder to produce a latent code, which a decoder then maps to NeRF weights $\Theta$ for a target network $F_{\Theta}:(x,y,z,\theta,\psi)\rightarrow (r,g,b,\sigma)$, enabling view synthesis and later mesh extraction via marching cubes. Experiments on ShapeNet show that Points2NeRF yields competitive PSNR and enables high-quality mesh reconstruction, while also serving as a useful pre-training initializer that significantly speeds up NeRF training. The work highlights practical impact for robotics and graphics by enabling robust NeRFs from sparse point clouds and offering a generative, conditioned 3D representation with potential for interpolation in object space.
Abstract
Contemporary registration devices for 3D visual information, such as LIDARs and various depth cameras, capture data as 3D point clouds. In turn, such clouds are challenging to be processed due to their size and complexity. Existing methods address this problem by fitting a mesh to the point cloud and rendering it instead. This approach, however, leads to the reduced fidelity of the resulting visualization and misses color information of the objects crucial in computer graphics applications. In this work, we propose to mitigate this challenge by representing 3D objects as Neural Radiance Fields (NeRFs). We leverage a hypernetwork paradigm and train the model to take a 3D point cloud with the associated color values and return a NeRF network's weights that reconstruct 3D objects from input 2D images. Our method provides efficient 3D object representation and offers several advantages over the existing approaches, including the ability to condition NeRFs and improved generalization beyond objects seen in training. The latter we also confirmed in the results of our empirical evaluation.
