Neural Point-Based Graphics
Kara-Ali Aliev, Artem Sevastopolsky, Maria Kolos, Dmitry Ulyanov, Victor Lempitsky
TL;DR
This work introduces neural point-based graphics that treat each point in a raw point cloud as carrying a learnable descriptor for local geometry and appearance, paired with a neural renderer to synthesize photorealistic views from novel viewpoints without surface reconstruction. A progressive multi-scale rendering framework and joint optimization of descriptors and the renderer enable robust view synthesis across diverse scenes, including those challenging for meshing. They demonstrate competitive or superior performance to mesh-based and neural rendering baselines on ScanNet, human portraits, and object scenes, with particular advantages on thin structures and incomplete geometries. The approach also supports scene editing and discusses practical considerations like anti-aliasing, signaling a scalable, mesh-free path for high-quality neural rendering from simple point primitives.
Abstract
We present a new point-based approach for modeling the appearance of real scenes. The approach uses a raw point cloud as the geometric representation of a scene, and augments each point with a learnable neural descriptor that encodes local geometry and appearance. A deep rendering network is learned in parallel with the descriptors, so that new views of the scene can be obtained by passing the rasterizations of a point cloud from new viewpoints through this network. The input rasterizations use the learned descriptors as point pseudo-colors. We show that the proposed approach can be used for modeling complex scenes and obtaining their photorealistic views, while avoiding explicit surface estimation and meshing. In particular, compelling results are obtained for scene scanned using hand-held commodity RGB-D sensors as well as standard RGB cameras even in the presence of objects that are challenging for standard mesh-based modeling.
