Implicit Neural Compression of Point Clouds
Hongning Ruan, Yulin Shao, Qianqian Yang, Liang Zhao, Zhaoyang Zhang, Dusit Niyato
TL;DR
This work introduces NeRC$^3$, an implicit neural representation-based framework for compressing point clouds by learning two neural fields: one for voxel occupancy and one for voxel attributes. It extends to dynamic scenes with i-NeRC$^3$, r-NeRC$^3$, c-NeRC$^3$, and 4D-NeRC$^3$, including a 4D spatio-temporal INR that jointly encodes multiple frames. The approach achieves state-of-the-art or competitive rate-distortion performance against traditional MPEG PCC standards and prior INR-based PCC methods, particularly excelling in dynamic geometry compression and joint geometry-attribute tasks. Despite higher encoding complexity due to per-instance network training, the methods demonstrate strong qualitative reconstructions and scalable mechanisms to manage temporal redundancy, with practical variants that balance performance and speed. This work opens a new direction for INR-based PCC by integrating geometry and attributes in neural space and by addressing temporal redundancy directly in 4D representations.
Abstract
Point clouds have gained prominence across numerous applications due to their ability to accurately represent 3D objects and scenes. However, efficiently compressing unstructured, high-precision point cloud data remains a significant challenge. In this paper, we propose NeRC$^3$, a novel point cloud compression framework that leverages implicit neural representations (INRs) to encode both geometry and attributes of dense point clouds. Our approach employs two coordinate-based neural networks: one maps spatial coordinates to voxel occupancy, while the other maps occupied voxels to their attributes, thereby implicitly representing the geometry and attributes of a voxelized point cloud. The encoder quantizes and compresses network parameters alongside auxiliary information required for reconstruction, while the decoder reconstructs the original point cloud by inputting voxel coordinates into the neural networks. Furthermore, we extend our method to dynamic point cloud compression through techniques that reduce temporal redundancy, including a 4D spatio-temporal representation termed 4D-NeRC$^3$. Experimental results validate the effectiveness of our approach: For static point clouds, NeRC$^3$ outperforms octree-based G-PCC standard and existing INR-based methods. For dynamic point clouds, 4D-NeRC$^3$ achieves superior geometry compression performance compared to the latest G-PCC and V-PCC standards, while matching state-of-the-art learning-based methods. It also demonstrates competitive performance in joint geometry and attribute compression.
