Voxel-based Point Cloud Geometry Compression with Space-to-Channel Context
Bojun Liu, Yangzhi Ma, Ao Luo, Li Li, Dong Liu
TL;DR
This work addresses the inefficiencies of voxel-based point cloud geometry compression, particularly the limited receptive field at high bit depths. It introduces stage-wise Space-to-Channel (S2C) context modeling for dense and low-level sparse data and a level-wise S2C framework with Geometry Residual Coding (GRC) and Residual Probability Approximation (RPA) for high-level sparse data, aided by a spherical coordinate representation. By transforming spatial expansion into channel expansion, the method expands the receptive field without upsampling, achieving better bit-rate savings and lower encoding/decoding complexity than state-of-the-art voxel-based approaches. Experimental results across dense and sparse datasets validate the effectiveness and efficiency of S2C, highlighting practical improvements for real-world point cloud compression tasks.
Abstract
Voxel-based methods are among the most efficient for point cloud geometry compression, particularly with dense point clouds. However, they face limitations due to a restricted receptive field, especially when handling high-bit depth point clouds. To overcome this issue, we introduce a stage-wise Space-to-Channel (S2C) context model for both dense point clouds and low-level sparse point clouds. This model utilizes a channel-wise autoregressive strategy to effectively integrate neighborhood information at a coarse resolution. For high-level sparse point clouds, we further propose a level-wise S2C context model that addresses resolution limitations by incorporating Geometry Residual Coding (GRC) for consistent-resolution cross-level prediction. Additionally, we use the spherical coordinate system for its compact representation and enhance our GRC approach with a Residual Probability Approximation (RPA) module, which features a large kernel size. Experimental results show that our S2C context model not only achieves bit savings while maintaining or improving reconstruction quality but also reduces computational complexity compared to state-of-the-art voxel-based compression methods.
