Rate-Distortion Optimized Skip Coding of Region Adaptive Hierarchical Transform Coefficients for MPEG G-PCC
Zehan Wang, Yuxuan Wei, Hui Yuan, Wei Zhang, Peng Li
TL;DR
The paper tackles bitrate inefficiency in Region-Adaptive Hierarchical Transform (RAHT) within MPEG G-PCC by introducing an adaptive skip coding scheme for RAHT residuals. It develops a rate–distortion (RD) framework with an adaptive Lagrange multiplier to decide, per color component, whether to skip residuals in the last $k\in\{1,2,3,4\}$ RAHT layers, computing distortion in the transform domain via Parseval. Experimental results on GeS-TM v4.0 under MPEG CTC show notable BD-rate reductions, especially under inter-frame prediction, with average improvements of $-3.50\%$, $-5.56\%$, and $-4.18\%$ for Luma, Cb, Cr respectively. The approach yields substantial bitrate savings at low bitrates (e.g., CGI sequence queen) while incurring minimal decoding overhead, indicating practical impact for dynamic point cloud compression.
Abstract
Three-dimensional (3D) point clouds are becoming more and more popular for representing 3D objects and scenes. Due to limited network bandwidth, efficient compression of 3D point clouds is crucial. To tackle this challenge, the Moving Picture Experts Group (MPEG) is actively developing the Geometry-based Point Cloud Compression (G-PCC) standard, incorporating innovative methods to optimize compression, such as the Region-Adaptive Hierarchical Transform (RAHT) nestled within a layer-by-layer octree-tree structure. Nevertheless, a notable problem still exists in RAHT, i.e., the proportion of zero residuals in the last few RAHT layers leads to unnecessary bitrate consumption. To address this problem, we propose an adaptive skip coding method for RAHT, which adaptively determines whether to encode the residuals of the last several layers or not, thereby improving the coding efficiency. In addition, we propose a rate-distortion cost calculation method associated with an adaptive Lagrange multiplier. Experimental results demonstrate that the proposed method achieves average Bjøntegaard rate improvements of -3.50%, -5.56%, and -4.18% for the Luma, Cb, and Cr components, respectively, on dynamic point clouds, when compared with the state-of-the-art G-PCC reference software under the common test conditions recommended by MPEG.
