Table of Contents
Fetching ...

HybridINR-PCGC: Hybrid Lossless Point Cloud Geometry Compression Bridging Pretrained Model and Implicit Neural Representation

Wenjie Huang, Qi Yang, Shuting Xia, He Huang, Zhu Li, Yiling Xu

TL;DR

This work proposes HybridINR-PCGC, a novel hybrid framework that bridges the pretrained model and INR, and proposes a supervised model compression module to further supervise and minimize the bitrate of the enhancement layer parameters.

Abstract

Learning-based point cloud compression presents superior performance to handcrafted codecs. However, pretrained-based methods, which are based on end-to-end training and expected to generalize to all the potential samples, suffer from training data dependency. Implicit neural representation (INR) based methods are distribution-agnostic and more robust, but they require time-consuming online training and suffer from the bitstream overhead from the overfitted model. To address these limitations, we propose HybridINR-PCGC, a novel hybrid framework that bridges the pretrained model and INR. Our framework retains distribution-agnostic properties while leveraging a pretrained network to accelerate convergence and reduce model overhead, which consists of two parts: the Pretrained Prior Network (PPN) and the Distribution Agnostic Refiner (DAR). We leverage the PPN, designed for fast inference and stable performance, to generate a robust prior for accelerating the DAR's convergence. The DAR is decomposed into a base layer and an enhancement layer, and only the enhancement layer needed to be packed into the bitstream. Finally, we propose a supervised model compression module to further supervise and minimize the bitrate of the enhancement layer parameters. Based on experiment results, HybridINR-PCGC achieves a significantly improved compression rate and encoding efficiency. Specifically, our method achieves a Bpp reduction of approximately 20.43% compared to G-PCC on 8iVFB. In the challenging out-of-distribution scenario Cat1B, our method achieves a Bpp reduction of approximately 57.85% compared to UniPCGC. And our method exhibits a superior time-rate trade-off, achieving an average Bpp reduction of 15.193% relative to the LINR-PCGC on 8iVFB.

HybridINR-PCGC: Hybrid Lossless Point Cloud Geometry Compression Bridging Pretrained Model and Implicit Neural Representation

TL;DR

This work proposes HybridINR-PCGC, a novel hybrid framework that bridges the pretrained model and INR, and proposes a supervised model compression module to further supervise and minimize the bitrate of the enhancement layer parameters.

Abstract

Learning-based point cloud compression presents superior performance to handcrafted codecs. However, pretrained-based methods, which are based on end-to-end training and expected to generalize to all the potential samples, suffer from training data dependency. Implicit neural representation (INR) based methods are distribution-agnostic and more robust, but they require time-consuming online training and suffer from the bitstream overhead from the overfitted model. To address these limitations, we propose HybridINR-PCGC, a novel hybrid framework that bridges the pretrained model and INR. Our framework retains distribution-agnostic properties while leveraging a pretrained network to accelerate convergence and reduce model overhead, which consists of two parts: the Pretrained Prior Network (PPN) and the Distribution Agnostic Refiner (DAR). We leverage the PPN, designed for fast inference and stable performance, to generate a robust prior for accelerating the DAR's convergence. The DAR is decomposed into a base layer and an enhancement layer, and only the enhancement layer needed to be packed into the bitstream. Finally, we propose a supervised model compression module to further supervise and minimize the bitrate of the enhancement layer parameters. Based on experiment results, HybridINR-PCGC achieves a significantly improved compression rate and encoding efficiency. Specifically, our method achieves a Bpp reduction of approximately 20.43% compared to G-PCC on 8iVFB. In the challenging out-of-distribution scenario Cat1B, our method achieves a Bpp reduction of approximately 57.85% compared to UniPCGC. And our method exhibits a superior time-rate trade-off, achieving an average Bpp reduction of 15.193% relative to the LINR-PCGC on 8iVFB.
Paper Structure (25 sections, 10 equations, 13 figures, 14 tables, 1 algorithm)

This paper contains 25 sections, 10 equations, 13 figures, 14 tables, 1 algorithm.

Figures (13)

  • Figure 1: Principle comparison of Pretrained-based, INR-based, and our Hybrid approaches, highlighting the model components utilized during the Pretrain, Overfit, and Test phases.
  • Figure 2: Concept in octree.
  • Figure 3: Left: The framework includes the Pipeline. Top Right: The architecture of the PPN, which iteratively generates a robust prior. Bottom Right: The architecture of the DAR, which consumes the prior from the PPN to refine the final occupancy prediction $P_{occ}^{i+1,j}$.
  • Figure 4: Component of PPN. Left: FEM and IRN. FEM(k, C) denotes FEM with $k$ IRN Module and $C$ hidden channel. Right: Processing of Feature Masking.
  • Figure 5: Left: Principle of SMC. Right: Quantization (Q) module and Dequantization (DQ) module in SMC. AE is an arithmetic encoder, AD is an arithmetic decoder.
  • ...and 8 more figures