Diff-PCC: Diffusion-based Neural Compression for 3D Point Clouds
Kai Liu, Kang You, Pan Gao
TL;DR
This work introduces Diff-PCC, a diffusion-based lossy point cloud compression framework that employs a dual-space latent encoding to extract complementary low- and high-frequency shape information and a diffusion-based generator that denoises noisy point clouds under the guidance of these latents. By integrating a hyperprior-driven rate model and a rate-distortion objective, Diff-PCC achieves superior compression performance compared to G-PCC and recent learning-based methods, with substantial BD-PSNR gains at ultra-low bitrates and improved perceptual quality. The method advances neural point cloud compression by addressing the limitations of Gaussian priors in VAEs and leveraging diffusion models for high-fidelity reconstruction, while also introducing architecture elements like AdaLN-based conditioning and cross-frequency feature fusion. Although effective, the approach incurs higher coding complexity and currently targets smaller-scale point clouds, pointing to future work in acceleration and scalability for broader 3D workloads.
Abstract
Stable diffusion networks have emerged as a groundbreaking development for their ability to produce realistic and detailed visual content. This characteristic renders them ideal decoders, capable of producing high-quality and aesthetically pleasing reconstructions. In this paper, we introduce the first diffusion-based point cloud compression method, dubbed Diff-PCC, to leverage the expressive power of the diffusion model for generative and aesthetically superior decoding. Different from the conventional autoencoder fashion, a dual-space latent representation is devised in this paper, in which a compressor composed of two independent encoding backbones is considered to extract expressive shape latents from distinct latent spaces. At the decoding side, a diffusion-based generator is devised to produce high-quality reconstructions by considering the shape latents as guidance to stochastically denoise the noisy point clouds. Experiments demonstrate that the proposed Diff-PCC achieves state-of-the-art compression performance (e.g., 7.711 dB BD-PSNR gains against the latest G-PCC standard at ultra-low bitrate) while attaining superior subjective quality. Source code will be made publicly available.
