Unified Geometry and Color Compression Framework for Point Clouds via Generative Diffusion Priors
Tianxin Huang, Gim Hee Lee
TL;DR
This work presents a novel, test-time unified framework for compressing both geometry and color in 3D point clouds by leveraging a pretrained diffusion prior (Point-E) and prompt tuning to produce sparse seeds. The pipeline uses patch division, seed optimization, and a loss that accounts for permutation invariance to enable accurate reconstruction through diffusion denoising, all without dataset-specific training. Across colored objects, indoor scenes, and geometry-only baselines, the method yields notable geometry gains and competitive color performance, with strong generalization and applicability to offline tasks. While compression time is higher due to prompt tuning, the approach eliminates the need for training on target datasets and benefits from potential speedups via optimization techniques.
Abstract
With the growth of 3D applications and the rapid increase in sensor-collected 3D point cloud data, there is a rising demand for efficient compression algorithms. Most existing learning-based compression methods handle geometry and color attributes separately, treating them as distinct tasks, making these methods challenging to apply directly to point clouds with colors. Besides, the limited capacities of training datasets also limit their generalizability across points with different distributions. In this work, we introduce a test-time unified geometry and color compression framework of 3D point clouds. Instead of training a compression model based on specific datasets, we adapt a pre-trained generative diffusion model to compress original colored point clouds into sparse sets, termed 'seeds', using prompt tuning. Decompression is then achieved through multiple denoising steps with separate sampling processes. Experiments on objects and indoor scenes demonstrate that our method has superior performances compared to existing baselines for the compression of geometry and color.
