Efficient Implicit Neural Compression of Point Clouds via Learnable Activation in Latent Space
Yichi Zhang, Qianqian Yang
TL;DR
This work tackles the challenge of efficiently compressing LiDAR-derived point clouds by reframing PCC as neural parameter compression. It introduces PICO, a two-stage framework that separately optimizes geometry and attribute INRs, and LeAFNet, a Kolmogorov-Arnold Network–inspired backbone with learnable activations to improve implicit function fitting while remaining parameter-efficient. The method integrates adaptive sampling, a dynamic occupancy threshold, voxel-wise color regression with nearest-neighbor mapping, and a quantization-then-entropy coding pipeline (DeepCABAC) to produce compact bitstreams. Empirical results on the 8iVFB dataset show substantial RD gains over MPEG G-PCC and V-PCC for both geometry and joint geometry-attribute compression, validating the approach and its potential to influence future intelligent PCC standards.
Abstract
Implicit Neural Representations (INRs), also known as neural fields, have emerged as a powerful paradigm in deep learning, parameterizing continuous spatial fields using coordinate-based neural networks. In this paper, we propose \textbf{PICO}, an INR-based framework for static point cloud compression. Unlike prevailing encoder-decoder paradigms, we decompose the point cloud compression task into two separate stages: geometry compression and attribute compression, each with distinct INR optimization objectives. Inspired by Kolmogorov-Arnold Networks (KANs), we introduce a novel network architecture, \textbf{LeAFNet}, which leverages learnable activation functions in the latent space to better approximate the target signal's implicit function. By reformulating point cloud compression as neural parameter compression, we further improve compression efficiency through quantization and entropy coding. Experimental results demonstrate that \textbf{LeAFNet} outperforms conventional MLPs in INR-based point cloud compression. Furthermore, \textbf{PICO} achieves superior geometry compression performance compared to the current MPEG point cloud compression standard, yielding an average improvement of $4.92$ dB in D1 PSNR. In joint geometry and attribute compression, our approach exhibits highly competitive results, with an average PCQM gain of $2.7 \times 10^{-3}$.
