Compression of Large-Scale 3D Point Clouds Based on Joint Optimization of Point Sampling and Feature Extraction
Jae-Young Yim, Jae-Young Sim
TL;DR
This work tackles the compression of large-scale 3D LiDAR point clouds by addressing the inefficiency of separating point sampling from feature extraction. It introduces a fully end-to-end framework that jointly optimizes a learnable point sampling network and adaptive feature aggregation within a range-image partitioning scheme, coupled with an expansion-and-fusion decoder for robust reconstruction. The method employs a hyperprior-based entropy model to approximate rate and defines a rate-distortion objective L = D + λR, achieving significantly better rate-distortion performance than state-of-the-art baselines on SemanticKITTI and nuScenes, including strong cross-dataset generalization. The approach reduces encoding time and memory while delivering higher fidelity reconstructions, making LS3DPC compression more practical for large-scale applications.
Abstract
Large-scale 3D point clouds (LS3DPC) obtained by LiDAR scanners require huge storage space and transmission bandwidth due to a large amount of data. The existing methods of LS3DPC compression separately perform rule-based point sampling and learnable feature extraction, and hence achieve limited compression performance. In this paper, we propose a fully end-to-end training framework for LS3DPC compression where the point sampling and the feature extraction are jointly optimized in terms of the rate and distortion losses. To this end, we first make the point sampling module to be trainable such that an optimal position of the downsampled point is estimated via aggregation with learnable weights. We also develop a reliable point reconstruction scheme that adaptively aggregates the expanded candidate points to refine the positions of upsampled points. Experimental results evaluated on the SemanticKITTI and nuScenes datasets show that the proposed method achieves significantly higher compression ratios compared with the existing state-of-the-art methods.
