Representation Learning of Point Cloud Upsampling in Global and Local Inputs
Tongxu Zhang, Bei Wang
TL;DR
To address sparse and noisy point clouds, this paper introduces ReLPU, a parallel global-local input learning framework for point cloud upsampling. ReLPU uses two parallel autoencoders to extract global and local features (via Average Segment and patch inputs), fusing them before a shared decoder, and it provides gradient-based saliency maps in spherical coordinates for interpretability. The approach is general and backbones-compatible, validated on PU1K and ABC-10k where it consistently improves geometric fidelity and robustness over state-of-the-art methods; ablations and visualizations confirm the benefit of parallel fusion and the interpretability of feature contributions. The work advances practical point cloud upsampling by delivering a flexible, explainable framework that can accommodate future backbone architectures.
Abstract
In recent years, point cloud upsampling has been widely applied in tasks such as 3D reconstruction and object recognition. This study proposed a novel framework, ReLPU, which enhances upsampling performance by explicitly learning from both global and local structural features of point clouds. Specifically, we extracted global features from uniformly segmented inputs (Average Segments) and local features from patch-based inputs of the same point cloud. These two types of features were processed through parallel autoencoders, fused, and then fed into a shared decoder for upsampling. This dual-input design improved feature completeness and cross-scale consistency, especially in sparse and noisy regions. Our framework was applied to several state-of-the-art autoencoder-based networks and validated on standard datasets. Experimental results demonstrated consistent improvements in geometric fidelity and robustness. In addition, saliency maps confirmed that parallel global-local learning significantly enhanced the interpretability and performance of point cloud upsampling.
