Filling Missing Values Matters for Range Image-Based Point Cloud Segmentation
Bike Chen, Chen Gong, Juha Röning
TL;DR
This work tackles the detrimental impact of missing values in range-image representations for LiDAR PCS by introducing Scan Unfolding++ (SU++) to reduce missing data, and a simple yet effective range-dependent $K$-nearest neighbor interpolation ($K$NNI) to fill residual gaps. Building on these inputs, the Filling Missing Values Network (FMVNet) and its Fast variant achieve state-of-the-art speed-accuracy trade-offs for range-image based segmentation, demonstrated across SemanticKITTI, SemanticPOSS, and nuScenes. The authors provide rigorous evaluation of projection strategies, interpolation techniques, and architectural choices, establishing SU++ and $K$NNI as broadly beneficial for range-image tasks. The methods offer practical impact for robot perception and navigation, with potential extension to moving-object segmentation and multimodal systems.
Abstract
Point cloud segmentation (PCS) plays an essential role in robot perception and navigation tasks. To efficiently understand large-scale outdoor point clouds, their range image representation is commonly adopted. This image-like representation is compact and structured, making range image-based PCS models practical. However, undesirable missing values in the range images damage the shapes and patterns of objects. This problem creates difficulty for the models in learning coherent and complete geometric information from the objects. Consequently, the PCS models only achieve inferior performance. Delving deeply into this issue, we find that the use of unreasonable projection approaches and deskewing scans mainly leads to unwanted missing values in the range images. Besides, almost all previous works fail to consider filling in the unexpected missing values in the PCS task. To alleviate this problem, we first propose a new projection method, namely scan unfolding++ (SU++), to avoid massive missing values in the generated range images. Then, we introduce a simple yet effective approach, namely range-dependent $K$-nearest neighbor interpolation ($K$NNI), to further fill in missing values. Finally, we introduce the Filling Missing Values Network (FMVNet) and Fast FMVNet. Extensive experimental results on SemanticKITTI, SemanticPOSS, and nuScenes datasets demonstrate that by employing the proposed SU++ and $K$NNI, existing range image-based PCS models consistently achieve better performance than the baseline models. Besides, both FMVNet and Fast FMVNet achieve state-of-the-art performance in terms of the speed-accuracy trade-off. The proposed methods can be applied to other range image-based tasks and practical applications.
