Depth-Aware Range Image-Based Model for Point Cloud Segmentation
Bike Chen, Antti Tikanmäki, Juha Röning
TL;DR
The paper addresses the challenge of range image–based PCS by exploiting the implicit but ordered depth information that standard backbones overlook. It introduces the Depth-Aware Module (DAM), which fuses global context via GAP with a sinusoidal positional encoding to produce depth-aware channel scales, and integrates DAM into the last block of each stage of Fast FMVNet V3. Through extensive experiments on SemanticKITTI, nuScenes, and SemanticPOSS, the approach achieves strong mIoU scores with a favorable speed-accuracy trade-off (e.g., $mIoU$ up to 69.6% on SemanticKITTI at 25.5 FPS) and demonstrates the generalizability of DAM to other range image–based models. The work suggests that depth-aware channel recalibration is a practical path to improve real-time PCS for outdoor robotics and opens avenues for range-image–based semantic SLAM.
Abstract
Point cloud segmentation (PCS) aims to separate points into different and meaningful groups. The task plays an important role in robotics because PCS enables robots to understand their physical environments directly. To process sparse and large-scale outdoor point clouds in real time, range image-based models are commonly adopted. However, in a range image, the lack of explicit depth information inevitably causes some separate objects in 3D space to touch each other, bringing difficulty for the range image-based models in correctly segmenting the objects. Moreover, previous PCS models are usually derived from the existing color image-based models and unable to make full use of the implicit but ordered depth information inherent in the range image, thereby achieving inferior performance. In this paper, we propose Depth-Aware Module (DAM) and Fast FMVNet V3. DAM perceives the ordered depth information in the range image by explicitly modelling the interdependence among channels. Fast FMVNet V3 incorporates DAM by integrating it into the last block in each architecture stage. Extensive experiments conducted on SemanticKITTI, nuScenes, and SemanticPOSS demonstrate that DAM brings a significant improvement for Fast FMVNet V3 with negligible computational cost.
