Small, Versatile and Mighty: A Range-View Perception Framework
Qiang Meng, Xiao Wang, JiaBao Wang, Liujiang Yan, Ke Wang
TL;DR
This paper tackles 3D perception from LiDAR by leveraging the range-view representation to achieve both efficiency and multi-task capability. It introduces a lightweight, fully convolutional framework called Small, Versatile, Mighty (SVM) that integrates Perspective Centric Label Assignment (PCLA) and View Adaptive Regression (VAR) to boost 3D detection while enabling semantic and panoptic segmentation without extra modules. The approach achieves state-of-the-art performance among range-view detectors on the Waymo Open Dataset, with notable gains for the vehicle class and strong segmentation results. These findings demonstrate the viability of range-view representations for real-time, multi-task LiDAR perception in autonomous driving.
Abstract
Despite its compactness and information integrity, the range view representation of LiDAR data rarely occurs as the first choice for 3D perception tasks. In this work, we further push the envelop of the range-view representation with a novel multi-task framework, achieving unprecedented 3D detection performances. Our proposed Small, Versatile, and Mighty (SVM) network utilizes a pure convolutional architecture to fully unleash the efficiency and multi-tasking potentials of the range view representation. To boost detection performances, we first propose a range-view specific Perspective Centric Label Assignment (PCLA) strategy, and a novel View Adaptive Regression (VAR) module to further refine hard-to-predict box properties. In addition, our framework seamlessly integrates semantic segmentation and panoptic segmentation tasks for the LiDAR point cloud, without extra modules. Among range-view-based methods, our model achieves new state-of-the-art detection performances on the Waymo Open Dataset. Especially, over 10 mAP improvement over convolutional counterparts can be obtained on the vehicle class. Our presented results for other tasks further reveal the multi-task capabilities of the proposed small but mighty framework.
