NIV-SSD: Neighbor IoU-Voting Single-Stage Object Detector From Point Cloud
Shuai Liu, Di Wang, Quan Wang, Kai Huang
TL;DR
The paper tackles the persistent misalignment between localization quality and classification confidence in LiDAR-based 3D object detection by introducing a post-processing Neighbor IoU-Voting (NIV) strategy that rectifies confidence using neighbor-derived statistics, without altering network architecture. It pairs NIV with an object resampling augmentation to address the imbalance between easy and difficult objects, producing an efficient single-stage detector called NIV-SSD. Through extensive experiments on KITTI, ONCE, and Waymo, NIV-SSD demonstrates improved confidence calibration, competitive accuracy, and favorable speed-accuracy trade-offs, validating the generality of NIV across datasets. The approach offers practical impact by providing a plug-in rectification method and a simple augmentation to boost performance in real-time autonomous driving systems.
Abstract
Previous single-stage detectors typically suffer the misalignment between localization accuracy and classification confidence. To solve the misalignment problem, we introduce a novel rectification method named neighbor IoU-voting (NIV) strategy. Typically, classification and regression are treated as separate branches, making it challenging to establish a connection between them. Consequently, the classification confidence cannot accurately reflect the regression quality. NIV strategy can serve as a bridge between classification and regression branches by calculating two types of statistical data from the regression output to correct the classification confidence. Furthermore, to alleviate the imbalance of detection accuracy for complete objects with dense points (easy objects) and incomplete objects with sparse points (difficult objects), we propose a new data augmentation scheme named object resampling. It undersamples easy objects and oversamples difficult objects by randomly transforming part of easy objects into difficult objects. Finally, combining the NIV strategy and object resampling augmentation, we design an efficient single-stage detector termed NIV-SSD. Extensive experiments on several datasets indicate the effectiveness of the NIV strategy and the competitive performance of the NIV-SSD detector. The code will be available at https://github.com/Say2L/NIV-SSD.
