RadarDistill: Boosting Radar-based Object Detection Performance via Knowledge Distillation from LiDAR Features
Geonho Bang, Kwangjin Choi, Jisong Kim, Dongsuk Kum, Jun Won Choi
TL;DR
RadarDistill tackles the challenge of noisy, sparse radar data for 3D object detection by transferring richly structured LiDAR representations into the radar domain through a three-component knowledge distillation (KD) framework. Cross-Modality Alignment (CMA) densifies radar bird's-eye-view (BEV) features so that knowledge transfers more effectively across modalities, while Activation-based Feature Distillation (AFD) and Proposal-based Feature Distillation (PFD) selectively align low- and high-level features in informative regions and object proposals. The method uses LiDAR only as a teacher during training and is tested with a radar-input PillarNet baseline, achieving state-of-the-art radar-only performance on nuScenes and notable gains in radar-camera fusion. This approach offers a practical path to leverage LiDAR-derived semantics during training to improve radar-based perception in adverse conditions.
Abstract
The inherently noisy and sparse characteristics of radar data pose challenges in finding effective representations for 3D object detection. In this paper, we propose RadarDistill, a novel knowledge distillation (KD) method, which can improve the representation of radar data by leveraging LiDAR data. RadarDistill successfully transfers desirable characteristics of LiDAR features into radar features using three key components: Cross-Modality Alignment (CMA), Activation-based Feature Distillation (AFD), and Proposal-based Feature Distillation (PFD). CMA enhances the density of radar features by employing multiple layers of dilation operations, effectively addressing the challenge of inefficient knowledge transfer from LiDAR to radar. AFD selectively transfers knowledge based on regions of the LiDAR features, with a specific focus on areas where activation intensity exceeds a predefined threshold. PFD similarly guides the radar network to selectively mimic features from the LiDAR network within the object proposals. Our comparative analyses conducted on the nuScenes dataset demonstrate that RadarDistill achieves state-of-the-art (SOTA) performance for the radar-only object detection task, recording 20.5% in mAP and 43.7% in NDS. In addition, RadarDistill significantly improves the performance of the camera-radar fusion model.
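To make the idea behind AFD concrete, the following is a minimal sketch of activation-masked feature distillation, not the authors' implementation. It assumes that "activation intensity" is the mean absolute teacher (LiDAR) activation across channels at each BEV cell and that the penalty is a mean squared error; the abstract specifies neither. The names `activation_mask` and `masked_distill_loss`, the threshold value, and the toy feature maps are all hypothetical.

```python
from typing import List

# A BEV feature map indexed as [channel][row][col].
FeatureMap = List[List[List[float]]]


def activation_mask(teacher: FeatureMap, threshold: float) -> List[List[bool]]:
    """Mark BEV cells whose mean absolute teacher activation exceeds the threshold.

    Assumption: intensity = mean |activation| over channels.
    """
    channels = len(teacher)
    height, width = len(teacher[0]), len(teacher[0][0])
    mask = []
    for r in range(height):
        row = []
        for c in range(width):
            intensity = sum(abs(teacher[k][r][c]) for k in range(channels)) / channels
            row.append(intensity > threshold)
        mask.append(row)
    return mask


def masked_distill_loss(student: FeatureMap, teacher: FeatureMap, threshold: float) -> float:
    """Mean squared error between student and teacher, restricted to active cells.

    Assumption: MSE is used here only for illustration.
    """
    mask = activation_mask(teacher, threshold)
    channels = len(teacher)
    total, count = 0.0, 0
    for r, row in enumerate(mask):
        for c, active in enumerate(row):
            if not active:
                continue  # cells with weak teacher activation do not contribute
            for k in range(channels):
                diff = student[k][r][c] - teacher[k][r][c]
                total += diff * diff
            count += channels
    return total / count if count else 0.0


# Toy single-channel 2x2 maps: the teacher is active only on the diagonal.
teacher_bev = [[[2.0, 0.0], [0.0, 1.0]]]
student_bev = [[[1.0, 5.0], [5.0, 1.0]]]
loss = masked_distill_loss(student_bev, teacher_bev, threshold=0.5)
```

In this toy case the student's large off-diagonal values (5.0) fall where the teacher is inactive, so they are ignored and the loss is 0.5, which comes only from the two active cells.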
