LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection
Jingyu Song, Lingjun Zhao, Katherine A. Skinner
TL;DR
LiRaFusion addresses the performance gap in LiDAR-radar fusion for 3D object detection by introducing an early fusion voxel feature encoder and a channel-wise adaptive gated middle fusion. The architecture enables robust cross-modality feature extraction and fusion within a BEV framework, and it demonstrates superior performance on the nuScenes dataset against existing LR and LCR methods, with notable gains at long range and under rain. It also shows that LiDAR and radar modalities can be effectively combined with a gated fusion strategy, and that the approach is extendable to LiDAR-camera-radar fusion. The work highlights practical impact for safer autonomous driving under adverse conditions and offers a reusable fusion backbone for multi-modality detectors.
Abstract
We propose LiRaFusion to tackle LiDAR-radar fusion for 3D object detection to fill the performance gap of existing LiDAR-radar detectors. To improve the feature extraction capabilities from these two modalities, we design an early fusion module for joint voxel feature encoding, and a middle fusion module to adaptively fuse feature maps via a gated network. We perform extensive evaluation on nuScenes to demonstrate that LiRaFusion leverages the complementary information of LiDAR and radar effectively and achieves notable improvement over existing methods.
