RadarPillars: Efficient Object Detection from 4D Radar Point Clouds
Alexander Musiat, Laurenz Reichardt, Michael Schulze, Oliver Wasenmüller
TL;DR
RadarPillars tackles efficient 4D radar object detection by designing a pillar-based network that exploits velocity information through a novel velocity decomposition and a pillar-level self-attention mechanism. The approach introduces 4D Radar Features with $v_r$ decomposed into $v_{r,x}$ and $v_{r,y}$, plus velocity-offset features, and a PillarAttention layer that treats each pillar as a token to achieve a global receptive field with low computation. A uniform backbone scaling strategy further aligns model capacity with the extreme sparsity of radar data, yielding a lightweight model of $0.27$M parameters and $1.99$ GFLOPS that achieves state-of-the-art results on View-of-Delft while enabling real-time edge performance. Together, these design choices substantively improve radar-only detection efficiency and accuracy, and point to future work in end-to-end transformer-based radar perception and broader sensor fusion applications.
Abstract
Automotive radar systems have evolved to provide not only range, azimuth and Doppler velocity, but also elevation data. This additional dimension allows for the representation of 4D radar as a 3D point cloud. As a result, existing deep learning methods for 3D object detection, which were initially developed for LiDAR data, are often applied to these radar point clouds. However, this neglects the special characteristics of 4D radar data, such as the extreme sparsity and the optimal utilization of velocity information. To address these gaps in the state-of-the-art, we present RadarPillars, a pillar-based object detection network. By decomposing radial velocity data, introducing PillarAttention for efficient feature extraction, and studying layer scaling to accommodate radar sparsity, RadarPillars significantly outperform state-of-the-art detection results on the View-of-Delft dataset. Importantly, this comes at a significantly reduced parameter count, surpassing existing methods in terms of efficiency and enabling real-time performance on edge devices.
