Redefining Radar Segmentation: Simultaneous Static-Moving Segmentation and Ego-Motion Estimation using Radar Point Clouds
Simin Zhu, Satish Ravindran, Alexander Yarovoy, Francesco Fioranelli
TL;DR
This work redefines radar perception by jointly performing static/moving segmentation and ego-motion estimation directly from unprocessed radar point clouds, without requiring cloud aggregation or external odometry. It introduces a lightweight neural network with spatialMLP and temporal GRU backbones and a dual-head prediction scheme complemented by two weight-update heads, enabling instantaneous predictions over a moving window of frames ( $T=8$ ). Using RadarScenes and novel evaluation protocols, the approach achieves a moving-object IoU of about $0.86$ and F1 around $0.92$, while delivering a precise ego-motion estimate with $RTE_{50}$ near $1.8$ m, all with roughly $0.15$ million parameters. The results demonstrate the feasibility and practical impact of single-frame radar-based dual-task perception, offering immediate benefits for downstream tasks such as tracking, SLAM, and mapping in adverse weather and low-light conditions.
Abstract
Conventional radar segmentation research has typically focused on learning category labels for different moving objects. Although fundamental differences between radar and optical sensors lead to differences in the reliability of predicting accurate and consistent category labels, a review of common radar perception tasks in automotive reveals that determining whether an object is moving or static is a prerequisite for most tasks. To fill this gap, this study proposes a neural network based solution that can simultaneously segment static and moving objects from radar point clouds. Furthermore, since the measured radial velocity of static objects is correlated with the motion of the radar, this approach can also estimate the instantaneous 2D velocity of the moving platform or vehicle (ego motion). However, despite performing dual tasks, the proposed method employs very simple yet effective building blocks for feature extraction: multi layer perceptrons (MLPs) and recurrent neural networks (RNNs). In addition to being the first of its kind in the literature, the proposed method also demonstrates the feasibility of extracting the information required for the dual task directly from unprocessed point clouds, without the need for cloud aggregation, Doppler compensation, motion compensation, or any other intermediate signal processing steps. To measure its performance, this study introduces a set of novel evaluation metrics and tests the proposed method using a challenging real world radar dataset, RadarScenes. The results show that the proposed method not only performs well on the dual tasks, but also has broad application potential in other radar perception tasks.
