Super4DR: 4D Radar-centric Self-supervised Odometry and Gaussian-based Map Optimization
Zhiheng Li, Weihua Wang, Qiang Shen, Yichen Zhao, Zheng Fang
TL;DR
<3-5 sentence high-level summary> Super4DR tackles robust localization and mapping in adverse environments by centering 4D radar data. It couples a cluster-aware, multi-level self-supervised odometry network with a Gaussian-based map optimizer that uses depth-guided ground completion, geometry-aware densification, and multi-view regularization to reconstruct dense, image-renderable maps from sparse radar points. The approach delivers significant gains in self-supervised odometry, narrows the gap to supervised methods, and achieves superior radar map quality and rendering fidelity compared to prior Gaussian-based methods. The framework demonstrates cross-modal applicability, including radar-thermal reconstruction, and is poised to enhance real-world autonomous systems operating under poor visibility conditions.
Abstract
Conventional SLAM systems using visual or LiDAR data often struggle in poor lighting and severe weather. Although 4D radar is suited for such environments, its sparse and noisy point clouds hinder accurate odometry estimation, while the radar maps suffer from obscure and incomplete structures. Thus, we propose Super4DR, a 4D radar-centric framework for learning-based odometry estimation and gaussian-based map optimization. First, we design a cluster-aware odometry network that incorporates object-level cues from the clustered radar points for inter-frame matching, alongside a hierarchical self-supervision mechanism to overcome outliers through spatio-temporal consistency, knowledge transfer, and feature contrast. Second, we propose using 3D gaussians as an intermediate representation, coupled with a radar-specific growth strategy, selective separation, and multi-view regularization, to recover blurry map areas and those undetected based on image texture. Experiments show that Super4DR achieves a 67% performance gain over prior self-supervised methods, nearly matches supervised odometry, and narrows the map quality disparity with LiDAR while enabling multi-modal image rendering.
