Enabling Visual Recognition at Radio Frequency
Haowen Lai, Gaoxiang Luo, Yifei Liu, Mingmin Zhao
TL;DR
PanoRadar introduces a LiDAR-like RF imaging system that uses a rotating mmWave radar to form a dense cylindrical aperture, enabling 3D RF imaging and first-time visual recognition tasks such as surface normal estimation, semantic segmentation, and object detection. It combines robust motion estimation to compensate for platform movement, learning-based elevation-resolution enhancement using 2D convolutions, and cross-modal supervision from LiDAR to recover high-fidelity 3D structures. The approach demonstrates accurate range imaging (MAE ≈ $15.76\text{ cm}$ with median $3.39\text{ cm}$), competitive surface normal and semantic/detection metrics, and strong cross-building generalization across 12 buildings, culminating in a practical, low-cost RF perception pipeline. The work presents a comprehensive dataset of 11,033 synchronized RF–LiDAR scenes (461 GB) and opens avenues for RF-based perception in robotics and harsh-environment settings, potentially complementing or replacing LiDAR in certain applications.
Abstract
This paper introduces PanoRadar, a novel RF imaging system that brings RF resolution close to that of LiDAR, while providing resilience against conditions challenging for optical signals. Our LiDAR-comparable 3D imaging results enable, for the first time, a variety of visual recognition tasks at radio frequency, including surface normal estimation, semantic segmentation, and object detection. PanoRadar utilizes a rotating single-chip mmWave radar, along with a combination of novel signal processing and machine learning algorithms, to create high-resolution 3D images of the surroundings. Our system accurately estimates robot motion, allowing for coherent imaging through a dense grid of synthetic antennas. It also exploits the high azimuth resolution to enhance elevation resolution using learning-based methods. Furthermore, PanoRadar tackles 3D learning via 2D convolutions and addresses challenges due to the unique characteristics of RF signals. Our results demonstrate PanoRadar's robust performance across 12 buildings.
