Radon Implicit Field Transform (RIFT): Learning Scenes from Radar Signals
Daqian Bao, Alex Saad-Falcon, Justin Romberg
TL;DR
RIFT introduces a Radon Implicit Field Transform that fuses the Generalized Radon Transform forward model with an Implicit Neural Representation to learn a scene's complex reflectivity directly from radar signals. By modeling $ ho(oldsymbol{x})$ with an INR and optimizing against radar measurements through a data-fidelity objective that leverages an $L_1$ loss for stability, RIFT enables high-quality scene reconstruction and viewpoint interpolation from a fraction of the conventional data. The work defines new metrics, including $p$-RMSE for phase (viewpoint interpolation) and $m$-SSIM/$m$-COS/$tIoU$ for magnitude-based scene recovery, and demonstrates that RIFT achieves substantial data-efficiency gains across simple and complex scenes, a weak-target case in the far-field, and a real-simulator scenario. These results highlight the potential for compact, data-efficient radar scene learning while noting stability challenges and the need for real-world datasets to fully validate the approach.
Abstract
Data acquisition in array signal processing (ASP) is costly because achieving high angular and range resolutions necessitates large antenna apertures and wide frequency bandwidths, respectively. The data requirements for ASP problems grow multiplicatively with the number of viewpoints and frequencies, significantly increasing the burden of data collection, even for simulation. Implicit Neural Representations (INRs) -- neural network-based models of 3D objects and scenes -- offer compact and continuous representations with minimal radar data. They can interpolate to unseen viewpoints and potentially address the sampling cost in ASP problems. In this work, we select Synthetic Aperture Radar (SAR) as a case from ASP and propose Radon Implicit Field Transform (RIFT). RIFT consists of two components: a classical forward model for radar (Generalized Radon Transform, GRT), and an INR based scene representation learned from radar signals. This method can be extended to other ASP problems by replacing the GRT with appropriate algorithms corresponding to different data modalities. In our experiments, we first synthesize radar data using the GRT. We then train the INR model on this synthetic data by minimizing the reconstruction error of the radar signal. After training, we render the scene using the trained INR and evaluate our scene representation against the ground truth scene. Due to the lack of existing benchmarks, we introduce two main new error metrics: phase-Root Mean Square Error (p-RMSE) for radar signal interpolation, and magnitude-Structural Similarity Index measure(m-SSIM) for scene reconstruction. These metrics adapt traditional error measures to account for the complex nature of radar signals. Compared to traditional scene models in radar signal processing, with only 10% data footprint, our RIFT model achieves up to 188% improvement in scene reconstruction.
