Low-Rank Adaptation of Pre-Trained Stable Diffusion for Rigid-Body Target ISAR Imaging

Boan Zhang; Hang Dong; Jiongge Zhang; Long Tian; Rongrong Wang; Zhenhua Wu; Xiyang Liu; Hongwei Liu

TL;DR

The paper tackles the low-resolution challenge of RID-based ISAR imaging by introducing LoRA-SD, a texture-aware super-resolution approach that fine-tunes a pre-trained Stable Diffusion Turbo model with low-rank adapters and adversarial training to enhance time-frequency representations. The method maps low-resolution TFRs to high-resolution counterparts via a constrained, parameter-efficient fine-tuning scheme, enabling sharper, denoised ISAR images and improved frequency estimation, without being limited by the classical uncertainty principle. Experiments on simulated and measured radar data show that LoRA-SD outperforms STFT, SBL, and MF baselines in RMSE across a range of SNRs, while maintaining feasible runtime and memory, and demonstrating strong generalization to real-world data. The approach holds promise for improved 3D pose estimation and robust ISAR imaging of rigid-body targets under complex motions such as spin and precession.

Abstract

Traditional range-instantaneous Doppler (RID) methods for rigid-body target imaging often suffer from low resolution due to the limitations of time-frequency analysis (TFA). To address this challenge, we focus on obtaining high-resolution time-frequency representations (TFRs) from their low-resolution counterparts. Recognizing that the curve features of TFRs are a specific type of texture feature, we argue that pre-trained generative models such as Stable Diffusion (SD) are well suited for enhancing TFRs, thanks to their powerful capability in capturing texture representations. Building on this insight, we propose a novel inverse synthetic aperture radar (ISAR) imaging method for rigid-body targets that leverages low-rank adaptation (LoRA) of a pre-trained SD model. Our approach adopts the basic structure and pre-trained parameters of SD Turbo while incorporating additional linear operations for LoRA and adversarial training to achieve super-resolution and noise suppression. We then integrate LoRA-SD into RID-based ISAR imaging, enabling sharply focused and denoised imaging with super-resolution capabilities. We evaluate our method using both simulated and real radar data. The experimental results demonstrate the superiority of our approach in frequency estimation and ISAR imaging compared to traditional methods. Notably, the generalization capability is verified by training on simulated radar data and testing on measured radar data.
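
The "additional linear operations for LoRA" mentioned in the abstract can be illustrated with a minimal sketch of a generic low-rank adapter on one frozen linear layer. This is not the authors' implementation: the function names, the row-vector convention, the scaling factor alpha / r, and all sizes below are assumptions chosen for illustration, and the abstract does not say which SD Turbo layers are adapted or with what rank.

```python
def matmul(a, b):
    """Multiply two matrices stored as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)]
            for row in a]


def lora_linear(x, w_frozen, lora_a, lora_b, alpha):
    """Apply a frozen linear layer plus a low-rank update.

    Computes x @ W + (alpha / r) * (x @ A) @ B without forming the
    merged weight. Shapes (all hypothetical):
    x: (n, d_in), w_frozen: (d_in, d_out),
    lora_a: (d_in, r), lora_b: (r, d_out).
    Only lora_a and lora_b would be trained; w_frozen stays fixed.
    """
    rank = len(lora_b)
    scale = alpha / rank
    base = matmul(x, w_frozen)                  # pre-trained path
    update = matmul(matmul(x, lora_a), lora_b)  # two small linear maps
    return [[b + scale * u for b, u in zip(b_row, u_row)]
            for b_row, u_row in zip(base, update)]


def trainable_fraction(d_in, d_out, rank):
    """Adapter parameter count relative to the frozen weight it modifies."""
    return rank * (d_in + d_out) / (d_in * d_out)


if __name__ == "__main__":
    x = [[1.0, 2.0]]
    w = [[1.0, 0.0], [0.0, 1.0]]   # frozen weight (identity, for clarity)
    a = [[1.0], [1.0]]             # down-projection, rank 1
    b = [[2.0, 3.0]]               # up-projection, rank 1
    print(lora_linear(x, w, a, b, alpha=1.0))
    print(trainable_fraction(320, 320, 4))
```

If the up-projection is initialized to zero, a common choice in LoRA implementations, the adapted layer initially reproduces the pre-trained layer exactly, so fine-tuning starts from the original model's behavior.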
