Multi-task SAR Image Processing via GAN-based Unsupervised Manipulation
Xuran Hu, Mingzhe Zhu, Ziqiang Xu, Zhenpeng Feng, Ljubisa Stankovic
TL;DR
The paper tackles the challenge of unsupervised, interpretable, multi-task SAR image processing by introducing GUE, a framework that uncouples semantic directions in StyleGAN latent spaces and trains a reconstructor to link latent edits to outcomes. GUE enables despeckling, background segmentation, rotation editing, and guided SAR target recognition in a single training run without labeled data, achieved via a decoupled, orthogonal latent direction matrix and a second-stage reconstructor. The approach yields strong despeckling performance, competitive segmentation results, and enhanced recognition through rotation semantics, demonstrating the practical value of latent space editing for SAR. These results suggest a promising label-free path toward versatile SAR image processing and interpretation, with potential for extending long-range edits as GAN inversion and latent space models improve.
Abstract
Generative Adversarial Networks (GANs) have shown tremendous potential in synthesizing a large number of realistic SAR images by learning patterns in the data distribution. Some GANs can achieve image editing by introducing latent codes, demonstrating significant promise in SAR image processing. Compared to traditional SAR image processing methods, editing based on GAN latent space control is entirely unsupervised, allowing image processing to be conducted without any labeled data. Additionally, the information extracted from the data is more interpretable. This paper proposes a novel SAR image processing framework called GAN-based Unsupervised Editing (GUE), aiming to address the following two issues: (1) disentangling semantic directions in the GAN latent space and finding meaningful directions; (2) establishing a comprehensive SAR image processing framework while achieving multiple image processing functions. In the implementation of GUE, we decompose the entangled semantic directions in the GAN latent space by training a carefully designed network. Moreover, we can accomplish multiple SAR image processing tasks (including despeckling, localization, auxiliary identification, and rotation editing) in a single training process without any form of supervision. Extensive experiments validate the effectiveness of the proposed method.
