RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries

Zhihong Tan; Jiayi Wang; Huiying Shi; Binyuan Huang; Hongchen Wei; Zhenzhong Chen

RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries

Zhihong Tan, Jiayi Wang, Huiying Shi, Binyuan Huang, Hongchen Wei, Zhenzhong Chen

TL;DR

RSFAKE-1M addresses the urgent need for robust detection of diffusion-generated remote sensing forgeries by creating a large-scale benchmark with 500k fake and 500k real RS images generated by ten diffusion models under six conditions. Real images are sourced from fMoW to match distribution, enabling controlled evaluation of generalization and robustness. Across extensive experiments, existing detectors struggle under domain shifts, but models trained on RSFAKE-1M show improved cross-dataset generalization and resilience to common degradations, highlighting the dataset’s utility for advancing RS forgery detection. This benchmark therefore provides a foundation for developing next-generation detectors tailored to diffusion-based RS forgeries and real-world deployment challenges.

Abstract

Detecting forged remote sensing images is becoming increasingly critical, as such imagery plays a vital role in environmental monitoring, urban planning, and national security. While diffusion models have emerged as the dominant paradigm for image generation, their impact on remote sensing forgery detection remains underexplored. Existing benchmarks primarily target GAN-based forgeries or focus on natural images, limiting progress in this critical domain. To address this gap, we introduce RSFAKE-1M, a large-scale dataset of 500K forged and 500K real remote sensing images. The fake images are generated by ten diffusion models fine-tuned on remote sensing data, covering six generation conditions such as text prompts, structural guidance, and inpainting. This paper presents the construction of RSFAKE-1M along with a comprehensive experimental evaluation using both existing detectors and unified baselines. The results reveal that diffusion-based remote sensing forgeries remain challenging for current methods, and that models trained on RSFAKE-1M exhibit notably improved generalization and robustness. Our findings underscore the importance of RSFAKE-1M as a foundation for developing and evaluating next-generation forgery detection approaches in the remote sensing domain. The dataset and other supplementary materials are available at https://huggingface.co/datasets/TZHSW/RSFAKE/.

RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries

TL;DR

Abstract

RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (3)