Deepfake Geography: Detecting AI-Generated Satellite Images

Mansur Yerzhanuly

Deepfake Geography: Detecting AI-Generated Satellite Images

Mansur Yerzhanuly

TL;DR

This paper tackles the authenticity of satellite imagery in the era of AI-generated content. It systematically compares CNNs and Vision Transformers on a large RGB dataset derived from DM-AER and FSI, finding ViTs superior in accuracy and robustness. Explainability analyses using Grad-CAM for CNNs and Chefer's transformer attribution reveal complementary detection cues and bolster trust in the models. The work has practical implications for journalism, environmental science, and defense, and points to future work in multispectral/SAR data and frequency-domain artifact detection.

Abstract

The rapid advancement of generative models such as StyleGAN2 and Stable Diffusion poses a growing threat to the authenticity of satellite imagery, which is increasingly vital for reliable analysis and decision-making across scientific and security domains. While deepfake detection has been extensively studied in facial contexts, satellite imagery presents distinct challenges, including terrain-level inconsistencies and structural artifacts. In this study, we conduct a comprehensive comparison between Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) for detecting AI-generated satellite images. Using a curated dataset of over 130,000 labeled RGB images from the DM-AER and FSI datasets, we show that ViTs significantly outperform CNNs in both accuracy (95.11 percent vs. 87.02 percent) and overall robustness, owing to their ability to model long-range dependencies and global semantic structures. We further enhance model transparency using architecture-specific interpretability methods, including Grad-CAM for CNNs and Chefer's attention attribution for ViTs, revealing distinct detection behaviors and validating model trustworthiness. Our results highlight the ViT's superior performance in detecting structural inconsistencies and repetitive textural patterns characteristic of synthetic imagery. Future work will extend this research to multispectral and SAR modalities and integrate frequency-domain analysis to further strengthen detection capabilities and safeguard satellite imagery integrity in high-stakes applications.

Deepfake Geography: Detecting AI-Generated Satellite Images

TL;DR

Abstract

Deepfake Geography: Detecting AI-Generated Satellite Images

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (10)