Addressing Vulnerabilities in AI-Image Detection: Challenges and Proposed Solutions
Justin Jiang
TL;DR
This work addresses the challenge of reliably detecting AI-generated images in the face of evolving generative models. It systematically evaluates CNN-based detectors and DenseNets on CIFAKE-derived datasets produced by Stable Diffusion across versions, with variations such as Gaussian blur, LoRA, and prompt changes. The results reveal strong version-specific overfitting and vulnerability to common degradations, while DenseNets offer improved robustness and higher accuracy. The findings highlight the need for diverse, version-robust training data and architecture choices to build detectors that generalize in real-world, evolving-generation environments.
Abstract
The rise of advanced AI models like Generative Adversarial Networks (GANs) and diffusion models such as Stable Diffusion has made the creation of highly realistic images accessible, posing risks of misuse in misinformation and manipulation. This study evaluates the effectiveness of convolutional neural networks (CNNs), as well as DenseNet architectures, for detecting AI-generated images. Using variations of the CIFAKE dataset, including images generated by different versions of Stable Diffusion, we analyze the impact of updates and modifications such as Gaussian blurring, prompt text changes, and Low-Rank Adaptation (LoRA) on detection accuracy. The findings highlight vulnerabilities in current detection methods and propose strategies to enhance the robustness and reliability of AI-image detection systems.
