TextureCrop: Enhancing Synthetic Image Detection through Texture-based Cropping
Despina Konstantinidou, Christos Koutlis, Symeon Papadopoulos
TL;DR
TextureCrop addresses the challenge of detecting synthetic content in high-resolution images by introducing a texture-driven pre-processing step that selects a fixed number of high-frequency crops via a sliding window and aggregates their detector scores. By prioritizing texture-rich regions using Global Histogram Entropy and averaging per-crop logits, TextureCrop consistently improves SID performance across multiple detectors and high-resolution datasets, while reducing memory and compute relative to full-image processing. The method yields meaningful gains in BA, AP, and AUC (e.g., up to 12.1% BA, 18.2% AP, and 14.9% AUC over resizing) and demonstrates robustness across datasets like Forensynths, Synthbuster, and TWIGMA, with notable efficiency improvements on large images. Its simplicity and compatibility with existing SID models position TextureCrop as a practical, plug-in enhancement for real-world synthetic image mitigation and surveillance workflows.
Abstract
Generative AI technologies produce increasingly realistic imagery, which, despite its potential for creative applications, can also be misused to produce misleading and harmful content. This renders Synthetic Image Detection (SID) methods essential for identifying AI-generated content online. State-of-the-art SID methods typically resize or center-crop input images due to architectural or computational constraints, which hampers the detection of artifacts that appear in high-resolution images. To address this limitation, we propose TextureCrop, an image pre-processing component that can be plugged in any pre-trained SID model to improve its performance. By focusing on high-frequency image parts where generative artifacts are prevalent, TextureCrop enhances SID performance with manageable memory requirements. Experimental results demonstrate a consistent improvement in AUC across various detectors by 6.1% compared to center cropping and by 15% compared to resizing, across high-resolution images from the Forensynths, Synthbuster and TWIGMA datasets. Code available at https : //github.com/mever-team/texture-crop.
