HFI: A unified framework for training-free detection and implicit watermarking of latent diffusion model generated images
Sungik Choi, Sungwoo Park, Jaehoon Lee, Seunghyun Kim, Stanley Jungkyu Choi, Moontae Lee
TL;DR
The paper tackles the challenge of training-free detection of AI-generated images from latent diffusion models (LDMs), addressing generalization across diverse generators. It introduces High-frequency Influence (HFI), a score that leverages the autoencoder of an LDM as a downsampling/upsampling kernel and quantifies aliasing in reconstructions via a low-pass filter, with an ensemble approach over multiple autoencoders. Empirically, HFI outperforms existing training-free detectors on challenging benchmarks (GenImage, SynthBuster, DiffusionFace) and remains competitive with training-based methods, while offering substantial test-time efficiency. Additionally, HFI can serve as an implicit watermarking mechanism for tracing images produced by a specified LDM, delivering near-perfect attribution with large speedups over optimization-based baselines. The work advances practical, scalable detection and ownership tracing for LDM-generated content in real-world, data-free settings.
Abstract
Dramatic advances in the quality of the latent diffusion models (LDMs) also led to the malicious use of AI-generated images. While current AI-generated image detection methods assume the availability of real/AI-generated images for training, this is practically limited given the vast expressibility of LDMs. This motivates the training-free detection setup where no related data are available in advance. The existing LDM-generated image detection method assumes that images generated by LDM are easier to reconstruct using an autoencoder than real images. However, we observe that this reconstruction distance is overfitted to background information, leading the current method to underperform in detecting images with simple backgrounds. To address this, we propose a novel method called HFI. Specifically, by viewing the autoencoder of LDM as a downsampling-upsampling kernel, HFI measures the extent of aliasing, a distortion of high-frequency information that appears in the reconstructed image. HFI is training-free, efficient, and consistently outperforms other training-free methods in detecting challenging images generated by various generative models. We also show that HFI can successfully detect the images generated from the specified LDM as a means of implicit watermarking. HFI outperforms the best baseline method while achieving magnitudes of
