Detecting GAN-generated Imagery using Color Cues
Scott McCloskey, Michael Albright
TL;DR
This paper tackles the challenge of distinguishing GAN-generated imagery from real camera images to curb online disinformation. It analyzes the GAN generator’s color formation and normalization steps to derive two cues: depth-to-RGB color coupling and intensity-constraining normalization. Two detectors are proposed: a color-forensics approach using chromaticity histograms with an INH classifier, and a saturation-based approach using exposure-frequency features with a linear SVM. On MFC18 datasets, the saturation-based method achieves notable discrimination (AUC up to ≈0.70 for fully GAN images and ≈0.61 on mixed content), while color-based signals are weak, underscoring the value of architecture-aware forensics as GANs evolve and the need to complement artifact-based methods.
Abstract
Image forensics is an increasingly relevant problem, as it can potentially address online disinformation campaigns and mitigate problematic aspects of social media. Of particular interest, given its recent successes, is the detection of imagery produced by Generative Adversarial Networks (GANs), e.g. `deepfakes'. Leveraging large training sets and extensive computing resources, recent work has shown that GANs can be trained to generate synthetic imagery which is (in some ways) indistinguishable from real imagery. We analyze the structure of the generating network of a popular GAN implementation, and show that the network's treatment of color is markedly different from a real camera in two ways. We further show that these two cues can be used to distinguish GAN-generated imagery from camera imagery, demonstrating effective discrimination between GAN imagery and real camera images used to train the GAN.
