Characteristics and prevalence of fake social media profiles with AI-generated faces
Kai-Cheng Yang, Danishjeet Singh, Filippo Menczer
TL;DR
The paper tackles the lack of empirical evidence on fake social media profiles that use AI-generated faces. It introduces the TwitterGAN dataset (1,420 accounts) and a light yet effective detection approach based on the consistent eye placement of GAN faces, complemented by human annotation, with prevalence estimated from a 1% random Twitter sample. It further validates and anchors its analysis with the AcademicGAN dataset and provides practical heuristics, releasing code and data to enable broader investigation. The findings reveal that GAN-generated profiles participate in scams, spam, and coordinated amplification, highlighting emerging threats to online integrity and motivating calls for improved detection, regulation, and AI literacy.
Abstract
Recent advancements in generative artificial intelligence (AI) have raised concerns about their potential to create convincing fake social media accounts, but empirical evidence is lacking. In this paper, we present a systematic analysis of Twitter (X) accounts using human faces generated by Generative Adversarial Networks (GANs) for their profile pictures. We present a dataset of 1,420 such accounts and show that they are used to spread scams, spam, and amplify coordinated messages, among other inauthentic activities. Leveraging a feature of GAN-generated faces -- consistent eye placement -- and supplementing it with human annotation, we devise an effective method for identifying GAN-generated profiles in the wild. Applying this method to a random sample of active Twitter users, we estimate a lower bound for the prevalence of profiles using GAN-generated faces between 0.021% and 0.044% -- around 10K daily active accounts. These findings underscore the emerging threats posed by multimodal generative AI. We release the source code of our detection method and the data we collect to facilitate further investigation. Additionally, we provide practical heuristics to assist social media users in recognizing such accounts.
