Adversarial Botometer: Adversarial Analysis for Social Bot Detection
Shaghayegh Najari, Davood Rafiee, Mostafa Salehi, Reza Farahbakhsh
TL;DR
This work tackles the challenge of detecting social bots under adversarial, human-like manipulation by GenAI. It introduces Adversarial Botometer, a GAN-inspired framework where a Seq2Seq bot generator competes with a Contextual-LSTM detector in adversarial training, including a pre-training phase and an MCST-guided minimax objective. The evaluation on Cresci 2017 and Midterm 2018 spans three scenarios—adversarial play, data poisoning, and domain/model explanations—revealing that detector robustness varies across bot types and domains and highlighting the need for dynamic, adversarially trained defenses. The study further uses SHAP to interpret feature importance and compares multiple modeling approaches, offering practical guidance for building more resilient bot-detection systems in real-world, evolving OSN environments.
Abstract
Social bots play a significant role in many online social networks (OSN) as they imitate human behavior. This fact raises difficult questions about their capabilities and potential risks. Given the recent advances in Generative AI (GenAI), social bots are capable of producing highly realistic and complex content that mimics human creativity. As the malicious social bots emerge to deceive people with their unrealistic content, identifying them and distinguishing the content they produce has become an actual challenge for numerous social platforms. Several approaches to this problem have already been proposed in the literature, but the proposed solutions have not been widely evaluated. To address this issue, we evaluate the behavior of a text-based bot detector in a competitive environment where some scenarios are proposed: \textit{First}, the tug-of-war between a bot and a bot detector is examined. It is interesting to analyze which party is more likely to prevail and which circumstances influence these expectations. In this regard, we model the problem as a synthetic adversarial game in which a conversational bot and a bot detector are engaged in strategic online interactions. \textit{Second}, the bot detection model is evaluated under attack examples generated by a social bot; to this end, we poison the dataset with attack examples and evaluate the model performance under this condition. \textit{Finally}, to investigate the impact of the dataset, a cross-domain analysis is performed. Through our comprehensive evaluation of different categories of social bots using two benchmark datasets, we were able to demonstrate some achivement that could be utilized in future works.
