Unveiling Behavioral Differences in Bilingual Information Operations: A Network-Based Approach
Bowen Yi
TL;DR
This study develops a language-aware, network-based framework to detect information-operation (IO) drivers on Twitter during the 2024 U.S. election by fusing Co-Domain, Co-Hashtag, and Text Similarity networks and applying unsupervised clustering. It demonstrates that English and Spanish IO drivers exhibit distinct topics, domains, and engagement patterns, and reveals that bilingual users play unique bridging roles with language-dependent behaviors. The authors also introduce a novel, label-free evaluation method for clustering quality and show that fixed edge-filtering or node-pruning parameters may hamper cross-language performance, underscoring the need for language-specific tuning. Overall, the work highlights the importance of culturally and linguistically adaptable IO detection to mitigate influence campaigns and lays groundwork for multilingual, human-centered IO detection systems, with open-source code and data available on GitHub.
Abstract
Twitter has become a pivotal platform for conducting information operations (IOs), particularly during high-stakes political events. In this study, we analyze over a million tweets about the 2024 U.S. presidential election to explore an under-studied area: the behavioral differences of IO drivers from English- and Spanish-speaking communities. Using similarity graphs constructed from behavioral patterns, we identify IO drivers in both languages and evaluate the clustering quality of these graphs in an unsupervised setting. Our analysis demonstrates how different network dismantling strategies, such as node pruning and edge filtering, can impact clustering quality and the identification of coordinated IO drivers. We also reveal significant differences in the topics and political indicators between English and Spanish IO drivers. Additionally, we investigate bilingual users who post in both languages, systematically uncovering their distinct roles and behaviors compared to monolingual users. These findings underscore the importance of robust, culturally and linguistically adaptable IO detection methods to mitigate the risks of influence campaigns on social media. Our code and data are available on GitHub: https://github.com/bowenyi-pierre/humans-lab-hackathon-24.
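To make the pipeline concrete, the following is a minimal sketch of how behavioral similarity layers might be fused, filtered, pruned, and clustered. It is illustrative only and is not the implementation in the repository: the Jaccard weighting, the max-weight fusion rule, the thresholds, the use of connected components as the clustering step, and all function names and toy data are assumptions made for this example. The Text Similarity layer is omitted for brevity; it could be supplied as a third edge dictionary of the same shape.

```python
from collections import defaultdict
from itertools import combinations


def co_occurrence_edges(user_items):
    """Weight each user pair by the Jaccard overlap of their item sets (assumed metric)."""
    edges = {}
    for u, v in combinations(sorted(user_items), 2):
        shared = user_items[u] & user_items[v]
        union = user_items[u] | user_items[v]
        if shared:
            edges[(u, v)] = len(shared) / len(union)
    return edges


def fuse(*layers):
    """Fuse layers by keeping the maximum weight per user pair (assumed fusion rule)."""
    fused = defaultdict(float)
    for layer in layers:
        for pair, weight in layer.items():
            fused[pair] = max(fused[pair], weight)
    return dict(fused)


def filter_edges(edges, min_weight):
    """Edge filtering: drop edges whose weight is below min_weight."""
    return {pair: w for pair, w in edges.items() if w >= min_weight}


def prune_nodes(edges, min_degree):
    """Node pruning: drop edges that touch a node with degree below min_degree."""
    degree = defaultdict(int)
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    return {
        (u, v): w
        for (u, v), w in edges.items()
        if degree[u] >= min_degree and degree[v] >= min_degree
    }


def connected_components(edges):
    """Group users into clusters; a simple stand-in for the clustering step."""
    adjacency = defaultdict(set)
    for u, v in edges:
        adjacency[u].add(v)
        adjacency[v].add(u)
    seen, components = set(), []
    for start in sorted(adjacency):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in component:
                continue
            component.add(node)
            stack.extend(adjacency[node] - component)
        seen |= component
        components.append(component)
    return components


# Hypothetical toy data: the domains and hashtags each user shared.
domains = {"a": {"x.com", "y.com"}, "b": {"x.com", "y.com"},
           "c": {"y.com", "z.com"}, "d": {"q.com"}}
hashtags = {"a": {"#vote"}, "b": {"#vote"}, "c": {"#news"}, "d": {"#news"}}

fused = fuse(co_occurrence_edges(domains), co_occurrence_edges(hashtags))

strict = connected_components(filter_edges(fused, min_weight=0.5))
loose = connected_components(filter_edges(fused, min_weight=0.3))
pruned = connected_components(prune_nodes(filter_edges(fused, 0.3), min_degree=2))

print(strict)
print(loose)
print(pruned)
```

On this toy data, the stricter edge threshold splits the users into two clusters, the looser one merges all four into a single cluster, and node pruning then removes the weakly connected user `d`. This is the kind of parameter sensitivity that motivates tuning thresholds per language rather than fixing them globally.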
