Using Causality to Infer Coordinated Attacks in Social Media
Isura Manchanayaka, Zainab Razia Zaidi, Shanika Karunasekera, Christopher Leckie
TL;DR
This work reframes coordinated manipulation on social media as a causal inference problem and applies Convergent Cross Mapping (CCM) to user activity traces, using topic modeling to improve efficiency. Evaluating on the IRA dataset and a COVID-19 Twitter case study, the authors show that CCM can identify coordinated accounts with F1 scores up to 75.3% and uncover influential community members, offering a causality-based alternative to theme- or network-centric methods. The key contributions are a CCM-based pipeline for detecting coordination, an optimization that uses Non-negative Matrix Factorization (NMF) for topic clustering to reduce the search space, and comparisons against baselines such as Granger causality. The approach advances practical detection of coordinated campaigns and provides insight into leadership and information diffusion, with implications for platform defense and content moderation. Overall, the paper shows CCM’s promise for uncovering causal structures of coordinated behavior in large-scale social media data, highlighting both performance gains and scalability challenges.
Abstract
The rise of social media has a dark side: fake accounts are easy to create, and misinformation can be disseminated through coordinated attacks. Existing methods to identify such attacks often rely on thematic similarities or network-based approaches, overlooking the intricate causal relationships that underlie coordinated actions. This work introduces a novel approach for detecting coordinated attacks using Convergent Cross Mapping (CCM), a technique that infers causality from temporal relationships between user activity. We build on the theoretical framework of CCM by incorporating topic modelling as a basis for further optimizing its performance. We apply CCM to real-world data from the infamous Internet Research Agency (IRA) attack on US elections, achieving F1 scores up to 75.3% in identifying coordinated accounts. Furthermore, we analyse the output of our model to identify the most influential users in a community. We also apply our model to a case study involving COVID-19 anti-vax related discussions on Twitter. Our results demonstrate the effectiveness of our model in uncovering causal structures of coordinated behaviour, offering a promising avenue for mitigating the threat of malicious campaigns on social media platforms.
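To make the core idea concrete, the following is a minimal sketch of textbook CCM on two synthetic time series. It is not the authors' pipeline: the function names `delay_embed` and `ccm_skill`, the embedding dimension `E`, the lag `tau`, and the coupled logistic maps standing in for user activity traces are all assumptions chosen for illustration. The sketch reconstructs a shadow manifold from one series by delay embedding and measures how well nearest neighbours on that manifold estimate the other series.

```python
import numpy as np


def delay_embed(series, E, tau):
    """Return lagged vectors (s[t], s[t - tau], ..., s[t - (E - 1) * tau]) as rows."""
    series = np.asarray(series, dtype=float)
    start = (E - 1) * tau
    cols = [series[start - i * tau: len(series) - i * tau] for i in range(E)]
    return np.column_stack(cols)


def ccm_skill(source, target, E=2, tau=1):
    """Cross-map skill: correlation between `source` and its estimate
    obtained from the shadow manifold of `target`.

    In CCM, high skill here is read as evidence that `source` drives `target`.
    """
    source = np.asarray(source, dtype=float)
    manifold = delay_embed(target, E, tau)
    truth = source[(E - 1) * tau:]  # source values aligned with manifold rows

    # Pairwise distances between manifold points; exclude self-matches.
    dists = np.linalg.norm(manifold[:, None, :] - manifold[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)

    # E + 1 nearest neighbours of each point, with exponential weights
    # scaled by the distance to the closest neighbour.
    nn = np.argsort(dists, axis=1)[:, :E + 1]
    d = np.take_along_axis(dists, nn, axis=1)
    weights = np.exp(-d / np.maximum(d[:, [0]], 1e-12))
    weights /= weights.sum(axis=1, keepdims=True)

    estimate = (weights * truth[nn]).sum(axis=1)
    return float(np.corrcoef(estimate, truth)[0, 1])


# Hypothetical stand-in for two activity traces: x evolves on its own,
# while y is weakly driven by x (one-way coupled logistic maps).
n = 500
x = np.empty(n)
y = np.empty(n)
x[0], y[0] = 0.4, 0.2
for t in range(n - 1):
    x[t + 1] = x[t] * (3.8 - 3.8 * x[t])
    y[t + 1] = y[t] * (3.5 - 3.5 * y[t] - 0.1 * x[t])

skill_x_drives_y = ccm_skill(x, y)  # estimate x from y's shadow manifold
skill_y_drives_x = ccm_skill(y, x)  # estimate y from x's shadow manifold
```

A single skill value is not sufficient on its own. In standard CCM practice, a causal link is inferred only when the skill increases and converges as more of the series is used to build the manifold, so a fuller version would repeat the computation over growing library lengths.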
