Identifying Coordinated Activities on Online Social Networks Using Contrast Pattern Mining
Isura Manchanayaka, Zainab Zaidi, Shanika Karunasekera, Christopher Leckie
TL;DR
The paper tackles the problem of detecting coordinated activity on online social networks by viewing coordination as abnormal growth in behavioural patterns over time. It introduces a framework built on contrast pattern mining using EPClose to extract closed contrast patterns from two time windows: a background window $D_b$ and a target window $D_t$, and quantifies pattern growth via $gr(X,D_t,D_b)=\frac{supp(X,D_t)}{supp(X,D_b)}$ and $supp_\delta(X,D_t,D_b)$. The approach is evaluated on real-world data from Russia's IRA influencing the 2016 USA elections, augmented with noisy background data, achieving up to $F1$ scores of $86\%$ and surpassing several baselines by more than $10\%$ in performance, while remaining memory-efficient. Contributions include (1) formalizing contrast-pattern mining for coordination detection, (2) proposing a practical framework with real-data application, and (3) conducting extensive parameter and ablation analyses to show the importance of temporal attributes. The work highlights the potential of using growth in behavioural patterns to identify coordinating users and points to future work on incorporating richer attributes (sentiment, topics) and automatic time-interval determination for real-time deployment.
Abstract
The proliferation of misinformation and disinformation on social media networks has become increasingly concerning. With a significant portion of the population using social media on a regular basis, there are growing efforts by malicious organizations to manipulate public opinion through coordinated campaigns. Current methods for identifying coordinated user accounts typically rely on either similarities in user behaviour, latent coordination in activity traces, or classification techniques. In our study, we propose a framework based on the hypothesis that coordinated users will demonstrate abnormal growth in their behavioural patterns over time relative to the wider population. Specifically, we utilize the EPClose algorithm to extract contrasting patterns of user behaviour during a time window of malicious activity, which we then compare to a historical time window. We evaluated the effectiveness of our approach using real-world data, and our results show a minimum increase of 10% in the F1 score compared to existing approaches.
