SeBot: Structural Entropy Guided Multi-View Contrastive Learning for Social Bot Detection
Yingguang Yang, Qi Wu, Buyun He, Hao Peng, Renyu Yang, Zhifeng Hao, Yong Liao
TL;DR
SeBot tackles social bot detection on multi-relational social graphs by integrating structural entropy-guided encoding trees to reveal hierarchical communities, a heterophily-aware encoder to handle adversarial bots, and multi-view self-supervised contrastive learning to unify diverse representations. It introduces three coupled representations from different views, optimizes them with a combined objective $\,\mathcal{L}=\mathcal{L}_{CE}+\lambda_1\mathcal{L}_{NCL}+\lambda_2\mathcal{L}_{SCL}$, and demonstrates superior performance over state-of-the-art baselines on TwiBot-20 and MGTAB. The work also provides thorough ablations, sensitivity analyses, and visualizations to validate the contributions of encoding trees, relational aggregation, and contrastive learning, highlighting robustness to adversarial behavior and improved generalization for real-world deployment.
Abstract
Recent advancements in social bot detection have been driven by the adoption of Graph Neural Networks. The social graph, constructed from social network interactions, contains benign and bot accounts that influence each other. However, previous graph-based detection methods that follow the transductive message-passing paradigm may not fully utilize hidden graph information and are vulnerable to adversarial bot behavior. The indiscriminate message passing between nodes from different categories and communities results in excessively homogeneous node representations, ultimately reducing the effectiveness of social bot detectors. In this paper, we propose SEBot, a novel multi-view graph-based contrastive learning-enabled social bot detector. In particular, we use structural entropy as an uncertainty metric to optimize the entire graph's structure and subgraph-level granularity, revealing the implicitly existing hierarchical community structure. And we design an encoder to enable message passing beyond the homophily assumption, enhancing robustness to adversarial behaviors of social bots. Finally, we employ multi-view contrastive learning to maximize mutual information between different views and enhance the detection performance through multi-task learning. Experimental results demonstrate that our approach significantly improves the performance of social bot detection compared with SOTA methods.
