PopSim: Social Network Simulation for Social Media Popularity Prediction
Yijun Liu, Wu Liu, Xiaoyan Gu, Allen He, Weiping Wang, Yongdong Zhang
TL;DR
PopSim introduces a simulation-based paradigm for Social Media Popularity Prediction by deploying an LLM-based multi-agent social network sandbox to model dynamic UGC propagation. A dual-channel social-mean-field mechanism captures both textual context and agent opinions, while a multi-source information aggregation module unifies heterogeneous metadata for a unified prediction. Empirical results on two real-world datasets show consistent improvements over state-of-the-art inductive methods, with notable reductions in prediction error and gains in rank correlation; ablations validate the contributions of propagation simulation and SMF interactions. The framework offers a scalable, dynamic, and interpretable approach to SMPP that can inform broader social media analytics and recommendation tasks.
Abstract
Accurately predicting the popularity of user-generated content (UGC) is essential for advancing social media analytics and recommendation systems. Existing approaches typically follow an inductive paradigm, where researchers train static models on historical data for popularity prediction. However, the UGC propagation is inherently a dynamic process, and static modeling based on historical features fails to capture the complex interactions and nonlinear evolution. In this paper, we propose PopSim, a novel simulation-based paradigm for social media popularity prediction (SMPP). Unlike the inductive paradigm, PopSim leverages the large language models (LLMs)-based multi-agent social network sandbox to simulate UGC propagation dynamics for popularity prediction. Specifically, to effectively model the UGC propagation process in the network, we design a social-mean-field-based agent interaction mechanism, which models the dual-channel and bidirectional individual-population interactions, enhancing agents' global perception and decision-making capabilities. In addition, we propose a multi-source information aggregation module that transforms heterogeneous social metadata into a uniform formulation for LLMs. Finally, propagation dynamics with multimodal information are fused to provide comprehensive popularity prediction. Extensive experiments on real-world datasets demonstrate that SimPop consistently outperforms the state-of-the-art methods, reducing prediction error by an average of 8.82%, offering a new perspective for research on the SMPP task.
