Detecting Viral Social Events through Censored Observation with Deep Survival Analysis
Maryam Ramezani, Hossein Goli, AmirMohammad Izad, Hamid R. Rabiee
TL;DR
This work addresses detecting viral social events from censored cascade data by framing virality as a time-to-event problem and applying deep survival analysis. It introduces VEDSA, a two-stage framework with a γ model that learns a parametric survival function via an LSTM, and a δ discriminator that uses the inferred survival to classify cascades as viral or not, leveraging discretized bin covariates $B^{(i)}$ and a labeled viral threshold $\zeta$. The method evaluates Exponential, Rayleigh, and Weibull distributions, and demonstrates that Weibull generally provides the strongest predictive performance across Twitter, Digg, and Weibo datasets under censored observations. The results highlight the practical potential for early rumor detection, influence estimation, and targeted information management, with robust performance even when only partial cascade data is available.
Abstract
Users increasing activity across various social networks made it the most widely used platform for exchanging and propagating information among individuals. To spread information within a network, a user initially shared information on a social network, and then other users in direct contact with him might have shared that information. Information expanded throughout the network by repeatedly following this process. A set of information that became popular and was repeatedly shared by different individuals was called viral events. Identifying and analyzing viral social events led to valuable insights into the dynamics of information dissemination within a network. However, more importantly, proactive approaches emerged. In other words, by observing the dissemination pattern of a piece of information in the early stages of expansion, it became possible to determine whether this cascade would become viral in the future. This research aimed to predict and detect viral events in social networks by observing granular information and using a deep survival analysis-based method. This model could play a significant role in identifying rumors, predicting the impact of information, and assisting in optimal decision-making in information management and marketing. Ultimately, the proposed method was tested on various real-world datasets from Twitter, Weibo, and Digg.
