Interpretable Early Warnings using Machine Learning in an Online Game-experiment
Guillaume Falmagne, Anna B. Stephenson, Simon A. Levin
TL;DR
This work addresses the challenge of forecasting regime shifts in a large-scale online social system by building an interpretable, data-driven early-warning framework. It uses gradient-boosted decision trees trained on system-specific time-series with a $7\text{h}$ memory and interprets predictions via SHAP to reveal underlying drivers of transitions in Reddit's $r/place$ canvases. The model achieves a ROC AUC of $0.833$ and detects about half of incoming transitions within $20$ minutes at a false-positive rate of $3.7\%$, with robust generalization to the 2023 event (AUC around $0.69$ for a $6$-hour horizon). SHAP-based analysis uncovers 12 pre-transition behavioral patterns, including aspects of critical slowing down, innovation, and coordination, offering human-readable warnings that could inform monitoring in socio-ecological systems and other dense, high-dimensional domains.
Abstract
Stemming from physics and later applied to other fields such as ecology, the theory of critical transitions suggests that some regime shifts are preceded by statistical early warning signals. Reddit's r/place experiment, a large-scale social game, provides a unique opportunity to test these signals consistently across thousands of subsystems undergoing critical transitions. In r/place, millions of users collaboratively created compositions, or pixel-art drawings, in which transitions occur when one composition rapidly replaces another. We develop a machine-learning-based early warning system that combines the predictive power of multiple system-specific time series via gradient-boosted decision trees with memory-retaining features. Our method significantly outperforms standard early warning indicators. Trained on the 2022 r/place data, our algorithm detects half of the transitions occurring within 20 minutes at a false positive rate of just 3.7%. Its performance remains robust when tested on the 2023 r/place event, demonstrating generalizability across different contexts. Using SHapley Additive exPlanations (SHAP) for interpreting the predictions, we investigate the underlying drivers of warnings, which could be relevant to other complex systems, especially online social systems. We reveal an interplay of patterns preceding transitions, such as critical slowing down or speeding up, a lack of innovation or coordination, turbulent histories, and a lack of image complexity. These findings show the potential of machine learning indicators in socio-ecological systems for predicting regime shifts and understanding their dynamics.
