Engagement, User Satisfaction, and the Amplification of Divisive Content on Social Media
Smitha Milli, Micah Carroll, Yike Wang, Sashrika Pandey, Sebastian Zhao, Anca D. Dragan
TL;DR
The paper investigates whether Twitter's engagement-based ranking amplifies emotionally charged, partisan, and out-group content, using a pre-registered algorithmic audit with 806 participants comparing the engagement-based timeline to a reverse-chronological baseline and to a stated-preference (SP) timeline derived from participant surveys. It employs reader judgments and GPT-4 labeling to assess tweet-level emotions, partisanship, and out-group animus, finding that the engagement-based timeline increases anger, polarization, and negative attitudes toward out-groups (e.g., $0.24$ SD increases in partisanship and out-group animus; reader-out-group affect worsens by $-0.17$ SD; author anger rises by $0.47$ SD). The SP timeline reduces negativity and animus relative to engagement, but may intensify in-group bias; a SP-OA variant that down-ranks out-group hostility further lowers out-group animus to about $17$ extpercent of political tweets, suggesting a feasible path to align content with stated preferences while mitigating divisive content. The findings highlight a need for nuanced ranking strategies that balance engagement with users' stated preferences to curb polarization and misperception on social platforms.
Abstract
In a pre-registered algorithmic audit, we found that, relative to a reverse-chronological baseline, Twitter's engagement-based ranking algorithm amplifies emotionally charged, out-group hostile content that users say makes them feel worse about their political out-group. Furthermore, we find that users do \emph{not} prefer the political tweets selected by the algorithm, suggesting that the engagement-based algorithm underperforms in satisfying users' stated preferences. Finally, we explore the implications of an alternative approach that ranks content based on users' stated preferences and find a reduction in angry, partisan, and out-group hostile content, but also a potential reinforcement of pro-attitudinal content. The evidence underscores the necessity for a more nuanced approach to content ranking that balances engagement and users' stated preferences.
