The Chilling: Identifying Strategic Antisocial Behavior Online and Examining the Impact on Journalists
Yian Wang, Mukhilshankar Umashankar, Eshwar Chandrasekharan, Hari Sundaram
TL;DR
This paper addresses the problem of identifying strategic antisocial behavior online, particularly coordinated attacks against female journalists on Twitter. It introduces ConvTreeTrans, a tree-structured Transformer that leverages conversation hierarchies to classify replies into attackers, supporters, and bystanders and to infer latent strategies via a joint loss framework. The authors show ConvTreeTrans outperforms baselines in both group classification and strategy discovery, reveal that attacker presence correlates with chilling effects on journalists’ posting behavior, and illuminate distinct strategic patterns across groups. The findings highlight practical implications for platform moderation and journalist-facing tools to curb coordinated toxicity and foster healthier online conversations. The work advances understanding of how structure in online conversations relates to strategic behavior and contributes actionable insights for real-time toxicity detection and moderation design.
Abstract
On social platforms like Twitter, strategic targeted attacks are becoming increasingly common, especially against vulnerable groups such as female journalists. Two key challenges in identifying strategic online behavior are the complex structure of online conversations and the hidden nature of potential strategies that drive user behavior. To address these, we develop a new tree structured Transformer model that categorizes replies based on their hierarchical conversation structures. Extensive experiments demonstrate that our proposed classification model can effectively detect different user groups, namely attackers, supporters, and bystanders, and their latent strategies. To demonstrate the utility of our approach, we apply this classifier to real time Twitter data and conduct a series of quantitative analyses on the interactions between journalists with different groups of users. Our classification approach allows us to not only explore strategic behaviors of attackers but also those of supporters and bystanders who engage in online interactions. When examining the impact of online attacks, we find a strong correlation between the presence of attackers' interactions and chilling effects, where journalists tend to slow their subsequent posting behavior. This paper provides a deeper understanding of how different user groups engage in online discussions and highlights the detrimental effects of attacker presence on journalists, other users, and conversational outcomes. Our findings underscore the need for social platforms to develop tools that address coordinated toxicity. By detecting patterns of coordinated attacks early, platforms could limit the visibility of toxic content to prevent escalation. Additionally, providing journalists and users with tools for real time reporting could empower them to manage hostile interactions more effectively.
