Detecting Anti-vaccine Content on Twitter using Multiple Message-Based Network Representations
James R. Ashford
TL;DR
This study investigates detecting anti-vaccine content on Twitter by constructing three message-based interaction networks (mentions, replies, and quote retweets) for each term related to COVID-19. It ranks terms via crowdsourced Likert ratings, hydrates tweets to obtain content and metadata, and extracts both global and local graph features from 597 term–interaction networks to train binary classifiers. Global network features, particularly when using a random forest classifier and combining all interaction types, yield the strongest performance (about 0.886 accuracy), while local features offer competitive but generally lower gains. The findings demonstrate that language-agnostic, graph-based representations can effectively distinguish controversial from non-controversial terms at scale, with potential applications in content moderation across platforms and languages.
Abstract
Social media platforms such as Twitter have a fundamental role in facilitating the spread and discussion of ideas online through the concept of retweeting and replying. However, these features also contribute to the spread of mis/disinformation during the vaccine rollout of the COVID-19 pandemic. Using COVID-19 vaccines as a case study, we analyse multiple social network representation derived from three message-based interactions on Twitter (quote retweets, mentions and replies) based upon a set of known anti-vax hashtags and keywords. Each network represents a certain hashtag or keyword which were labelled as "controversial" and "non-controversial" according to a small group of participants. For each network, we extract a combination of global and local network-based metrics which are used as feature vectors for binary classification. Our results suggest that it is possible to detect controversial from non-controversial terms with high accuracy using simple network-based metrics. Furthermore, these results demonstrate the potential of network representations as language-agnostic models for detecting mis/disinformation at scale, irrespective of content and across multiple social media platforms.
