Algorithmic Behaviors Across Regions: A Geolocation Audit of YouTube Search for COVID-19 Misinformation Between the United States and South Africa
Hayoung Jung, Prerna Juneja, Tanushree Mitra
TL;DR
This study conducts the first large-scale geolocation-based audit of YouTube search for COVID-19 misinformation across the US and SA, using sock-puppet bots across three geolocations per country and 48 queries over 10 days to collect 915K results. It develops a ground-truth labelled set of 3,075 videos and trains multiple classifiers, finding DeBERTa-v3-large performs best for English content with an accuracy of about 0.85; non-English videos are handled separately. The results reveal that YouTube search behaves differently by geolocation, with SA users encountering more misinformative top-tier results, and several topics (e.g., 5G, Bill Gates claims, and vaccine content) showing regional disparities, while some topics are mitigated by YouTube’s moderation in the US. The authors argue for regionally consistent algorithmic regulation and highlight the practical health risks of misinformative SERPs, especially in the Global South, while also discussing methodological challenges in Global South audits and suggesting directions for future work. Overall, the paper contributes a rigorous cross-region methodological framework, a rich labeled dataset, and empirical evidence of geographic inequities in misinformation exposure on YouTube, with implications for platform governance and public health policy.
Abstract
Despite being an integral tool for finding health-related information online, YouTube has faced criticism for disseminating COVID-19 misinformation globally to its users. Yet, prior audit studies have predominantly investigated YouTube within the Global North contexts, often overlooking the Global South. To address this gap, we conducted a comprehensive 10-day geolocation-based audit on YouTube to compare the prevalence of COVID-19 misinformation in search results between the United States (US) and South Africa (SA), the countries heavily affected by the pandemic in the Global North and the Global South, respectively. For each country, we selected 3 geolocations and placed sock-puppets, or bots emulating "real" users, that collected search results for 48 search queries sorted by 4 search filters for 10 days, yielding a dataset of 915K results. We found that 31.55% of the top-10 search results contained COVID-19 misinformation. Among the top-10 search results, bots in SA faced significantly more misinformative search results than their US counterparts. Overall, our study highlights the contrasting algorithmic behaviors of YouTube search between two countries, underscoring the need for the platform to regulate algorithmic behavior consistently across different regions of the Globe.
