Divided by discipline? A systematic literature review on the quantification of online sexism and misogyny using a semi-automated approach
Aditi Dutta, Susan Banducci, Chico Q. Camargo
TL;DR
This systematic literature review addresses how online sexism and misogyny are quantified across social science and computer science, revealing a persistent disciplinary divide in definitions and methods. The authors introduce a semi-automated PRISMA-based pipeline that combines BERTopic topic modeling, KeyBERT keyword networks, and manual screening to map 2012–2022 literature, identifying five core themes and critical gaps, especially around intersectionality and non-Western contexts. The study highlights that most computational work emphasizes binary detection and text-based classification, while social science work emphasizes qualitative, contextual analyses, pointing to the need for integrated, interdisciplinary taxonomies and diverse datasets. By outlining methodological gaps and proposing a replicable semi-automated workflow, the paper contributes a practical framework for future research aimed at more nuanced, equitable, and globally representative detection and mitigation of online sexism and misogyny.
Abstract
Several computational tools have been developed to detect and identify sexism, misogyny, and gender-based hate speech, particularly on online platforms. These tools draw on insights from both social science and computer science. Given the increasing concern over gender-based discrimination in digital spaces, the contested definitions and measurements of sexism, and the rise of interdisciplinary efforts to understand its online manifestations, a systematic literature review is essential for capturing the current state and trajectory of this evolving field. In this review, we make four key contributions: (1) we synthesize the literature into five core themes: definitions of sexism and misogyny, disciplinary divergences, automated detection methods, associated challenges, and design-based interventions; (2) we adopt an interdisciplinary lens, bridging theoretical and methodological divides across disciplines; (3) we highlight critical gaps, including the need for intersectional approaches, the under-representation of non-Western languages and perspectives, and the limited focus on proactive design strategies beyond text classification; and (4) we offer a methodological contribution by applying a rigorous semi-automated systematic review process guided by PRISMA, establishing a replicable standard for future work in this domain. Our findings reveal a clear disciplinary divide in how sexism and misogyny are conceptualized and measured. Through an evidence-based synthesis, we examine how existing studies have attempted to bridge this gap through interdisciplinary collaboration. Drawing on both social science theories and computational modeling practices, we assess the strengths and limitations of current methodologies. Finally, we outline key challenges and future directions for advancing research on the detection and mitigation of online sexism and misogyny.
