Sociocultural Considerations in Monitoring Anti-LGBTQ+ Content on Social Media
Sidney G. -J. Wong
TL;DR
This paper investigates how sociocultural factors shape hate-speech detection systems for anti-LGBTQ+ content on social media. It compares two open-source datasets (mlma and ltedi) and trains transformer-based detectors, applying them to georeferenced samples across national-English varieties to examine cross-cultural performance. The findings show that training-data sociolinguistic alignment biases model outputs and that keyword-based data collection can overfit on slurs, potentially missing non-slur anti-LGBTQ+ content; domain adaptation and qualitative interpretation are necessary. The study highlights the need to integrate sociolinguistic context in both data collection and model evaluation to produce fit-for-purpose monitoring tools and informs future work toward more inclusive, context-aware hate-speech detection.
Abstract
The purpose of this paper is to ascertain the influence of sociocultural factors (i.e., social, cultural, and political) in the development of hate speech detection systems. We set out to investigate the suitability of using open-source training data to monitor levels of anti-LGBTQ+ content on social media across different national-varieties of English. Our findings suggests the social and cultural alignment of open-source hate speech data sets influences the predicted outputs. Furthermore, the keyword-search approach of anti-LGBTQ+ slurs in the development of open-source training data encourages detection models to overfit on slurs; therefore, anti-LGBTQ+ content may go undetected. We recommend combining empirical outputs with qualitative insights to ensure these systems are fit for purpose.
