Theories of "Sexuality" in Natural Language Processing Bias Research
Jacob Hobbs
TL;DR
The paper investigates how sexuality is theorized and operationalized in NLP bias research, revealing a widespread lack of explicit theoretical grounding and a tendency to equate sexuality with gender or to default to heterosexual norms. By surveying 55 papers across ACM and ACL venues, it shows heavy reliance on identity word lists, limited use of intersectionality, and prevalent heteronormative biases in methodology. The authors synthesize queer/feminist theory to critique current practices and provide actionable recommendations—explicit theory, intersectional analyses, power-dynamics-aware datasets, and engagement with queer methodologies—to improve validity and inclusivity in sexuality-bias analyses. This work advances sociotechnical critique in NLP and offers a framework to align bias research with lived queer experiences and interdisciplinary scholarship.
Abstract
In recent years, significant advancements in the field of Natural Language Processing (NLP) have positioned commercialized language models as wide-reaching, highly useful tools. In tandem, there has been an explosion of multidisciplinary research examining how NLP tasks reflect, perpetuate, and amplify social biases such as gender and racial bias. A significant gap in this scholarship is a detailed analysis of how queer sexualities are encoded and (mis)represented by both NLP systems and practitioners. Following previous work in the field of AI fairness, we document how sexuality is defined and operationalized via a survey and analysis of 55 articles that quantify sexuality-based NLP bias. We find that sexuality is not clearly defined in a majority of the literature surveyed, indicating a reliance on assumed or normative conceptions of sexual/romantic practices and identities. Further, we find that methods for extracting biased outputs from NLP technologies often conflate gender and sexual identities, leading to monolithic conceptions of queerness and thus improper quantifications of bias. With the goal of improving sexuality-based NLP bias analyses, we conclude with recommendations that encourage more thorough engagement with both queer communities and interdisciplinary literature.
