A Multilingual Perspective on Probing Gender Bias
Karolina Stańczak
TL;DR
This work tackles gender bias in multilingual NLP by integrating gender theories, linguistic phenomena, and probing techniques across languages. It introduces dataset collections (social media, historical newspapers, ambassadors on Twitter) and novel probing methods (latent-variable intrinsic probes; template-based and identity-diverse datasets) to examine linguistic and societal biases encoded in language and language models. The studies reveal multilingual and cross-linguistic patterns, including causal analyses challenging the notion that grammatical gender strongly drives adjective choice, and highlight that bias manifests in nuanced ways across visibility, language, and sentiment rather than solely through negativity. Collectively, the contributions advance multidisciplinary bias analysis, provide benchmarks for multilingual bias detection, and offer methodological guidance for probing both natural language and language models in diverse linguistic contexts.
Abstract
Gender bias represents a form of systematic negative treatment that targets individuals based on their gender. This discrimination can range from subtle sexist remarks and gendered stereotypes to outright hate speech. Prior research has revealed that ignoring online abuse not only affects the individuals targeted but also has broader societal implications. These consequences extend to the discouragement of women's engagement and visibility within public spheres, thereby reinforcing gender inequality. This thesis investigates the nuances of how gender bias is expressed through language and within language technologies. Significantly, this thesis expands research on gender bias to multilingual contexts, emphasising the importance of a multilingual and multicultural perspective in understanding societal biases. In this thesis, I adopt an interdisciplinary approach, bridging natural language processing with other disciplines such as political science and history, to probe gender bias in natural language and language models.
