Towards Region-aware Bias Evaluation Metrics
Angana Borah, Aparna Garimella, Rada Mihalcea
TL;DR
This paper addresses regional variation in gender bias as expressed by language models, arguing that fixed, universal bias dimensions fail to capture culturally specific stereotypes. It introduces a bottom-up, region-aware framework that first identifies region-specific F/M topics via topic modeling, then forms region-biased topic pairs through embedding similarities, and finally validates these dimensions with IAT-style human studies and region-aware WEAT evaluations. The authors demonstrate region-dependent bias dimensions across GeoWAC-derived topics and show that LLM outputs align with region-aware biases in highly represented regions but misalign for underrepresented regions, highlighting the need for region-sensitive bias metrics. The work advances bias evaluation in NLP by coupling data-driven topic discovery with human validation and cross-model analyses, informing more accurate region-aware assessment and potential mitigation strategies for multilingual and culturally diverse contexts.
Abstract
When exposed to human-generated data, language models are known to learn and amplify societal biases. While previous works introduced benchmarks that can be used to assess the bias in these models, they rely on assumptions that may not be universally true. For instance, a gender bias dimension commonly used by these metrics is that of family--career, but this may not be the only common bias in certain regions of the world. In this paper, we identify topical differences in gender bias across different regions and propose a region-aware bottom-up approach for bias assessment. Our proposed approach uses gender-aligned topics for a given region and identifies gender bias dimensions in the form of topic pairs that are likely to capture gender societal biases. Several of our proposed bias topic pairs are on par with human perception of gender biases in these regions in comparison to the existing ones, and we also identify new pairs that are more aligned than the existing ones. In addition, we use our region-aware bias topic pairs in a Word Embedding Association Test (WEAT)-based evaluation metric to test for gender biases across different regions in different data domains. We also find that LLMs have a higher alignment to bias pairs for highly-represented regions showing the importance of region-aware bias evaluation metric.
