Table of Contents
Fetching ...

Investigating Markers and Drivers of Gender Bias in Machine Translations

Peter J Barclay, Ashkan Sami

TL;DR

This study tackles gender bias in automatic translations by applying a back-translation approach across five genderless intermediate languages and introducing an adjusted variation metric (UCA) to quantify pronoun uncertainty without over-interpreting single pronouns. It demonstrates that pronoun variation clusters by language, identifies the main verb as a driver of gender uncertainty, and shows reproducibility across time despite an API shift in DeepL. The work extends previous single-language analyses, emphasizes multi-language generalizability, and provides a robust quantitative framework for studying bias in translation outputs. The findings offer a path toward more reliable bias diagnostics in translation systems and underscore the need for broader, cross-language analyses to inform remediation efforts.

Abstract

Implicit gender bias in Large Language Models (LLMs) is a well-documented problem, and implications of gender introduced into automatic translations can perpetuate real-world biases. However, some LLMs use heuristics or post-processing to mask such bias, making investigation difficult. Here, we examine bias in LLMss via back-translation, using the DeepL translation API to investigate the bias evinced when repeatedly translating a set of 56 Software Engineering tasks used in a previous study. Each statement starts with 'she', and is translated first into a 'genderless' intermediate language then back into English; we then examine pronoun-choice in the back-translated texts. We expand prior research in the following ways: (1) by comparing results across five intermediate languages, namely Finnish, Indonesian, Estonian, Turkish and Hungarian; (2) by proposing a novel metric for assessing the variation in gender implied in the repeated translations, avoiding the over-interpretation of individual pronouns, apparent in earlier work; (3) by investigating sentence features that drive bias; (4) and by comparing results from three time-lapsed datasets to establish the reproducibility of the approach. We found that some languages display similar patterns of pronoun use, falling into three loose groups, but that patterns vary between groups; this underlines the need to work with multiple languages. We also identify the main verb appearing in a sentence as a likely significant driver of implied gender in the translations. Moreover, we see a good level of replicability in the results, and establish that our variation metric proves robust despite an obvious change in the behaviour of the DeepL translation API during the course of the study. These results show that the back-translation method can provide further insights into bias in language models.

Investigating Markers and Drivers of Gender Bias in Machine Translations

TL;DR

This study tackles gender bias in automatic translations by applying a back-translation approach across five genderless intermediate languages and introducing an adjusted variation metric (UCA) to quantify pronoun uncertainty without over-interpreting single pronouns. It demonstrates that pronoun variation clusters by language, identifies the main verb as a driver of gender uncertainty, and shows reproducibility across time despite an API shift in DeepL. The work extends previous single-language analyses, emphasizes multi-language generalizability, and provides a robust quantitative framework for studying bias in translation outputs. The findings offer a path toward more reliable bias diagnostics in translation systems and underscore the need for broader, cross-language analyses to inform remediation efforts.

Abstract

Implicit gender bias in Large Language Models (LLMs) is a well-documented problem, and implications of gender introduced into automatic translations can perpetuate real-world biases. However, some LLMs use heuristics or post-processing to mask such bias, making investigation difficult. Here, we examine bias in LLMss via back-translation, using the DeepL translation API to investigate the bias evinced when repeatedly translating a set of 56 Software Engineering tasks used in a previous study. Each statement starts with 'she', and is translated first into a 'genderless' intermediate language then back into English; we then examine pronoun-choice in the back-translated texts. We expand prior research in the following ways: (1) by comparing results across five intermediate languages, namely Finnish, Indonesian, Estonian, Turkish and Hungarian; (2) by proposing a novel metric for assessing the variation in gender implied in the repeated translations, avoiding the over-interpretation of individual pronouns, apparent in earlier work; (3) by investigating sentence features that drive bias; (4) and by comparing results from three time-lapsed datasets to establish the reproducibility of the approach. We found that some languages display similar patterns of pronoun use, falling into three loose groups, but that patterns vary between groups; this underlines the need to work with multiple languages. We also identify the main verb appearing in a sentence as a likely significant driver of implied gender in the translations. Moreover, we see a good level of replicability in the results, and establish that our variation metric proves robust despite an obvious change in the behaviour of the DeepL translation API during the course of the study. These results show that the back-translation method can provide further insights into bias in language models.
Paper Structure (14 sections, 2 equations, 3 figures, 7 tables)

This paper contains 14 sections, 2 equations, 3 figures, 7 tables.

Figures (3)

  • Figure 1: Pronoun distribution across sentences for Estonian
  • Figure 2: UCA per verb, Estonian back-translation
  • Figure 3: UCA per sentence, FI0 versus FI datasets