More Women, Same Stereotypes: Unpacking the Gender Bias Paradox in Large Language Models

Evan Chen; Run-Jun Zhan; Yan-Bai Lin; Hung-Hsuan Chen

TL;DR

This study introduces a novel evaluation framework to uncover gender biases in LLMs: using free-form storytelling to surface biases embedded within the models.

Abstract

Large Language Models (LLMs) have revolutionized natural language processing, yet concerns persist regarding their tendency to reflect or amplify social biases. This study introduces a novel evaluation framework to uncover gender biases in LLMs: using free-form storytelling to surface biases embedded within the models. A systematic analysis of ten prominent LLMs shows a consistent pattern of overrepresenting female characters across occupations, likely due to supervised fine-tuning (SFT) and reinforcement learning from human feedback (RLHF). Paradoxically, despite this overrepresentation, the occupational gender distributions produced by these LLMs align more closely with human stereotypes than with real-world labor data. This highlights both the difficulty and the importance of implementing balanced mitigation measures that promote fairness without introducing new biases. We release the prompts and LLM-generated stories on GitHub.
