Table of Contents
Fetching ...

Stable Diffusion Exposed: Gender Bias from Prompt to Image

Yankun Wu, Yuta Nakashima, Noa Garcia

TL;DR

This work tackles gender bias in text-to-image generation by introducing an automated, multi-stage evaluation protocol for Stable Diffusion. It leverages free-form triplet prompts (neutral, feminine, masculine) to trace bias from prompt space through denoising and image spaces, revealing that neutral prompts tend to resemble masculine prompts and that objects not explicitly mentioned in prompts exhibit gendered generation patterns. The authors identify the bias source in text embeddings and propagate it via the generation pipeline, analyze object-level and dependency-based disparities using advanced grounding and attention techniques, and provide actionable recommendations for developers and users to mitigate bias. The protocol enables systematic, automatic bias analysis across prompts, representations, and generated images, contributing a practical framework for reducing gender disparities in diffusion-based image synthesis. Overall, the study demonstrates robust, cross-model, cross-dataset evidence of gender-influenced object generation and layouts, and offers concrete debiasing strategies and user guidance to improve fairness in practice.

Abstract

Several studies have raised awareness about social biases in image generative models, demonstrating their predisposition towards stereotypes and imbalances. This paper contributes to this growing body of research by introducing an evaluation protocol that analyzes the impact of gender indicators at every step of the generation process on Stable Diffusion images. Leveraging insights from prior work, we explore how gender indicators not only affect gender presentation but also the representation of objects and layouts within the generated images. Our findings include the existence of differences in the depiction of objects, such as instruments tailored for specific genders, and shifts in overall layouts. We also reveal that neutral prompts tend to produce images more aligned with masculine prompts than their feminine counterparts. We further explore where bias originates through representational disparities and how it manifests in the images via prompt-image dependencies, and provide recommendations for developers and users to mitigate potential bias in image generation.

Stable Diffusion Exposed: Gender Bias from Prompt to Image

TL;DR

This work tackles gender bias in text-to-image generation by introducing an automated, multi-stage evaluation protocol for Stable Diffusion. It leverages free-form triplet prompts (neutral, feminine, masculine) to trace bias from prompt space through denoising and image spaces, revealing that neutral prompts tend to resemble masculine prompts and that objects not explicitly mentioned in prompts exhibit gendered generation patterns. The authors identify the bias source in text embeddings and propagate it via the generation pipeline, analyze object-level and dependency-based disparities using advanced grounding and attention techniques, and provide actionable recommendations for developers and users to mitigate bias. The protocol enables systematic, automatic bias analysis across prompts, representations, and generated images, contributing a practical framework for reducing gender disparities in diffusion-based image synthesis. Overall, the study demonstrates robust, cross-model, cross-dataset evidence of gender-influenced object generation and layouts, and offers concrete debiasing strategies and user guidance to improve fairness in practice.

Abstract

Several studies have raised awareness about social biases in image generative models, demonstrating their predisposition towards stereotypes and imbalances. This paper contributes to this growing body of research by introducing an evaluation protocol that analyzes the impact of gender indicators at every step of the generation process on Stable Diffusion images. Leveraging insights from prior work, we explore how gender indicators not only affect gender presentation but also the representation of objects and layouts within the generated images. Our findings include the existence of differences in the depiction of objects, such as instruments tailored for specific genders, and shifts in overall layouts. We also reveal that neutral prompts tend to produce images more aligned with masculine prompts than their feminine counterparts. We further explore where bias originates through representational disparities and how it manifests in the images via prompt-image dependencies, and provide recommendations for developers and users to mitigate potential bias in image generation.
Paper Structure (72 sections, 7 equations, 12 figures, 14 tables)

This paper contains 72 sections, 7 equations, 12 figures, 14 tables.

Figures (12)

  • Figure 1: We use free-form triplet prompts to analyze the influence of gender indicators on the overall image generation process. We show that 1) gender indicators influence the generation of objects (left) and their layouts (right), and 2) the use of gender neutral words tends to produce images more similar to those prompted by masculine indicators rather than feminine ones.
  • Figure 2: Bias score in Flickr30k. The higher values (in blue) suggest an object is biased toward masculine prompts, while lower values (in orange) indicate a preference toward feminine prompts. $\text{BS}(o)= 0.5$ (green line) shows the object does not skew toward a certain gender. We filter objects if the maximum co-occurrence is less than $20$.
  • Figure 3: Prompt-image dependency groups.
  • Figure 4: Top-$10$ most frequent objects in each prompt-image dependency group in GCC on SD v2.0.
  • Figure 5: Bias score by groups on SD v1.4 and SD v2.0. Top: implicitly guided group in TextCaps on SD v1.4 and SD v2.0. Bottom: implicitly independent group in GCC on SD v1.4 and SD v2.0.
  • ...and 7 more figures

Theorems & Definitions (2)

  • Definition 6.1: Explicitly
  • Definition 6.2: Guided