Fair Text-to-Image Diffusion via Fair Mapping
Jia Li, Lijie Hu, Jingfeng Zhang, Tianhang Zheng, Hua Zhang, Di Wang
TL;DR
This work addresses demographically biased outputs in text-to-image diffusion models prompted by human-related descriptions. It introduces Fair Mapping, a lightweight, model-agnostic debiasing module that adds a linear mapping network after the text encoder and an inference-time detector to map conditioning embeddings into a debiased space, trained with $L_{text}$ and $L_{fair}$. Empirical results on face-generation tasks show reduced diffusion and language bias with minimal loss in image quality, along with improved alignment to human-related prompts and efficient training (e.g., an eight-layer mapping requiring modest compute). The approach is practical for real-world deployment due to its model-agnostic nature, low parameter overhead, and scalable fairness improvements.
Abstract
In this paper, we address the limitations of existing text-to-image diffusion models in generating demographically fair results when given human-related descriptions. These models often struggle to disentangle the target language context from sociocultural biases, resulting in biased image generation. To overcome this challenge, we propose Fair Mapping, a flexible, model-agnostic, and lightweight approach that modifies a pre-trained text-to-image diffusion model by controlling the prompt to achieve fair image generation. One key advantage of our approach is its high efficiency. It only requires updating an additional linear network with few parameters at a low computational cost. By developing a linear network that maps conditioning embeddings into a debiased space, we enable the generation of relatively balanced demographic results based on the specified text condition. With comprehensive experiments on face image generation, we show that our method significantly improves image generation fairness with almost the same image quality compared to conventional diffusion models when prompted with descriptions related to humans. By effectively addressing the issue of implicit language bias, our method produces more fair and diverse image outputs.
