FairGen: Controlling Sensitive Attributes for Fair Generations in Diffusion Models via Adaptive Latent Guidance
Mintong Kang, Vinayshekhar Bannihatti Kumar, Shamik Roy, Abhishek Kumar, Sopan Khosla, Balakrishnan Murali Narayanaswamy, Rashmi Gangadharaiah
TL;DR
FairGen tackles bias in text-to-image diffusion by enabling inference-time control of attribute distributions through adaptive latent guidance and a memory-driven indicator. An attribute-aware generator trained with SFT and DPO produces guidance prompts, while a memory module steers per-prompt attribute choices to align outputs with a target distribution. The Holistic Bias Evaluation Benchmark (HBE) evaluates bias across diverse domains and prompt complexities, revealing robustness and generalization gaps in prior methods. Empirical results show FairGen achieves substantial bias reduction with preserved image quality across multiple diffusion models, and ablations demonstrate flexible control over output distributions and robustness to prompt and step variations. This work offers a practical, training-free approach for distribution-aware, fair generation in diffusion-based systems and introduces a comprehensive benchmark to advance evaluation in this area.
Abstract
Text-to-image diffusion models often exhibit biases toward specific demographic groups, such as generating more males than females when prompted to generate images of engineers, raising ethical concerns and limiting their adoption. In this paper, we tackle the challenge of mitigating generation bias towards any target attribute value (e.g., "male" for "gender") in diffusion models while preserving generation quality. We propose FairGen, an adaptive latent guidance mechanism which controls the generation distribution during inference. In FairGen, a latent guidance module dynamically adjusts the diffusion process to enforce specific attributes, while a memory module tracks the generation statistics and steers latent guidance to align with the targeted fair distribution of the attribute values. Furthermore, we address the limitations of existing datasets by introducing the Holistic Bias Evaluation (HBE) benchmark, which covers diverse domains and incorporates complex prompts to assess bias more comprehensively. Extensive evaluations on HBE and Stable Bias datasets demonstrate that FairGen outperforms existing bias mitigation approaches, achieving substantial bias reduction (e.g., 68.5% gender bias reduction on Stable Diffusion 2). Ablation studies highlight FairGen's ability to flexibly control the output distribution at any user-specified granularity, ensuring adaptive and targeted bias mitigation.
