On the Fairness, Diversity and Reliability of Text-to-Image Generative Models

Jordan Vice; Naveed Akhtar; Leonid Sigal; Richard Hartley; Ajmal Mian

On the Fairness, Diversity and Reliability of Text-to-Image Generative Models

Jordan Vice, Naveed Akhtar, Leonid Sigal, Richard Hartley, Ajmal Mian

TL;DR

This work addresses reliability and fairness concerns in text-to-image generation by introducing a training-free, grey-box evaluation framework based on embedding perturbations. It defines global and local reliability ($\mathcal{R}_G$, $\mathcal{R}_L$) and introduces generative diversity ($\mathcal{D}_{\tilde{x}_T}$) and fairness ($\mathcal{F}_{\tilde{x}_T}$) metrics, along with a bias-provenance retrieval mechanism. The framework enables detection and tracing of intentional biases (backdoors/triggers) and supports bias provenance, using both rare and natural-language triggers. The approach is validated across benign and intentionally-biased models with open-source code, offering a practical tool for auditing public T2I systems and guiding safer deployment.

Abstract

The rapid proliferation of multimodal generative models has sparked critical discussions on their reliability, fairness and potential for misuse. While text-to-image models excel at producing high-fidelity, user-guided content, they often exhibit unpredictable behaviors and vulnerabilities that can be exploited to manipulate class or concept representations. To address this, we propose an evaluation framework to assess model reliability by analyzing responses to global and local perturbations in the embedding space, enabling the identification of inputs that trigger unreliable or biased behavior. Beyond social implications, fairness and diversity are fundamental to defining robust and trustworthy model behavior. Our approach offers deeper insights into these essential aspects by evaluating: (i) generative diversity, measuring the breadth of visual representations for learned concepts, and (ii) generative fairness, which examines the impact that removing concepts from input prompts has on control, under a low guidance setup. Beyond these evaluations, our method lays the groundwork for detecting unreliable, bias-injected models and tracing the provenance of embedded biases. Our code is publicly available at https://github.com/JJ-Vice/T2I_Fairness_Diversity_Reliability. Keywords: Fairness, Reliability, AI Ethics, Bias, Text-to-Image Models

On the Fairness, Diversity and Reliability of Text-to-Image Generative Models

TL;DR

Abstract

On the Fairness, Diversity and Reliability of Text-to-Image Generative Models

TL;DR

Abstract

Paper Structure

Table of Contents

Figures (12)