Towards Effective Discrimination Testing for Generative AI
Thomas P. Zollo, Nikita Rajaneesh, Richard Zemel, Talia B. Gillis, Emily Black
TL;DR
GenAI fairness research currently lacks the specificity and context-sensitivity required by anti-discrimination regulation, risking deployment of systems that appear fair but cause discriminatory outcomes in practice. The paper synthesizes legal and technical perspectives and presents four case studies showing misalignment between common GenAI fairness tests and regulatory objectives, including downstream harms, red-teaming variability, complex interaction modes, and user-driven modifications. It argues for context-specific, robust testing frameworks that mirror deployment conditions and consider downstream allocation effects, multi-turn interactions, and parameter changes. The work provides practical mitigation directions—such as domain-tailored evaluation suites, multi-method red-teaming, and parameter-risk monitoring—to improve the reliability of fairness assessments in real-world GenAI deployments and to better inform policy. Overall, it highlights the need for technically grounded, regulatable testing protocols that reduce discrimination hazards in GenAI systems while supporting accountability and liability considerations.
Abstract
Generative AI (GenAI) models present new challenges in regulating against discriminatory behavior. In this paper, we argue that GenAI fairness research still has not met these challenges; instead, a significant gap remains between existing bias assessment methods and regulatory goals. This leads to ineffective regulation that can allow deployment of reportedly fair, yet actually discriminatory, GenAI systems. Towards remedying this problem, we connect the legal and technical literature around GenAI bias evaluation and identify areas of misalignment. Through four case studies, we demonstrate how this misalignment between fairness testing techniques and regulatory goals can result in discriminatory outcomes in real-world deployments, especially in adaptive or complex environments. We offer practical recommendations for improving discrimination testing to better align with regulatory goals and enhance the reliability of fairness assessments in future deployments.
