Table of Contents
Fetching ...

ABLEIST: Intersectional Disability Bias in LLM-Generated Hiring Scenarios

Mahika Phutane, Hayoung Jung, Matthew Kim, Tanushree Mitra, Aditya Vashistha

TL;DR

This study conducts a global-south–oriented audit of six LLMs across 2,820 hiring conversations to reveal pervasive ableist and intersectional biases toward disabled candidates. It introduces ABLEist, a set of eight metrics (five ableism-specific and three intersectional) grounded in disability studies to detect covert harms, demonstrating substantial harms and complex amplification when gender and caste intersect with disability. Notably, standard toxicity detectors fail to flag these harms, prompting the authors to develop a reusable open-weight ABLEist detector via distillation of a strong teacher model. The work underscores the need for intersectional safety evaluations in high-stakes AI applications like hiring and provides both methodological tools and empirical evidence to push safety research beyond single-axis analyses. Collectively, the findings argue for integrating ABLEist assessments into AI governance to prevent deepening socio-economic disparities in the Global South and beyond.

Abstract

Large language models (LLMs) are increasingly under scrutiny for perpetuating identity-based discrimination in high-stakes domains such as hiring, particularly against people with disabilities (PwD). However, existing research remains largely Western-centric, overlooking how intersecting forms of marginalization--such as gender and caste--shape experiences of PwD in the Global South. We conduct a comprehensive audit of six LLMs across 2,820 hiring scenarios spanning diverse disability, gender, nationality, and caste profiles. To capture subtle intersectional harms and biases, we introduce ABLEIST (Ableism, Inspiration, Superhumanization, and Tokenism), a set of five ableism-specific and three intersectional harm metrics grounded in disability studies literature. Our results reveal significant increases in ABLEIST harms towards disabled candidates--harms that many state-of-the-art models failed to detect. These harms were further amplified by sharp increases in intersectional harms (e.g., Tokenism) for gender and caste-marginalized disabled candidates, highlighting critical blind spots in current safety tools and the need for intersectional safety evaluations of frontier models in high-stakes domains like hiring.

ABLEIST: Intersectional Disability Bias in LLM-Generated Hiring Scenarios

TL;DR

This study conducts a global-south–oriented audit of six LLMs across 2,820 hiring conversations to reveal pervasive ableist and intersectional biases toward disabled candidates. It introduces ABLEist, a set of eight metrics (five ableism-specific and three intersectional) grounded in disability studies to detect covert harms, demonstrating substantial harms and complex amplification when gender and caste intersect with disability. Notably, standard toxicity detectors fail to flag these harms, prompting the authors to develop a reusable open-weight ABLEist detector via distillation of a strong teacher model. The work underscores the need for intersectional safety evaluations in high-stakes AI applications like hiring and provides both methodological tools and empirical evidence to push safety research beyond single-axis analyses. Collectively, the findings argue for integrating ABLEist assessments into AI governance to prevent deepening socio-economic disparities in the Global South and beyond.

Abstract

Large language models (LLMs) are increasingly under scrutiny for perpetuating identity-based discrimination in high-stakes domains such as hiring, particularly against people with disabilities (PwD). However, existing research remains largely Western-centric, overlooking how intersecting forms of marginalization--such as gender and caste--shape experiences of PwD in the Global South. We conduct a comprehensive audit of six LLMs across 2,820 hiring scenarios spanning diverse disability, gender, nationality, and caste profiles. To capture subtle intersectional harms and biases, we introduce ABLEIST (Ableism, Inspiration, Superhumanization, and Tokenism), a set of five ableism-specific and three intersectional harm metrics grounded in disability studies literature. Our results reveal significant increases in ABLEIST harms towards disabled candidates--harms that many state-of-the-art models failed to detect. These harms were further amplified by sharp increases in intersectional harms (e.g., Tokenism) for gender and caste-marginalized disabled candidates, highlighting critical blind spots in current safety tools and the need for intersectional safety evaluations of frontier models in high-stakes domains like hiring.

Paper Structure

This paper contains 34 sections, 11 figures, 16 tables.

Figures (11)

  • Figure 1: We create many candidate profiles, spanning single-attribute identities (disability) to multi-attribute (disability, gender, and caste), generating 2,820 hiring conversations across 6 LLMs. We annotate the generated conversations across the ABLEist metrics. Our findings point to compounding intersectional harms for gender and caste-marginalized disabled candidates.
  • Figure 2: Heatmaps of ABLEist metrics scores by LLMs. A score of 0.5 indicates that 50% of all conversations generated by this LLM contained this harm. Scores are significantly greater when disability is specified.
  • Figure 3: Categorical Scatter Plot for Mean Intersectional Harm Metric scores (Inspiration Porn, Tokenism, Superhumanization) across baseline, disability, and intersectional identity groups. Gender & Caste Marginalized PwD groups (red) surface more harm than Gender & Caste Dominant groups (blue). Table \ref{['tab:differences_ableists_scores']} contains significance test results across these identity groups for each ABLEist metric.
  • Figure 4: Left: Proportion of conversations flagged for each metric by our ABLEist method (red) and Chast model (blue). Right: Boxplots of harm scores assigned by industry and popular baseline models: Perspective API, Azure AI Content Safety (Azure Safety), Detoxify, OpenAI Omni-Moderation (OpenAI Moderation). Flagging threshold=0.3 (red, horizontal line). None produce scores past threshold.
  • Figure 5: Mean ABLEist scores across disabilities. A Kruskal Wallis test computed initial significance across groups, and a post-hoc Dunn's test revealed significant pairwise differences, reported as *p < 0.05, ***p < 0.001. Large differences are visible for Superhumanization, Anticipated Ableism, and One-Size-Fits-All between Autism and Cerebral Palsy. Notably, blind candidate profiles generated the most Technoableism.
  • ...and 6 more figures