Who's Asking? Investigating Bias Through the Lens of Disability Framed Queries in LLMs
Vishnu Hari, Kalpana Panda, Srikant Panda, Amit Agarwal, Hitesh Laxmichand Patel
TL;DR
The paper addresses the risk that LLMs infer user demographics from prompt phrasing, with disability cues potentially biasing those inferences. It conducts a comprehensive audit across eight instruction-tuned LLMs (3B–72B) using the AccessEval framework, nine disability categories, six business domains, and five demographic attributes under neutral and disability-aware prompts. Key findings show models infer demographics in up to 97% of cases, disability contexts shift attribute distributions, and domain context can amplify biases, with larger models sometimes displaying greater sensitivity to disability cues. The study highlights persistent intersections between ableism and other stereotypes, proposes abstention calibration and counterfactual fine-tuning as remedies, and contributes an evaluation framework and data to encourage disability-inclusive benchmarking. This work underscores the need for robust fairness mechanisms in LLM alignment and has practical implications for privacy, accessibility, and equitable AI deployment.
Abstract
Large Language Models (LLMs) routinely infer users' demographic traits from phrasing alone, which can result in biased responses even when no explicit demographic information is provided. The role of disability cues in shaping these inferences remains largely uncharted. We therefore present the first systematic audit of disability-conditioned demographic bias across eight state-of-the-art instruction-tuned LLMs ranging from 3B to 72B parameters. Using a balanced template corpus that pairs nine disability categories with six real-world business domains, we prompt each model to predict five demographic attributes (gender, socioeconomic status, education, cultural background, and locality) under both neutral and disability-aware conditions. Across a varied set of prompts, models deliver a definitive demographic guess in up to 97% of cases, exposing a strong tendency to make arbitrary inferences with no clear justification. Disability context heavily shifts predicted attribute distributions, and domain context can further amplify these deviations. We observe that larger models are simultaneously more sensitive to disability cues and more prone to biased reasoning, indicating that scale alone does not mitigate stereotype amplification. Our findings reveal persistent intersections between ableism and other demographic stereotypes, pinpointing critical blind spots in current alignment strategies. We release our evaluation framework and results to encourage disability-inclusive benchmarking and recommend integrating abstention calibration and counterfactual fine-tuning to curb unwarranted demographic inference. Code and data will be released on acceptance.
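The audit design described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' released code: the disability and domain labels are placeholders (the abstract gives only the counts, nine and six), the abstention label "unknown" is a hypothetical convention, and total variation distance is an assumed stand-in for whatever shift measure the paper actually uses.

```python
from collections import Counter
from itertools import product

# The five attributes are named in the abstract.
ATTRIBUTES = ["gender", "socioeconomic status", "education",
              "cultural background", "locality"]
# Placeholder labels: the abstract states only that there are nine
# disability categories and six business domains.
DISABILITIES = [f"disability_{i}" for i in range(1, 10)]
DOMAINS = [f"domain_{i}" for i in range(1, 7)]


def evaluation_cells():
    """Enumerate (disability, domain, attribute) cells for both conditions.

    Neutral cells carry no disability cue, so their first field is None.
    """
    neutral = [(None, dom, attr) for dom, attr in product(DOMAINS, ATTRIBUTES)]
    aware = list(product(DISABILITIES, DOMAINS, ATTRIBUTES))
    return neutral, aware


def definitive_rate(predictions, abstain_label="unknown"):
    """Fraction of predictions that commit to a label instead of abstaining."""
    if not predictions:
        raise ValueError("predictions must be non-empty")
    committed = sum(1 for p in predictions if p != abstain_label)
    return committed / len(predictions)


def distribution(predictions):
    """Normalize label counts into a probability distribution."""
    counts = Counter(predictions)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def total_variation(p, q):
    """Total variation distance between two label distributions (assumed metric)."""
    labels = set(p) | set(q)
    return 0.5 * sum(abs(p.get(l, 0.0) - q.get(l, 0.0)) for l in labels)
```

In use, one would collect a model's predicted labels for a given attribute under the neutral and the disability-aware condition, call `definitive_rate` on each list, and pass the two `distribution` outputs to `total_variation` to quantify the shift.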
