Quota-based debiasing can decrease representation of already underrepresented groups
Ivan Smirnov, Florian Lemmerich, Markus Strohmaier
TL;DR
The paper addresses the Debiasing Paradox: quota-based debiasing on a single binary attribute can worsen representation for already underrepresented subgroups when other correlated attributes are ignored. It introduces a theoretical model with two correlated binary attributes and perceived quality $\hat{q} = q - d_{color} I^{color} - d_{shape} I^{shape}$, showing a condition $d_{shape} > -d_{color} /(1 - 2f)$ under which debiasing on one attribute harms the most disadvantaged group. The authors validate the phenomenon across four real-world domains—education, wages, scientific citations, and recidivism—finding that quotas often decrease representation for certain subgroups and can reduce overall ranking fairness. They advocate for addressing root causes of inequality rather than relying on numeric quota solutions, and provide publicly available code to reproduce the results.
Abstract
Many important decisions in societies such as school admissions, hiring, or elections are based on the selection of top-ranking individuals from a larger pool of candidates. This process is often subject to biases, which typically manifest as an under-representation of certain groups among the selected or accepted individuals. The most common approach to this issue is debiasing, for example via the introduction of quotas that ensure proportional representation of groups with respect to a certain, often binary attribute. Cases include quotas for women on corporate boards or ethnic quotas in elections. This, however, has the potential to induce changes in representation with respect to other attributes. For the case of two correlated binary attributes we show that quota-based debiasing based on a single attribute can worsen the representation of already underrepresented groups and decrease overall fairness of selection. We use several data sets from a broad range of domains from recidivism risk assessments to scientific citations to assess this effect in real-world settings. Our results demonstrate the importance of including all relevant attributes in debiasing procedures and that more efforts need to be put into eliminating the root causes of inequalities as purely numerical solutions such as quota-based debiasing might lead to unintended consequences.
