Table of Contents
Fetching ...

Two Tickets are Better than One: Fair and Accurate Hiring Under Strategic LLM Manipulations

Lee Cohen, Jack Hsieh, Connie Hong, Judy Hanwen Shen

TL;DR

This work analyzes fairness and accuracy in hiring under stochastic resume manipulations by large language models. It introduces a two-ticket scheme where the Hirer applies an additional LLM-based manipulation and evaluates both the original and manipulated resumes, providing theoretical guarantees under a No False Positives objective. The approach generalizes to an $n$-ticket scheme, showing convergence to a group-independent decision as $n \to \infty$, thereby mitigating disparities due to unequal LLM access. Empirical validation on real resumes demonstrates improvements in true positive rate and reductions in group disparities, supporting the method’s potential to alleviate fairness gaps in automated hiring while highlighting deployment considerations and limitations.

Abstract

In an era of increasingly capable foundation models, job seekers are turning to generative AI tools to enhance their application materials. However, unequal access to and knowledge about generative AI tools can harm both employers and candidates by reducing the accuracy of hiring decisions and giving some candidates an unfair advantage. To address these challenges, we introduce a new variant of the strategic classification framework tailored to manipulations performed using large language models, accommodating varying levels of manipulations and stochastic outcomes. We propose a ``two-ticket'' scheme, where the hiring algorithm applies an additional manipulation to each submitted resume and considers this manipulated version together with the original submitted resume. We establish theoretical guarantees for this scheme, showing improvements for both the fairness and accuracy of hiring decisions when the true positive rate is maximized subject to a no false positives constraint. We further generalize this approach to an $n$-ticket scheme and prove that hiring outcomes converge to a fixed, group-independent decision, eliminating disparities arising from differential LLM access. Finally, we empirically validate our framework and the performance of our two-ticket scheme on real resumes using an open-source resume screening tool.

Two Tickets are Better than One: Fair and Accurate Hiring Under Strategic LLM Manipulations

TL;DR

This work analyzes fairness and accuracy in hiring under stochastic resume manipulations by large language models. It introduces a two-ticket scheme where the Hirer applies an additional LLM-based manipulation and evaluates both the original and manipulated resumes, providing theoretical guarantees under a No False Positives objective. The approach generalizes to an -ticket scheme, showing convergence to a group-independent decision as , thereby mitigating disparities due to unequal LLM access. Empirical validation on real resumes demonstrates improvements in true positive rate and reductions in group disparities, supporting the method’s potential to alleviate fairness gaps in automated hiring while highlighting deployment considerations and limitations.

Abstract

In an era of increasingly capable foundation models, job seekers are turning to generative AI tools to enhance their application materials. However, unequal access to and knowledge about generative AI tools can harm both employers and candidates by reducing the accuracy of hiring decisions and giving some candidates an unfair advantage. To address these challenges, we introduce a new variant of the strategic classification framework tailored to manipulations performed using large language models, accommodating varying levels of manipulations and stochastic outcomes. We propose a ``two-ticket'' scheme, where the hiring algorithm applies an additional manipulation to each submitted resume and considers this manipulated version together with the original submitted resume. We establish theoretical guarantees for this scheme, showing improvements for both the fairness and accuracy of hiring decisions when the true positive rate is maximized subject to a no false positives constraint. We further generalize this approach to an -ticket scheme and prove that hiring outcomes converge to a fixed, group-independent decision, eliminating disparities arising from differential LLM access. Finally, we empirically validate our framework and the performance of our two-ticket scheme on real resumes using an open-source resume screening tool.

Paper Structure

This paper contains 31 sections, 18 theorems, 42 equations, 11 figures, 8 tables.

Key Result

Lemma 5.1

$Z_1$ stochastically dominates $Z_2$ if and only if for every non-decreasing function $u$,

Figures (11)

  • Figure 1: Resume score distribution of 50 qualified (matching occupation) and 50 unqualified (different occupation) resumes before and after LLM manipulations for two job descriptions. Models such as GPT-4o and Claude-3.5-Sonnet and DeepSeek-V3 generate a distribution of unqualified resumes that is indistinguishable from qualified resumes without manipulation for the Product Manager job and significantly enhance the scores of the qualified resumes for the UX Designer position.
  • Figure 2: (a) Applying LLM manipulations to resumes result in stochastic outcomes: even when the average score increases (e.g., DeepSeek-V3, some resumes receive lower scores after manipulation). (b) Repeatedly using LLMs to enhance a resume results in stagnating improvements.
  • Figure 3: Visualization of an LLM Manipulation$L$ over $\mathcal{X} = \mathbb{R}^2$ with one style feature ($x_1$) and one fundamental feature ($x_2$). Modifying $x_1$ may move the candidate into the acceptance region (in blue).
  • Figure 4: Example of Privileged-Unprivileged Assignment in the Training Set Resulting in Differing Thresholds Between the Traditional and Two-Ticket Schemes.
  • Figure 5: Resume score distribution of 50 qualified (matching occupation) and 50 unqualified (different occupation) resumes before and after LLM manipulations for more Product Manager Job Descriptions
  • ...and 6 more figures

Theorems & Definitions (45)

  • Definition 4.1: Mathematical formulation of Strategic LLM Manipulation
  • Remark
  • Definition 4.2: No False Positives
  • Definition 4.3: Traditional Hiring Scheme under LLM Manipulation
  • Remark
  • Definition 5.1
  • Definition 5.2: levhari1975efficiency
  • Lemma 5.1: levhari1975efficiency
  • Definition 5.3
  • Remark
  • ...and 35 more