The Initial Screening Order Problem

Jose M. Alvarez; Antonio Mastropietro; Salvatore Ruggieri

The Initial Screening Order Problem

Jose M. Alvarez, Antonio Mastropietro, Salvatore Ruggieri

TL;DR

The paper addresses how the initial screening order (ISO) influences the selection of a size-$k$ candidate set under two search formulations, best-$k$ and good-$k$, and with a representational quota $q$. It formalizes a utility-based framework, introduces an algorithmic screener and a human-like screener with fatigue and position bias, and provides two sequential search procedures (ExaminationSearch and CascadeSearch) to study ISO effects via Monte Carlo simulations. The key finding is that ISO can degrade individual fairness for human-like screeners while preserving group fairness, and that partial-search dynamics under ISO can hinder optimality depending on score distributions and correlations between ISO and scores. The work also delivers a flexible simulation framework that can model multiple screening tasks and guide practitioners in mitigating ISO-induced biases before conducting real-world screening. Overall, the study highlights the importance of carefully designing the ISO and screening procedures to balance fairness and efficiency in candidate selection.

Abstract

We investigate the role of the initial screening order (ISO) in candidate screening. The ISO refers to the order in which the screener searches the candidate pool when selecting $k$ candidates. Today, it is common for the ISO to be the product of an information access system, such as an online platform or a database query. The ISO has been largely overlooked in the literature, despite its impact on the optimality and fairness of the selected $k$ candidates, especially under a human screener. We define two problem formulations describing the search behavior of the screener given an ISO: the best-$k$, where it selects the top $k$ candidates; and the good-$k$, where it selects the first good-enough $k$ candidates. To study the impact of the ISO, we introduce a human-like screener and compare it to its algorithmic counterpart, where the human-like screener is conceived to be inconsistent over time. Our analysis, in particular, shows that the ISO, under a human-like screener solving for the good-$k$ problem, hinders individual fairness despite meeting group fairness, and hampers the optimality of the selected $k$ candidates. This is due to position bias, where a candidate's evaluation is affected by its position within the ISO. We report extensive simulated experiments exploring the parameters of the best-$k$ and good-$k$ problems for both screeners. Our simulation framework is flexible enough to account for multiple candidate screening tasks, being an alternative to running real-world procedures.

The Initial Screening Order Problem

TL;DR

The paper addresses how the initial screening order (ISO) influences the selection of a size-

candidate set under two search formulations, best-

and good-

, and with a representational quota

. It formalizes a utility-based framework, introduces an algorithmic screener and a human-like screener with fatigue and position bias, and provides two sequential search procedures (ExaminationSearch and CascadeSearch) to study ISO effects via Monte Carlo simulations. The key finding is that ISO can degrade individual fairness for human-like screeners while preserving group fairness, and that partial-search dynamics under ISO can hinder optimality depending on score distributions and correlations between ISO and scores. The work also delivers a flexible simulation framework that can model multiple screening tasks and guide practitioners in mitigating ISO-induced biases before conducting real-world screening. Overall, the study highlights the importance of carefully designing the ISO and screening procedures to balance fairness and efficiency in candidate selection.

Abstract

We investigate the role of the initial screening order (ISO) in candidate screening. The ISO refers to the order in which the screener searches the candidate pool when selecting

candidates. Today, it is common for the ISO to be the product of an information access system, such as an online platform or a database query. The ISO has been largely overlooked in the literature, despite its impact on the optimality and fairness of the selected

candidates, especially under a human screener. We define two problem formulations describing the search behavior of the screener given an ISO: the best-

, where it selects the top

candidates; and the good-

, where it selects the first good-enough

candidates. To study the impact of the ISO, we introduce a human-like screener and compare it to its algorithmic counterpart, where the human-like screener is conceived to be inconsistent over time. Our analysis, in particular, shows that the ISO, under a human-like screener solving for the good-

problem, hinders individual fairness despite meeting group fairness, and hampers the optimality of the selected

candidates. This is due to position bias, where a candidate's evaluation is affected by its position within the ISO. We report extensive simulated experiments exploring the parameters of the best-

and good-

problems for both screeners. Our simulation framework is flexible enough to account for multiple candidate screening tasks, being an alternative to running real-world procedures.

Paper Structure (28 sections, 10 equations, 5 figures, 4 algorithms)

This paper contains 28 sections, 10 equations, 5 figures, 4 algorithms.

Introduction
Qualitative Background
Related Work
Searching the Pool of Candidates
Setting
Two Problem Formulations
Best-$k$.
Good-$k$
Two Search Procedures
The Human-Like Screener
Fatigue and Fatigued Scores
Position Bias Implications
An Experimental Framework
Setup
Generating the sample.
...and 13 more sections

Figures (5)

Figure 1: ExaminationSearch
Figure 2: Upper. Left: densities of candidate score distributions, with scores raging from 0 to 1. Center and Right: resp., RtB and JdS for all score distributions over $\psi$ under $\theta \mathrel{{\hbox{$\perp\mkern-10mu\perp$}}} s$. Lower. Left: $n, k$ combinations for $tN(0.5, 0.02)$ distribution over $\Psi$ under $\theta \mathrel{{\hbox{$\perp\mkern-10mu\perp$}}} s$. Center and Right: resp., RtB under different $\rho$'s (i.e., under $\theta \not\!\perp\!\!\!\perp s$) for $tN(0.5, 0.02)$ and $tN(1, 0.05)$ distributions over $\Psi$.
Figure 3: Upper. RtB for different candidate score distributions under $\theta \mathrel{{\hbox{$\perp\mkern-10mu\perp$}}} s$. Left: good-$k$ solution for fatigue with $\epsilon_1$ over $\psi$. Center: good-$k$ solution for fatigue with $\epsilon_2$ over $\psi$. Right: best-$k$ solution for fatigue with $\epsilon_1$ over $q$. Lower. RtB for different $\rho$'s (i.e., under $\theta \not\!\perp\!\!\!\perp s$) over $\psi$. Left: good-$k$ solution for fatigue with $\epsilon_1$ and $tN(0.5, 0.02)$ distribution. Center: good-$k$ solution for fatigue with $\epsilon_2$ and $tN(0.5, 0.02)$ distribution. Right: good-$k$ solution for fatigue with $\epsilon_1$ and $tN(1, 0.05)$ distribution.
Figure 4: HumanLikeExaminationSearch
Figure 5: Left: fraction of protected candidates in the solution of good-$k$ for different representational quotas $q$, for the $tN(0.5, 0.02)$ score distribution and with setting $n=400$, $k=20$, $\theta \mathrel{{\hbox{$\perp\mkern-10mu\perp$}}} s$. Center: RtB for different representational quotas $q$, for the $tN(0.5, 0.02)$ score distribution and with setting $n=400$, $k=20$, $\theta \mathrel{{\hbox{$\perp\mkern-10mu\perp$}}} s$. Right: RtB for different representational quotas $q$, for the $tN(1, 0.05)$ score distribution and with setting $n=400$, $k=20$, $\theta \mathrel{{\hbox{$\perp\mkern-10mu\perp$}}} s$.

Theorems & Definitions (2)

Remark 1
Remark 2

The Initial Screening Order Problem

TL;DR

Abstract

The Initial Screening Order Problem

Authors

TL;DR

Abstract

Table of Contents

Figures (5)

Theorems & Definitions (2)