Finding a Fair Scoring Function for Top-$k$ Selection: From Hardness to Practice

Guangya Cai

Finding a Fair Scoring Function for Top-$k$ Selection: From Hardness to Practice

Guangya Cai

TL;DR

This work studies fair top-$k$ selection under a linear scoring model, proving conditional hardness results that hinder universal scalable solutions in higher dimensions and motivating a two-pronged algorithmic strategy. For small $k$, a $k$-level-based method leverages duality and computational geometry to achieve practical speedups, while for large $k$, a MILP-based approach offers robust performance despite worst-case NP-hardness. Experimental evaluations on real datasets (e.g., COMPAS and IIT-JEE) show orders-of-magnitude improvements over state-of-the-art baselines and provide guidance on algorithm selection based on dimensionality and $k$. The study integrates hardness analysis, algorithm design, engineering optimization, and empirical evaluation to deliver a practically efficient solution with broad implications for fairness-aware decision-making systems.

Abstract

Selecting a subset of the $k$ "best" items from a dataset of $n$ items, based on a scoring function, is a key task in decision-making. Given the rise of automated decision-making software, it is important that the outcome of this process, called top-$k$ selection, is fair. Here we consider the problem of identifying a fair linear scoring function for top-$k$ selection. The function computes a score for each item as a weighted sum of its (numerical) attribute values, and must ensure that the selected subset includes adequate representation of a minority or historically disadvantaged group. Existing algorithms do not scale efficiently, particularly in higher dimensions. Our hardness analysis shows that in more than two dimensions, no algorithm is likely to achieve good scalability with respect to dataset size, and the computational complexity is likely to increase rapidly with dimensionality. However, the hardness results also provide key insights guiding algorithm design, leading to our two-pronged solution: (1) For small values of $k$, our hardness analysis reveals a gap in the hardness barrier. By addressing various engineering challenges, including achieving efficient parallelism, we turn this potential of efficiency into an optimized algorithm delivering substantial practical performance gains. (2) For large values of $k$, where the hardness is robust, we employ a practically efficient algorithm which, despite being theoretically worse, achieves superior real-world performance. Experimental evaluations on real-world datasets then explore scenarios where worst-case behavior does not manifest, identifying areas critical to practical performance. Our solution achieves speed-ups of up to several orders of magnitude compared to SOTA, an efficiency made possible through a tight integration of hardness analysis, algorithm design, practical engineering, and empirical evaluation.

Finding a Fair Scoring Function for Top-$k$ Selection: From Hardness to Practice

TL;DR

This work studies fair top-

selection under a linear scoring model, proving conditional hardness results that hinder universal scalable solutions in higher dimensions and motivating a two-pronged algorithmic strategy. For small

, a

-level-based method leverages duality and computational geometry to achieve practical speedups, while for large

, a MILP-based approach offers robust performance despite worst-case NP-hardness. Experimental evaluations on real datasets (e.g., COMPAS and IIT-JEE) show orders-of-magnitude improvements over state-of-the-art baselines and provide guidance on algorithm selection based on dimensionality and

. The study integrates hardness analysis, algorithm design, engineering optimization, and empirical evaluation to deliver a practically efficient solution with broad implications for fairness-aware decision-making systems.

Abstract

Selecting a subset of the

"best" items from a dataset of

items, based on a scoring function, is a key task in decision-making. Given the rise of automated decision-making software, it is important that the outcome of this process, called top-

selection, is fair. Here we consider the problem of identifying a fair linear scoring function for top-

selection. The function computes a score for each item as a weighted sum of its (numerical) attribute values, and must ensure that the selected subset includes adequate representation of a minority or historically disadvantaged group. Existing algorithms do not scale efficiently, particularly in higher dimensions. Our hardness analysis shows that in more than two dimensions, no algorithm is likely to achieve good scalability with respect to dataset size, and the computational complexity is likely to increase rapidly with dimensionality. However, the hardness results also provide key insights guiding algorithm design, leading to our two-pronged solution: (1) For small values of

, our hardness analysis reveals a gap in the hardness barrier. By addressing various engineering challenges, including achieving efficient parallelism, we turn this potential of efficiency into an optimized algorithm delivering substantial practical performance gains. (2) For large values of

, where the hardness is robust, we employ a practically efficient algorithm which, despite being theoretically worse, achieves superior real-world performance. Experimental evaluations on real-world datasets then explore scenarios where worst-case behavior does not manifest, identifying areas critical to practical performance. Our solution achieves speed-ups of up to several orders of magnitude compared to SOTA, an efficiency made possible through a tight integration of hardness analysis, algorithm design, practical engineering, and empirical evaluation.

Finding a Fair Scoring Function for Top-$k$ Selection: From Hardness to Practice

TL;DR

Abstract

Finding a Fair Scoring Function for Top-$k$ Selection: From Hardness to Practice

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (8)

Theorems & Definitions (16)