What's in a Query: Polarity-Aware Distribution-Based Fair Ranking
Aparna Balagopalan, Kai Wang, Olawale Salaudeen, Asia Biega, Marzyeh Ghassemi
TL;DR
This work introduces DistFaiR, a divergence-based, distribution-aware framework for amortized fair ranking that accounts for the full distributions of cumulative attention and relevance across a sequence of queries, and crucially incorporates query polarity to prevent fairwashing. By defining both individual and group unfairness via divergences between attention and relevance distributions, and proving that group unfairness is upper-bounded by individual unfairness for a broad class of divergences, DistFaiR provides a principled path to improve fairness without sacrificing performance. The authors operationalize this through an ILP-based re-ranking that minimizes the worst-case divergence while maintaining DCG-based quality, and they validate the approach across synthetic and real datasets, showing that polarity-aware methods yield stronger fairness improvements and reveal fairness-utility tradeoffs. The work highlights the practical importance of incorporating query properties, demonstrates the potential for improved fairness at both individual and group levels, and discusses risks of fairwashing, offering a concrete, testable framework for polarity-aware fair ranking in safety-critical settings.
Abstract
Machine learning-driven rankings, where individuals (or items) are ranked in response to a query, mediate search exposure or attention in a variety of safety-critical settings. Thus, it is important to ensure that such rankings are fair. Under the goal of equal opportunity, attention allocated to an individual on a ranking interface should be proportional to their relevance across search queries. In this work, we examine amortized fair ranking -- where relevance and attention are cumulated over a sequence of user queries to make fair ranking more feasible in practice. Unlike prior methods that operate on expected amortized attention for each individual, we define new divergence-based measures for attention distribution-based fairness in ranking (DistFaiR), characterizing unfairness as the divergence between the distribution of attention and relevance corresponding to an individual over time. This allows us to propose new definitions of unfairness, which are more reliable at test time. Second, we prove that group fairness is upper-bounded by individual fairness under this definition for a useful class of divergence measures, and experimentally show that maximizing individual fairness through an integer linear programming-based optimization is often beneficial to group fairness. Lastly, we find that prior research in amortized fair ranking ignores critical information about queries, potentially leading to a fairwashing risk in practice by making rankings appear more fair than they actually are.
