Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making

Alex Chohlas-Wood; Madison Coots; Henry Zhu; Emma Brunskill; Sharad Goel

Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making

Alex Chohlas-Wood, Madison Coots, Henry Zhu, Emma Brunskill, Sharad Goel

TL;DR

The paper critiques axiomatic fairness in predictive decision-making and proposes a consequentialist fairness framework that foregrounds downstream outcomes. It develops a policy-optimization approach that elicits stakeholder preferences over outcomes and budgets, then computes utility-maximizing policies by solving a linear program, with extensions to online learning via contextual bandits under budgets. The authors provide sample-complexity bounds for learning under tabular and linear reward models and demonstrate an adaptive learning strategy (epsilon-greedy, Thompson sampling, UCB) through simulations inspired by a rideshare-to-court program, showing improved utility and reduced spending disparities. This work offers a principled, data-driven method to balance efficiency and equity in resource-constrained settings and demonstrates practical tools for policymakers to operationalize context-sensitive equity.

Abstract

In an attempt to make algorithms fair, the machine learning literature has largely focused on equalizing decisions, outcomes, or error rates across race or gender groups. To illustrate, consider a hypothetical government rideshare program that provides transportation assistance to low-income people with upcoming court dates. Following this literature, one might allocate rides to those with the highest estimated treatment effect per dollar, while constraining spending to be equal across race groups. That approach, however, ignores the downstream consequences of such constraints, and, as a result, can induce unexpected harms. For instance, if one demographic group lives farther from court, enforcing equal spending would necessarily mean fewer total rides provided, and potentially more people penalized for missing court. Here we present an alternative framework for designing equitable algorithms that foregrounds the consequences of decisions. In our approach, one first elicits stakeholder preferences over the space of possible decisions and the resulting outcomes--such as preferences for balancing spending parity against court appearance rates. We then optimize over the space of decision policies, making trade-offs in a way that maximizes the elicited utility. To do so, we develop an algorithm for efficiently learning these optimal policies from data for a large family of expressive utility functions. In particular, we use a contextual bandit algorithm to explore the space of policies while solving a convex optimization problem at each step to estimate the best policy based on the available information. This consequentialist paradigm facilitates a more holistic approach to equitable decision-making.

Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making

TL;DR

Abstract

Learning to be Fair: A Consequentialist Approach to Equitable Decision-Making

TL;DR

Abstract

Paper Structure

Table of Contents

Key Result

Figures (9)

Theorems & Definitions (20)