Fairness, Accuracy, and Unreliable Data
Kevin Stangl
TL;DR
This work addresses how standard empirical risk minimization can fail when data exhibit unreliable properties such as fairness bias, sequential screening, or adversarial perturbations. It develops mathematical models to study three axes—fairness under biased data, multi-stage screening with fairness constraints, and adversarial robustness—proposing interventions like Equal Opportunity, re-weighting, and randomized hypothesis expansions to recover or preserve performance. The thesis provides formal theorems and proofs delineating when fairness constraints improve or harm performance, exact and approximate algorithms for optimizing fairness-constrained pipelines, and defense strategies against strategic manipulation and malicious noise, validated by synthetic and semi-synthetic experiments. Collectively, these results offer practical guidance for designing reliable AI systems in high-stakes settings, along with rigorous boundaries on what fairness interventions can and cannot achieve under data unreliability. Theoretical insights are complemented by algorithmic frameworks (DP-based and FPTAS approaches) and empirical validation to guide practitioners in choosing distribution-aware fairness, screening, and robustness strategies.
Abstract
This thesis investigates three areas targeted at improving the reliability of machine learning; fairness in machine learning, strategic classification, and algorithmic robustness. Each of these domains has special properties or structure that can complicate learning. A theme throughout this thesis is thinking about ways in which a `plain' empirical risk minimization algorithm will be misleading or ineffective because of a mis-match between classical learning theory assumptions and specific properties of some data distribution in the wild. Theoretical understanding in eachof these domains can help guide best practices and allow for the design of effective, reliable, and robust systems.
