Projected Boosting with Fairness Constraints: Quantifying the Cost of Fair Training Distributions
Amir Asiaee, Kaveh Aryan
TL;DR
This work addresses enforcing group fairness in boosting by introducing FairProj, a projection-based approach that decouples the ensemble-induced distribution used for coefficient updates from a fairness-constrained training distribution. At each round, FairProj projects the current exponential-weights distribution $q^t$ onto a convex fairness set to obtain $w^t$, trains the weak learner on $w^t$, and computes the boosting coefficient $\alpha_t$ under $q^t$, preserving the AdaBoost-style exponential-loss dynamics. Theoretical contributions include a tight edge-transfer bound $\gamma_t^{(q)} \geq \gamma_t^{(w)} - \delta_t$ with $\delta_t = \sqrt{\mathrm{KL}(w^t \| q^t)/2}$ and an exponential loss bound $L_{ m exp}(f_T) \leq n \exp\big(-2 \sum_{t=1}^T (\gamma_t^{(w)} - \delta_t)^2\big)$, which explicitly quantifies the cost of enforcing fairness in boosting. The paper also provides a closed-form KL-projection solution, a practical algorithm (FairProj) with warm-starting, and empirical evidence across standard benchmarks showing meaningful fairness-accuracy tradeoffs and stable training dynamics. Overall, it offers a principled framework to analyze and quantify the tradeoffs between fairness constraints and boosting performance, enabling targeted, data-driven fairness in sequential ensemble learning.
Abstract
Boosting algorithms enjoy strong theoretical guarantees: when weak learners maintain positive edge, AdaBoost achieves geometric decrease of exponential loss. We study how to incorporate group fairness constraints into boosting while preserving analyzable training dynamics. Our approach, FairBoost, projects the ensemble-induced exponential-weights distribution onto a convex set of distributions satisfying fairness constraints (as a reweighting surrogate), then trains weak learners on this fair distribution. The key theoretical insight is that projecting the training distribution reduces the effective edge of weak learners by a quantity controlled by the KL-divergence of the projection. We prove an exponential-loss bound where the convergence rate depends on weak learner edge minus a "fairness cost" term $δ_t = \sqrt{\mathrm{KL}(w^t \| q^t)/2}$. This directly quantifies the accuracy-fairness tradeoff in boosting dynamics. Experiments on standard benchmarks validate the theoretical predictions and demonstrate competitive fairness-accuracy tradeoffs with stable training curves.
