Increasing Fairness via Combination with Learning Guarantees
Yijun Bian, Kun Zhang
TL;DR
This work addresses hidden discrimination in ML by introducing discriminative risk ($\mathrm{DR}$), a fairness quality that integrates individual and group perspectives. It develops first- and second-order oracle bounds and PAC bounds showing that ensemble voting can reduce discrimination through a cancellation-of-biases effect, and then contributes POAF, a Pareto-optimal ensemble pruning method that improves fairness with limited accuracy loss. The authors validate DR and the theoretical bounds across binary and multi-class tasks, and demonstrate that POAF yields fairer ensembles with competitive or superior performance compared with baseline fairness-aware methods and state-of-the-art pruning approaches. The study provides learning-guaranteed principles for boosting fairness in ensembles, with practical implications for deploying fair, multi-attribute, and multi-class classifiers.
Abstract
The concern about hidden discrimination in ML models is growing, as their widespread real-world application increasingly impacts human lives. Various techniques, including commonly used group fairness measures and several fairness-aware ensemble-based methods, have been developed to enhance fairness. However, existing fairness measures typically focus on only one aspect -- either group or individual fairness, and the hard compatibility among them indicates a possibility of remaining biases even when one of them is satisfied. Moreover, existing mechanisms to boost fairness usually present empirical results to show validity, yet few of them discuss whether fairness can be boosted with certain theoretical guarantees. To address these issues, we propose a fairness quality measure named 'discriminative risk (DR)' to reflect both individual and group fairness aspects. Furthermore, we investigate its properties and establish the first- and second-order oracle bounds to show that fairness can be boosted via ensemble combination with theoretical learning guarantees. The analysis is suitable for both binary and multi-class classification. A pruning method is also proposed to utilise our proposed measure and comprehensive experiments are conducted to evaluate the effectiveness of the proposed methods.
