Local Statistical Parity for the Estimation of Fair Decision Trees
Andrea Quintanilla, Johan Van Horebeek
TL;DR
This work introduces Local Statistical Parity, a node-level fairness criterion that implies global Statistical Parity for decision trees, and demonstrates how to enforce it within recursive tree estimation. It develops the Constrained Logistic Regression Tree (C-LRT), a CART-like algorithm that selects node splits via Constrained Logistic Regression with a covariance constraint $|\widehat{Cov}(sd_{\theta}(X), A, D)| \le c$, enabling a tunable trade-off between accuracy and fairness. A key theoretical result shows that sufficient local independence of node tests from the protected attribute leads to global fairness, while empirical results on four AF datasets illustrate controllable fairness improvements with a clear accuracy-Fairness trade-off and some degenerate cases under strong constraints. The practical contribution is a convex, interpretable framework for fair decision trees that integrates fairness into the estimation process without abandoning the recursive tree structure.
Abstract
Given the high computational complexity of decision tree estimation, classical methods construct a tree by adding one node at a time in a recursive way. To facilitate promoting fairness, we propose a fairness criterion local to the tree nodes. We prove how it is related to the Statistical Parity criterion, popular in the Algorithmic Fairness literature, and show how to incorporate it into standard recursive tree estimation algorithms. We present a tree estimation algorithm called Constrained Logistic Regression Tree (C-LRT), which is a modification of the standard CART algorithm using locally linear classifiers and imposing restrictions as done in Constrained Logistic Regression. Finally, we evaluate the performance of trees estimated with C-LRT on datasets commonly used in the Algorithmic Fairness literature, using various classification and fairness metrics. The results confirm that C-LRT successfully allows to control and balance accuracy and fairness.
