humancompatible.detect: a Python Toolkit for Detecting Bias in AI Models
German M. Matilla, Jiri Nemecek, Illia Kryvoviaz, Jakub Marecek
TL;DR
This work tackles the challenge of detecting and evaluating bias in AI, with a focus on intersectional bias in tabular data. It introduces two complementary methods—Maximum Subgroup Discrepancy (MSD) and a subsampled distance test—implemented in the Python toolkit humancompatible.detect, aiming for scalable and interpretable bias analysis. MSD identifies the most discrepant subgroup across all protected attribute intersections, leveraging optimization and offering linear sample complexity in the number of protected attributes, while the subsampled test provides threshold-based bias checks with PAC-style guarantees for individual subgroups. The open-source toolkit, licensed under Apache-2.0, supports multiple input modalities and a user-friendly API to aid regulatory compliance and practical fairness auditing in high-risk AI deployments.
Abstract
There is a strong recent emphasis on trustworthy AI. In particular, international regulations, such as the AI Act, demand that AI practitioners measure data quality on the input and estimate bias on the output of high-risk AI systems. However, there are many challenges involved, including scalability (MMD) and computability (Wasserstein-1) issues of traditional methods for estimating distances on measure spaces. Here, we present humancompatible.detect, a toolkit for bias detection that addresses these challenges. It incorporates two newly developed methods to detect and evaluate bias: maximum subgroup discrepancy (MSD) and subsampled $\ell_\infty$ distances. It has an easy-to-use API documented with multiple examples. humancompatible.detect is licensed under the Apache License, Version 2.0.
