Evaluating AI fairness in credit scoring with the BRIO tool
Greta Coraglia, Francesco A. Genco, Pellegrino Piantadosi, Enrico Bagli, Pietro Giuffrida, Davide Posillipo, Giuseppe Primiero
TL;DR
This work addresses fairness in AI-driven credit scoring by applying the model-agnostic BRIO tool to quantify bias and risk across sensitive attributes in the German Credit Dataset. The authors construct a transparent credit-scorecard using Optibinning, achieving $\text{AUC}=0.8$ and $\text{Gini}=0.6$, and then quantify fairness risk via BRIO’s aggregated tests, including divergence-based measures. A new integrated risk metric combines multiple fairness checks, and a revenue analysis links fairness risk to provisions and profit, revealing a potential threshold (around $620$) that balances fairness and profitability. Overall, the study demonstrates how BRIO can guide fair lending practices while considering financial implications, and it outlines paths to extend the approach to other datasets and contexts.
Abstract
We present a method for quantitative, in-depth analyses of fairness issues in AI systems with an application to credit scoring. To this aim we use BRIO, a tool for the evaluation of AI systems with respect to social unfairness and, more in general, ethically undesirable behaviours. It features a model-agnostic bias detection module, presented in \cite{DBLP:conf/beware/CoragliaDGGPPQ23}, to which a full-fledged unfairness risk evaluation module is added. As a case study, we focus on the context of credit scoring, analysing the UCI German Credit Dataset \cite{misc_statlog_(german_credit_data)_144}. We apply the BRIO fairness metrics to several, socially sensitive attributes featured in the German Credit Dataset, quantifying fairness across various demographic segments, with the aim of identifying potential sources of bias and discrimination in a credit scoring model. We conclude by combining our results with a revenue analysis.
