The more the merrier: logical and multistage processors in credit scoring
Arturo Pérez-Peralta, Sandra Benítez-Peña, Rosa E. Lillo
TL;DR
This paper addresses fairness in credit scoring by introducing two methods that extend fairness beyond single-attribute settings: logical processors (LP) that map multiple sensitive attributes into one binary variable, and multistage processors (MP) that stack fairness methods across pre-, in-, and post-processing pipeline stages. The authors formalize fairness criteria (Independence, Separation, Sufficiency) and argue that, for credit scoring, separation is often the most appropriate metric, then implement and evaluate LPs and MPs across simulations and a German loan dataset. Empirical results show LPs can effectively handle multiple sensitive attributes with fairness and accuracy preserved, while MPs can yield synergistic improvements—sometimes surpassing individual methods—in both accuracy and separation, albeit with dependencies on the LP choice and post-processing stages. The findings suggest practical pathways to enforce multi-attribute fairness in finance and point to future work on expanding the processor repertoire, enhancing interpretability, and tuning hyper-parameters for robust deployment.
Abstract
Machine Learning algorithms are ubiquitous in key decision-making contexts such as organizational justice or healthcare, which has spawned a great demand for fairness in these procedures. In this paper we focus on the application of fair ML in finance, more concretely on the use of fairness techniques on credit scoring. This paper makes two contributions. On the one hand, it addresses the existent gap concerning the application of established methods in the literature to the case of multiple sensitive variables through the use of a new technique called logical processors (LP). On the other hand, it also explores the novel method of multistage processors (MP) to investigate whether the combination of fairness methods can work synergistically to produce solutions with improved fairness or accuracy. Furthermore, we examine the intersection of these two lines of research by exploring the integration of fairness methods in the multivariate case. The results are very promising and suggest that logical processors are an appropriate way of handling multiple sensitive variables. Furthermore, multistage processors are capable of improving the performance of existing methods.
