A Fair Post-Processing Method based on the MADD Metric for Predictive Student Models
Mélina Verger, Chunyang Fan, Sébastien Lallé, François Bouchet, Vanda Luengo
TL;DR
This work addresses fairness in predictive student models by introducing the Model Absolute Density Distance ($MADD$) as a fairness metric and proposing a post-processing method that adjusts predicted probabilities via a mapping controlled by a fairness coefficient $\lambda$. The method converges the group-specific density distributions toward a target distribution using a convex combination, while optimizing a joint objective $\mathcal{L}(\lambda)$ that balances accuracy and fairness. Experiments on simulated data and the Open University Learning Analytics Dataset (OULAD) demonstrate substantial fairness improvements with only modest losses in predictive accuracy, and the approach remains practical without requiring access to the original training data or model. The authors provide open-source code and data at GitHub, highlighting the method's potential for real-world deployment and future expansion to multiple sensitive attributes.
Abstract
Predictive student models are increasingly used in learning environments. However, due to the rising social impact of their usage, it is now all the more important for these models to be both sufficiently accurate and fair in their predictions. To evaluate algorithmic fairness, a new metric has been developed in education, namely the Model Absolute Density Distance (MADD). This metric enables us to measure how different a predictive model behaves regarding two groups of students, in order to quantify its algorithmic unfairness. In this paper, we thus develop a post-processing method based on this metric, that aims at improving the fairness while preserving the accuracy of relevant predictive models' results. We experiment with our approach on the task of predicting student success in an online course, using both simulated and real-world educational data, and obtain successful results. Our source code and data are in open access at https://github.com/melinaverger/MADD .
