When Machine Learning Gets Personal: Understanding Fairness of Personalized Models
Louisa Cornelis, Guillermo Bernárdez, Haewon Jeong, Nina Miolane
TL;DR
This paper tackles the question of when personalization in machine learning benefits both predictive accuracy and explanation quality. It introduces a unified Benefit of Personalization (BoP) framework that applies to classification and regression, defining population and group costs $C(h)$ and $C(h,s)$ and the BoP metric $\operatorname{BoP}(h_0,h_p)=C(h_0)-C(h_p)$, with a focus on both prediction (BoP-P) and explanations (BoP-X). The authors derive information-theoretic lower bounds on the reliability of BoP testing, including exponential-family specializations and a maximum attribute-count bound via Lambert W, and show that regression can tolerate more personal attributes than classification. Experiments on MIMIC-III demonstrate heterogeneous effects: personalization can improve accuracy and explanations in some subgroups while offering limited or mixed gains in others, underscoring the need to evaluate both prediction and explainability in healthcare settings. Overall, the framework provides practical tools to balance accuracy, fairness, and interpretability in personalized models, guiding safer deployment of personalized ML in high-stakes contexts.
Abstract
Personalization in machine learning involves tailoring models to individual users by incorporating personal attributes such as demographic or medical data. While personalization can improve prediction accuracy, it may also amplify biases and reduce explainability. This work introduces a unified framework to evaluate the impact of personalization on both prediction accuracy and explanation quality across classification and regression tasks. We derive novel upper bounds for the number of personal attributes that can be used to reliably validate benefits of personalization. Our analysis uncovers key trade-offs. We show that regression models can potentially utilize more personal attributes than classification models. We also demonstrate that improvements in prediction accuracy due to personalization do not necessarily translate to enhanced explainability -- underpinning the importance to evaluate both metrics when personalizing machine learning models in critical settings such as healthcare. Validated with a real-world dataset, this framework offers practical guidance for balancing accuracy, fairness, and interpretability in personalized models.
