Dropout-Based Rashomon Set Exploration for Efficient Predictive Multiplicity Estimation
Hsiang Hsu, Guihong Li, Shaohan Hu, Chun-Fu, Chen
TL;DR
This work addresses predictive multiplicity arising when many near-optimal models (the Rashomon set) disagree on individual predictions. It introduces a dropout-based framework to efficiently explore near-optimal models in neural networks, establishing theoretical links between dropout hyperparameters and Rashomon-set loss deviations for linear models and FFNNs. The approach yields substantial runtime speedups over re-training and enables practical mitigation via dropout ensembles and model selection guided by multiplicity estimates. Empirical results across diverse domains show dropout-based exploration consistently outperforms baselines in estimating multiplicity metrics, while maintaining comparable accuracy, and demonstrate its applicability to real-world tasks such as finance, medicine, and computer vision. This framework offers a scalable, principled path to quantify and reduce predictive multiplicity, enhancing fairness and reliability in deployment of complex models.
Abstract
Predictive multiplicity refers to the phenomenon in which classification tasks may admit multiple competing models that achieve almost-equally-optimal performance, yet generate conflicting outputs for individual samples. This presents significant concerns, as it can potentially result in systemic exclusion, inexplicable discrimination, and unfairness in practical applications. Measuring and mitigating predictive multiplicity, however, is computationally challenging due to the need to explore all such almost-equally-optimal models, known as the Rashomon set, in potentially huge hypothesis spaces. To address this challenge, we propose a novel framework that utilizes dropout techniques for exploring models in the Rashomon set. We provide rigorous theoretical derivations to connect the dropout parameters to properties of the Rashomon set, and empirically evaluate our framework through extensive experimentation. Numerical results show that our technique consistently outperforms baselines in terms of the effectiveness of predictive multiplicity metric estimation, with runtime speedup up to $20\times \sim 5000\times$. With efficient Rashomon set exploration and metric estimation, mitigation of predictive multiplicity is then achieved through dropout ensemble and model selection.
