Confident Feature Ranking
Bitya Neuhof, Yuval Benjamini
TL;DR
The paper tackles instability in post-hoc FI rankings by formalizing a base-to-global framework that separates base FI variability from global FI values. It introduces procedures to construct simultaneous confidence intervals for true feature ranks via pairwise tests and FWER control (Holm, Min-P), enabling reliable top-k selections even under correlation and nonnormality. The authors validate the approach with synthetic experiments, SHAP/TreeSHAP-based simulations, and real data (bike sharing, COMPAS, Nomao), showing high simultaneous coverage and competitive efficiency, while highlighting practical considerations like runtime and tail behavior. This work provides a principled, interpretable way to quantify ranking uncertainty in FI analyses, with implications for more stable model explanations and feature selection decisions.
Abstract
Machine learning models are widely applied in various fields. Stakeholders often use post-hoc feature importance methods to better understand the input features' contribution to the models' predictions. The interpretation of the importance values provided by these methods is frequently based on the relative order of the features (their ranking) rather than the importance values themselves. Since the order may be unstable, we present a framework for quantifying the uncertainty in global importance values. We propose a novel method for the post-hoc interpretation of feature importance values that is based on the framework and pairwise comparisons of the feature importance values. This method produces simultaneous confidence intervals for the features' ranks, which include the ``true'' (infinite sample) ranks with high probability, and enables the selection of the set of the top-k important features.
