Accurate estimation of feature importance faithfulness for tree models
Mateusz Gajewski, Adam Karczmarz, Mateusz Rapicki, Piotr Sankowski
TL;DR
This work introduces $PGI^2$, a perturbation-based faithfulness metric for feature rankings in tree ensembles, and shows that the squared prediction gap $\mathrm{PG}^2$ can be computed exactly in $O(n^2)$ time for given perturbation distributions. It also proposes a $\mathrm{PG}^2$-based greedy feature ordering to rank features, and compares its faithfulness to SHAP across multiple datasets. Through experiments, the authors demonstrate that the exact PG^2 computation is numerically stable and can outperform Monte Carlo approaches under tight time budgets, while the PG^2-based ranking often yields higher $\overline{\mathrm{PGI}^2}$ than SHAP on bigger models, with SHAP sometimes excelling on remove-and-retrain metrics. The results suggest that $\mathrm{PG}^2$ offers a principled, efficient alternative for measuring and utilizing feature importance in tree-based models with potential impact on explainability benchmarks and practical deployments.
Abstract
In this paper, we consider a perturbation-based metric of predictive faithfulness of feature rankings (or attributions) that we call PGI squared. When applied to decision tree-based regression models, the metric can be computed accurately and efficiently for arbitrary independent feature perturbation distributions. In particular, the computation does not involve Monte Carlo sampling that has been typically used for computing similar metrics and which is inherently prone to inaccuracies. Moreover, we propose a method of ranking features by their importance for the tree model's predictions based on PGI squared. Our experiments indicate that in some respects, the method may identify the globally important features better than the state-of-the-art SHAP explainer
