Fast Calculation of Feature Contributions in Boosting Trees

Zhongli Jiang; Min Zhang; Dabao Zhang

Fast Calculation of Feature Contributions in Boosting Trees

Zhongli Jiang, Min Zhang, Dabao Zhang

TL;DR

The paper tackles global interpretability of tree ensembles by decomposing the explained variance via Shapley values of $R^2$ and introduces Q-SHAP, a polynomial-time algorithm for Shapley values of quadratic losses. It develops a single-tree core and then extends to boosting ensembles, using leaf-based reweighting and polynomial identities to compute feature contributions efficiently. Across simulations and real data, Q-SHAP achieves accurate, stable feature-specific $R^2$ estimates and dramatically outperforms SAGE and SPVIM in runtime. This work provides a scalable framework for global feature attribution in high-dimensional tree models and can generalize to broader quadratic loss functions.

Abstract

Recently, several fast algorithms have been proposed to decompose predicted value into Shapley values, enabling individualized feature contribution analysis in tree models. While such local decomposition offers valuable insights, it underscores the need for a global evaluation of feature contributions. Although coefficients of determination ($R^2$) allow for comparative assessment of individual features, individualizing $R^2$ is challenged by the underlying quadratic losses. To address this, we propose Q-SHAP, an efficient algorithm that reduces the computational complexity of calculating Shapley values for quadratic losses to polynomial time. Our simulations show that Q-SHAP not only improves computational efficiency but also enhances the accuracy of feature-specific $R^2$ estimates.

Fast Calculation of Feature Contributions in Boosting Trees

TL;DR

The paper tackles global interpretability of tree ensembles by decomposing the explained variance via Shapley values of

and introduces Q-SHAP, a polynomial-time algorithm for Shapley values of quadratic losses. It develops a single-tree core and then extends to boosting ensembles, using leaf-based reweighting and polynomial identities to compute feature contributions efficiently. Across simulations and real data, Q-SHAP achieves accurate, stable feature-specific

estimates and dramatically outperforms SAGE and SPVIM in runtime. This work provides a scalable framework for global feature attribution in high-dimensional tree models and can generalize to broader quadratic loss functions.

Abstract

) allow for comparative assessment of individual features, individualizing

is challenged by the underlying quadratic losses. To address this, we propose Q-SHAP, an efficient algorithm that reduces the computational complexity of calculating Shapley values for quadratic losses to polynomial time. Our simulations show that Q-SHAP not only improves computational efficiency but also enhances the accuracy of feature-specific

estimates.

Paper Structure (20 sections, 4 theorems, 34 equations, 12 figures, 6 tables, 2 algorithms)

This paper contains 20 sections, 4 theorems, 34 equations, 12 figures, 6 tables, 2 algorithms.

Introduction
Shapley Values of R-squared for Individual Features
Model specification
Empirical Estimation
From R-squared to a Quadratic Loss
From Quadratic Loss to Q-SHAP
The Algorithm Q-SHAP for Single Trees
Notations
Distributing the prediction to leaves
The Algorithm
The Algorithm Q-SHAP for Tree Ensembles from Boosting
Simulation Study
Real Data Analysis
Conclusion
Proofs
...and 5 more sections

Key Result

Proposition 1

For any well-defined $p, n, |F|$,

Figures (12)

Figure 1: Illustration of (a) a decision tree built with both features $X_1$ and $X_2$ and (b) its hypothetical tree with only feature $X_1$.
Figure 2: The mean absolute error (MAE) averaged over 1,000 datasets with $p=100$
Figure 3: The running time (seconds) in simulation study
Figure 4: Boxplots of (a) $X_1$-specific, (b) $X_2$-specific, (c) $X_3$-specific, and (d) the sum of all feature-specific $R^2$ in the three models with $n=500$, $p=100$. The dashed lines show the theoretical $R^2$.
Figure 5: Boxplots of (a) $X_1$-specific, (b) $X_2$-specific, (c) $X_3$-specific, and (d) the sum of all feature-specific $R^2$ in the three models with $n=1000$, $p=100$. The dashed lines show the theoretical $R^2$.
...and 7 more figures

Theorems & Definitions (4)

Proposition 1
Theorem 1
Proposition 2
Lemma 1

Fast Calculation of Feature Contributions in Boosting Trees

TL;DR

Abstract

Fast Calculation of Feature Contributions in Boosting Trees

Authors

TL;DR

Abstract

Table of Contents

Key Result

Figures (12)

Theorems & Definitions (4)