Domain-informed explainable boosting machines for trustworthy lateral spread predictions

Cheng-Hsi Hsiao; Krishna Kumar; Ellen M. Rathje

Domain-informed explainable boosting machines for trustworthy lateral spread predictions

Cheng-Hsi Hsiao, Krishna Kumar, Ellen M. Rathje

Abstract

Explainable Boosting Machines (EBMs) provide transparent predictions through additive shape functions, enabling direct inspection of feature contributions. However, EBMs can learn non-physical relationships that reduce their reliability in natural hazard applications. This study presents a domain-informed framework to improve the physical consistency of EBMs for lateral spreading prediction. Our approach modifies learned shape functions based on domain knowledge. These modifications correct non-physical behavior while maintaining data-driven patterns. We apply the method to the 2011 Christchurch earthquake dataset and correct non-physical trends observed in the original EBM. The resulting model produces more physically consistent global and local explanations, with an acceptable tradeoff in accuracy (4--5\%).

Domain-informed explainable boosting machines for trustworthy lateral spread predictions

Abstract

Paper Structure (14 sections, 16 equations, 16 figures, 2 tables)

This paper contains 14 sections, 16 equations, 16 figures, 2 tables.

Introduction
Dataset Description: 2011 Christchurch Earthquake Lateral Spreading
Methodology
Explainable Boosting Machine (EBM)
EBM model for lateral spreading
Incorporating domain knowledge in EBM
Univariate function modification
Bivariate function modification
Results
Performance Degradation
Global Explanation
Local Explanation
Conclusion
Bivariate Function Maps

Figures (16)

Figure 1: Spatial distribution of 2011 Christchurch earthquake showing lateral spreading Yes/No sites.
Figure 2: Distribution of input features and outliers analysis.
Figure 3: Relationship between the score $s$ and probability $P(y=1)$ through the logistic function. The classification threshold $P=0.5$ corresponds to $s=0$.
Figure 4: Round-robin training for univariate shape functions.
Figure 5: Shape function examples. (a)(b) 1D shape function examples for $f_1$ and $f_2$. The x axis is the feature value and the y axis is the score. (d) 2D shape function example for $f_{1,2}$. The color denotes the score. The red point denotes the example point with feature value $x_1=5$ and $x_2=0.5$.
...and 11 more figures

Domain-informed explainable boosting machines for trustworthy lateral spread predictions

Abstract

Domain-informed explainable boosting machines for trustworthy lateral spread predictions

Authors

Abstract

Table of Contents

Figures (16)