Beyond explaining: XAI-based Adaptive Learning with SHAP Clustering for Energy Consumption Prediction
Tobias Clement, Hung Truong Thanh Nguyen, Nils Kemmerzell, Mohamed Abdelaal, Davor Stjelja
TL;DR
This work addresses robust energy consumption prediction for buildings under distribution shifts by integrating explainable AI with adaptive learning. The core method, SCAL, combines SHAP-based explanations, dimensionality reduction, and clustering to form an explanation space, from which clustering characteristics guide iterative model refinement. Three building blocks—SHAP clustering in explanation space, extraction of clustering characteristics, and adaptive model refinement—enable interpretable, Sharper-generalizing models across regression and classification tasks. Empirical results on building energy data, plus transferability tests on Financial Distress and Power datasets, show improved test performance and richer explanations, with the approach offering practical benefits for robustness and anomaly detection in real-world deployments.
Abstract
This paper presents an approach integrating explainable artificial intelligence (XAI) techniques with adaptive learning to enhance energy consumption prediction models, with a focus on handling data distribution shifts. Leveraging SHAP clustering, our method provides interpretable explanations for model predictions and uses these insights to adaptively refine the model, balancing model complexity with predictive performance. We introduce a three-stage process: (1) obtaining SHAP values to explain model predictions, (2) clustering SHAP values to identify distinct patterns and outliers, and (3) refining the model based on the derived SHAP clustering characteristics. Our approach mitigates overfitting and ensures robustness in handling data distribution shifts. We evaluate our method on a comprehensive dataset comprising energy consumption records of buildings, as well as two additional datasets to assess the transferability of our approach to other domains, regression, and classification problems. Our experiments demonstrate the effectiveness of our approach in both task types, resulting in improved predictive performance and interpretable model explanations.
