SARIMAX-Based Power Outage Prediction During Extreme Weather Events
Haoran Ye, Qiuzhuang Sun, Yang Yang
TL;DR
This work presents a SARIMAX-based framework for short-term power outage forecasting during extreme weather, leveraging a two-stage feature extraction pipeline to reduce 84 weather features to 37 representative inputs augmented by temporal embeddings and multi-scale lag features. Each of 83 counties is modeled with an independent SARIMAX$(1,0,1)$ using exogenous weather inputs, with a robust training pipeline that includes constant/correlation preprocessing and hierarchical fitting with fallbacks to ARIMA or historical means. The approach achieves $RMSE=177.2$, an $8.4\%$ improvement over a baseline of $RMSE=193.4$, demonstrating the value of careful feature engineering and robust optimization for resilience forecasting. The study also reveals that simpler statistical models can outperform complex neural architectures in data-limited, non-stationary outage scenarios, and discusses future directions including spatiotemporal models and rare-event techniques to further enhance predictive performance.
Abstract
This study develops a SARIMAX-based prediction system for short-term power outage forecasting during extreme weather events. Using hourly data from Michigan counties with outage counts and comprehensive weather features, we implement a systematic two-stage feature engineering pipeline: data cleaning to remove zero-variance and unknown features, followed by correlation-based filtering to eliminate highly correlated predictors. The selected features are augmented with temporal embeddings, multi-scale lag features, and weather variables with their corresponding lags as exogenous inputs to the SARIMAX model. To address data irregularity and numerical instability, we apply standardization and implement a hierarchical fitting strategy with sequential optimization methods, automatic downgrading to ARIMA when convergence fails, and historical mean-based fallback predictions as a final safeguard. The model is optimized separately for short-term (24 hours) and medium-term (48 hours) forecast horizons using RMSE as the evaluation metric. Our approach achieves an RMSE of 177.2, representing an 8.4\% improvement over the baseline method (RMSE = 193.4), thereby validating the effectiveness of our feature engineering and robust optimization strategy for extreme weather-related outage prediction.
