Graph Attention Network for Predicting Duration of Large-Scale Power Outages Induced by Natural Disasters
Chenghao Duan, Chuanyi Ji
TL;DR
This work tackles predicting the duration of large-scale power outages caused by natural disasters by leveraging spatial structure in outage data. It introduces BiGAT, a bimodal Graph Attention Network with an unsupervised pre-training stage that assigns county nodes to one of two embedding clusters, followed by semi-supervised, attention-based learning to classify outage duration into three categories: short ($<2$ days), medium ($2$-$6$ days), and long ($>6$ days). Using real data from four Atlantic hurricanes (2017–2020) across 501 counties and 11.6 million customers, BiGAT outperforms XGBoost, Random Forest, GCN, and GAT by 2–15% in overall and class-wise metrics, achieving about 93% average accuracy. The model’s ability to incorporate spatial dependence and heterogeneity renders it a promising tool for resilience planning and emergency response, though generalization to unseen events remains an area for future work, including multi-event training and feature importance analyses.
Abstract
Natural disasters such as hurricanes, wildfires, and winter storms have induced large-scale power outages in the U.S., resulting in tremendous economic and societal impacts. Accurately predicting power outage recovery and impact is key to resilience of power grid. Recent advances in machine learning offer viable frameworks for estimating power outage duration from geospatial and weather data. However, three major challenges are inherent to the task in a real world setting: spatial dependency of the data, spatial heterogeneity of the impact, and moderate event data. We propose a novel approach to estimate the duration of severe weather-induced power outages through Graph Attention Networks (GAT). Our network uses a simple structure from unsupervised pre-training, followed by semi-supervised learning. We use field data from four major hurricanes affecting $501$ counties in eight Southeastern U.S. states. The model exhibits an excellent performance ($>93\%$ accuracy) and outperforms the existing methods XGBoost, Random Forest, GCN and simple GAT by $2\% - 15\%$ in both the overall performance and class-wise accuracy.
