PPGN: Physics-Preserved Graph Networks for Real-Time Fault Location in Distribution Systems with Limited Observation and Labels
Wenting Li, Deepjyoti Deka
TL;DR
The paper tackles real-time fault location in distribution networks under limited observability and scarce labels by introducing PPGN, a two-stage physics-preserved graph network. Stage I builds a graph embedding with an adjustable adjacency $A$ to reflect grid physics and sparse observability, while Stage II propagates labels through a correlation graph with adjacency $B$ derived from Stage I embeddings, grounded by a random-walk interpretation. The approach yields superior accuracy and robustness on IEEE 123-node and 37-node feeders, notably at low label rates and under load/topology variations, outperforming three baselines. The work highlights the value of integrating physical grid relationships into graph learning for critical monitoring tasks and points to future PMU placement optimization to further enhance performance.
Abstract
Electrical faults may trigger blackouts or wildfires without timely monitoring and control strategy. Traditional solutions for locating faults in distribution systems are not real-time when network observability is low, while novel black-box machine learning methods are vulnerable to stochastic environments. We propose a novel Physics-Preserved Graph Network (PPGN) architecture to accurately locate faults at the node level with limited observability and labeled training data. PPGN has a unique two-stage graph neural network architecture. The first stage learns the graph embedding to represent the entire network using a few measured nodes. The second stage finds relations between the labeled and unlabeled data samples to further improve the location accuracy. We explain the benefits of the two-stage graph configuration through a random walk equivalence. We numerically validate the proposed method in the IEEE 123-node and 37-node test feeders, demonstrating the superior performance over three baseline classifiers when labeled training data is limited, and loads and topology are allowed to vary.
