Efficient Policy Adaptation for Voltage Control Under Unknown Topology Changes
Jie Feng, Yuanyuan Shi, Deepjyoti Deka
TL;DR
The paper tackles voltage control in distribution networks with uncertain topology changes by developing a topology-aware online policy adaptation framework. It combines data-driven estimation of voltage–reactive power sensitivities with a topology-change detector and a sparse line-change identification method, enabling fast updates to a pretrained monotone neural-network controller. The approach yields high line-change identification accuracy with small data windows and substantial improvements in control performance over non-adaptive and regression-based baselines, demonstrated on IEEE 13-bus and SCE 56-bus systems. By exploiting radial structure and change sparsity, it delivers real-time adaptation and robust voltage regulation under topology reconfigurations and load variations, with potential impact on scalable, DER-enabled distribution grids.
Abstract
Reinforcement learning (RL) has shown great potential for designing voltage control policies, but their performance often degrades under changing system conditions such as topology reconfigurations and load variations. We introduce a topology-aware online policy optimization framework that leverages data-driven estimation of voltage-reactive power sensitivities to achieve efficient policy adaptation. Exploiting the sparsity of topology-switching events, where only a few lines change at a time, our method efficiently detects topology changes and identifies the affected lines and parameters, enabling fast and accurate sensitivity updates without recomputing the full sensitivity matrix. The estimated sensitivity is subsequently used for online policy optimization of a pre-trained neural-network-based RL controller. Simulations on both the IEEE 13-bus and SCE 56-bus systems demonstrate over 90 percent line identification accuracy, using only 15 data points. The proposed method also significantly improves voltage regulation performance compared with non-adaptive policies and adaptive policies that rely on regression-based online optimization methods for sensitivity estimation.
