Learning with Adaptive Conservativeness for Distributionally Robust Optimization: Incentive Design for Voltage Regulation
Zhirui Liang, Qi Li, Joshua Comden, Andrey Bernstein, Yury Dvorkin
TL;DR
The paper tackles incentive design for voltage regulation under information asymmetry between the DSO and DERAs by formulating a Stackelberg game and introducing a model-based online learning approach to infer the incentive–DERA response mapping. It advances a distributionally robust incentive design that uses a Wasserstein- metric ambiguity set and yields a convex reformulation, enabling tractable optimization. A gradient-based method adaptively tunes the conservativeness level based on historical performance, supported by convergence analysis and numerical experiments. The work provides a principled framework for robust, learning-enabled incentives in distribution networks, with practical implications for reliable voltage support under uncertainty.
Abstract
Information asymmetry between the Distribution System Operator (DSO) and Distributed Energy Resource Aggregators (DERAs) obstructs designing effective incentives for voltage regulation. To capture this effect, we employ a Stackelberg game-theoretic framework, where the DSO seeks to overcome the information asymmetry and refine its incentive strategies by learning from DERA behavior over multiple iterations. We introduce a model-based online learning algorithm for the DSO, aimed at inferring the relationship between incentives and DERA responses. Given the uncertain nature of these responses, we also propose a distributionally robust incentive design model to control the probability of voltage regulation failure and then reformulate it into a convex problem. This model allows the DSO to periodically revise distribution assumptions on uncertain parameters in the decision model of the DERA. Finally, we present a gradient-based method that permits the DSO to adaptively modify its conservativeness level, measured by the size of a Wasserstein metric-based ambiguity set, according to historical voltage regulation performance. The effectiveness of our proposed method is demonstrated through numerical experiments.
