Socially Optimal Energy Usage via Adaptive Pricing
Jiayi Li, Matthew Motoki, Baosen Zhang
TL;DR
The paper addresses coordinating electricity usage under privacy constraints by introducing a two-time-scale mechanism in which an operator updates a price vector $\mathbf{p}$ based on the marginal social cost $\mathbf{e}(\mathbf{x}) = \nabla g(\sum_i \mathbf{x}_i)$ and users respond by solving $\min_{\mathbf{x}_i \in \mathcal{X}_i} f_i(\mathbf{x}_i) + \mathbf{p}^\top \mathbf{x}_i$. It proves that the iterative price updates converge to a unique $\mathbf{p}^{*}$ and the associated user actions $\mathbf{x}^{*}(\mathbf{p}^{*})$ solve the global social welfare problem $\min_{\mathbf{x}} \sum_i f_i(\mathbf{x}_i) + g\left(\sum_i \mathbf{x}_i\right)$, even when $f_i$ are nonconvex or when $T>1$, using continuous-time Lyapunov analyses. Convergence is guaranteed under two sufficient conditions: (i) a quadratic system cost $g(\mathbf{z}) = \tfrac{1}{2} \mathbf{z}^T \mathbf{B} \mathbf{z}$ or (ii) strictly convex differentiable $f_i$ with strictly convex differentiable $g$, with proofs relying on either a quadratic Lyapunov function or a Bregman-divergence based Lyapunov function. Simulations on single-period and multi-period load scenarios, including Q-learning based load control for water heaters, demonstrate rapid convergence and meaningful reductions in both social cost and peak demand, suggesting practical privacy-preserving demand response via adaptive pricing.
Abstract
A central challenge in using price signals to coordinate the electricity consumption of a group of users is the operator's lack of knowledge of the users due to privacy concerns. In this paper, we develop a two-time-scale incentive mechanism that alternately updates between the users and a system operator. As long as the users can optimize their own consumption subject to a given price, the operator does not need to know or attempt to learn any private information of the users for price design. Users adjust their consumption following the price and the system redesigns the price based on the users' consumption. We show that under mild assumptions, this iterative process converges to the social welfare solution. In particular, the cost of the users need not always be convex and its consumption can be the output of a machine learning-based load control algorithm.
