Dynamic Incentive Selection for Hierarchical Convex Model Predictive Control
Akshay Thirugnanam, Koushil Sreenath
TL;DR
This work addresses the design of incentives in hierarchical MPC where an upper-level BiMPC leads a population of lower-level LoMPCs by providing incentives that steer their optimal inputs. By framing this as an incentive Stackelberg game, the authors establish linear and linear-convex incentive structures that yield incentive controllability and allow the BiMPC to achieve team-optimal behavior without explicit knowledge of the LoMPC costs. They develop iterative, majorization-based algorithms to compute optimal incentives, prove convergence properties, and extend the approach to multiple LoMPCs with a robust BiMPC reformulation that guarantees bounded error relative to the team optimum. The dynamic price control example for EV charging demonstrates scalability to large EV populations and shows how ISO pricing can achieve valley-filling behavior while preserving feasibility. Overall, the framework provides scalable, privacy-preserving incentive design for hierarchical MPC with provable bounds and practical applicability to grid-scale dynamic pricing.
Abstract
In this paper, we discuss incentive design for hierarchical model predictive control (MPC) systems viewed as Stackelberg games. We consider a hierarchical MPC formulation where, given a lower-level convex MPC (LoMPC), the upper-level system solves a bilevel MPC (BiMPC) subject to the constraint that the lower-level system inputs are optimal for the LoMPC. Such hierarchical problems are challenging due to optimality constraints in the BiMPC formulation. We analyze an incentive Stackelberg game variation of the problem, where the BiMPC provides additional incentives for the LoMPC cost function, which grants the BiMPC influence over the LoMPC inputs. We show that for such problems, the BiMPC can be reformulated as a simpler optimization problem, and the optimal incentives can be iteratively computed without knowing the LoMPC cost function. We extend our formulation for the case of multiple LoMPCs and propose an algorithm that finds bounded suboptimal solutions for the BiMPC. We demonstrate our algorithm for a dynamic price control example, where an independent system operator (ISO) sets the electricity prices for electric vehicle (EV) charging with the goal of minimizing a social cost and satisfying electricity generation constraints. Notably, our method scales well to large EV population sizes.
