SOC-Boundary and Battery Aging Aware Hierarchical Coordination of Multiple EV Aggregates Among Multi-stakeholders with Multi-Agent Constrained Deep Reinforcement Learning
Xin Chen
TL;DR
The paper tackles the challenge of coordinating large-scale EVs for vehicle-to-grid (V2G) services under battery conditioning constraints, aiming to balance the interests of the DSO, EV aggregators (EVAs), and EV users. It introduces a hierarchical framework that combines Multi-Agent Constrained Policy Optimization (MACPO) with a Proof-of-Stake (PoS) power allocation to ensure safe, scalable coordination while integrating SOC, SOP, and SOH into the decision process. Key contributions include the SOC/SOP/SOH-aware MACPO-based EVA coordination, the PoS-inspired decentralized power allocation, and a rigorous simulation showing improved grid stability, reduced battery aging, and favorable cost outcomes across multiple stakeholders. The results demonstrate the approach’s potential for enabling high-penetration renewable energy integration through robust, large-scale V2G coordination, with practical implications for grid reliability and EV participation incentives.
Abstract
As electric vehicles (EV) become more prevalent and advances in electric vehicle electronics continue, vehicle-to-grid (V2G) techniques and large-scale scheduling strategies are increasingly important to promote renewable energy utilization and enhance the stability of the power grid. This study proposes a hierarchical multistakeholder V2G coordination strategy based on safe multi-agent constrained deep reinforcement learning (MCDRL) and the Proof-of-Stake algorithm to optimize benefits for all stakeholders, including the distribution system operator (DSO), electric vehicle aggregators (EVAs) and EV users. For DSO, the strategy addresses load fluctuations and the integration of renewable energy. For EVAs, energy constraints and charging costs are considered. The three critical parameters of battery conditioning, state of charge (SOC), state of power (SOP), and state of health (SOH), are crucial to the participation of EVs in V2G. Hierarchical multi-stakeholder V2G coordination significantly enhances the integration of renewable energy, mitigates load fluctuations, meets the energy demands of the EVAs, and reduces charging costs and battery degradation simultaneously.
