A Machine Learning Approach to Boost the Vehicle-2-Grid Scheduling
Gabriele Agliardi, Giorgio Cortiana, Anton Dekusar, Kumar Ghosh, Naeimeh Mohseni, Corey O'Meara, Víctor Valls, Kavitha Yogaraj, Sergiy Zhuk
TL;DR
The paper tackles leveraging fleets of electric vehicles as a distributed BESS to provide capacity-related grid services in wholesale markets, focusing on the charging/discharging scheduling problem under operational constraints. It proposes a two-stage framework that first generates scheduling policies via approximate dynamic programming using historical data, then trains kernel-based classifiers (SVC/QSVC) to predict aggregate EV actions from the current state. Empirical results show that with large fleets, the approach attains objective values comparable to CPLEX and approximate DP but with substantially reduced run times, enabling frequent reoptimization as system conditions change. This work demonstrates a scalable, data-driven path to harness EVs for grid services in DAM/ICM environments, with practical implications for real-time energy arbitrage and grid flexibility.
Abstract
Electric Vehicles (EVs) are emerging as battery energy storage systems (BESSs) of increasing importance for different power grid services. However, the unique characteristics of EVs makes them more difficult to operate than dedicated BESSs. In this work, we apply a data-driven learning approach to leverage EVs as a BESS to provide capacity-related services to the grid. The approach uses machine learning to predict how to charge and discharge EVs while satisfying their operational constraints. As a paradigm application, we use flexible energy commercialization in the wholesale markets, but the approach can be applied to a broader range of capacity-related grid services. We evaluate the proposed approach numerically and show that when the number of EVs is large, we can obtain comparable objective values to CPLEX and approximate dynamic programming, but with shorter run times. These reduced run times are important because they allow us to (re)optimize frequently to adapt to the time-varying system conditions.
