Optimizing Electric Vehicle Charging Station Placement Using Reinforcement Learning and Agent-Based Simulations
Minh-Duc Nguyen, Dung D. Le, Phi Long Nguyen
TL;DR
The paper tackles EV charging-station siting under dynamic urban conditions by coupling deep reinforcement learning (DRL) with a GAM-based agent simulation. It introduces a hybrid framework with dual Deep Q-Networks for location and port configuration and a hybrid reward that blends deterministic cues with simulation-derived feedback, enabling adaptive decision-making. In Hanoi, Vietnam, the approach achieves a 53.28% reduction in average waiting time compared with the initial state and outperforms static baselines, demonstrating robustness to real-world variability and scalability to future demand. The work provides a practical, data-driven pathway for scalable EV infrastructure planning, highlighting the value of simulation-informed rewards and Voronoi-based candidate locations to balance coverage, power constraints, and user experience.
Abstract
The rapid growth of electric vehicles (EVs) necessitates the strategic placement of charging stations to optimize resource utilization and minimize user inconvenience. Reinforcement learning (RL) offers an innovative approach to identifying optimal charging station locations; however, existing methods face challenges due to their deterministic reward systems, which limit efficiency. Because real-world conditions are dynamic and uncertain, a deterministic reward structure cannot fully capture the complexities of charging station placement. As a result, evaluation becomes costly and time-consuming, and less reflective of real-world scenarios. To address this challenge, we propose a novel framework that integrates deep RL with agent-based simulations to model EV movement and estimate charging demand in real time. Our approach employs a hybrid RL agent with dual Q-networks to select optimal locations and configure charging ports, guided by a hybrid reward function that combines deterministic factors with simulation-derived feedback. Case studies in Hanoi, Vietnam, show that our method reduces average waiting times by 53.28% compared to the initial state, outperforming static baseline methods. This scalable and adaptive solution enhances EV infrastructure planning, effectively addressing real-world complexities and improving user experience.
