Adaptive Safe Reinforcement Learning-Enabled Optimization of Battery Fast-Charging Protocols
Myisha A. Chowdhury, Saif S. S. Al-Wahaibi, Qiugang Lu
TL;DR
The paper addresses safe, rapid battery fast-charging via an adaptive safe reinforcement learning framework. It combines TD3 as the learning engine with an action-projection safety layer that uses Gaussian process surrogates to predict constraint behavior and enforce them with an upper confidence bound, ensuring safety with high probability. A static GP-based safety model provides a baseline, while an adaptive GP scheme learns residual dynamics in real time to cope with ambient and aging-induced changes. Simulation results with PyBaMM show that adaptive safe TD3 maintains temperature and voltage constraints under varying conditions, albeit with some conservative charging, highlighting the method's practical potential for reliable, fast-charging protocols.
Abstract
Optimizing charging protocols is critical for reducing battery charging time and decelerating battery degradation in applications such as electric vehicles. Recently, reinforcement learning (RL) methods have been adopted for such purposes. However, RL-based methods may not ensure system (safety) constraints, which can cause irreversible damages to batteries and reduce their lifetime. To this end, this work proposes an adaptive and safe RL framework to optimize fast charging strategies while respecting safety constraints with a high probability. In our method, any unsafe action that the RL agent decides will be projected into a safety region by solving a constrained optimization problem. The safety region is constructed using adaptive Gaussian process (GP) models, consisting of static and dynamic GPs, that learn from online experience to adaptively account for any changes in battery dynamics. Simulation results show that our method can charge the batteries rapidly with constraint satisfaction under varying operating conditions.
