Reinforcement Learning-based Home Energy Management with Heterogeneous Batteries and Stochastic EV Behaviour

Meng Yuan; Ye Wang; Xinghuo Yu; Torsten Wik; Changfu Zou

TL;DR

The paper addresses home energy management with PV, stationary storage, and EVs under stochastic EV usage by formulating the scheduling problem as a constrained Markov decision process (CMDP) and solving it with a Lagrangian soft actor-critic (SAC) algorithm. It models the distinct degradation dynamics of stationary and EV batteries, and it represents stochastic EV arrival time, departure time, and driving distance using Swedish travel data. The learned policy exploits price arbitrage, keeps the indoor temperature within its comfort bounds, and reduces both total cost and battery degradation relative to rule-based baselines. The result is a data-driven framework for home energy management systems (HEMS) that improves economic performance and battery longevity without sacrificing occupant comfort. The findings highlight the importance of accounting for battery heterogeneity and user behaviour in DRL-based home energy management.

Abstract

The widespread adoption of photovoltaic (PV) systems, electric vehicles (EVs), and stationary energy storage systems (ESS) in households increases system complexity while simultaneously offering new opportunities for energy regulation. However, effectively coordinating these resources under uncertainty remains challenging. This paper proposes a novel home energy management framework based on deep reinforcement learning (DRL) that jointly minimises energy expenditure and battery degradation while guaranteeing occupant comfort and EV charging requirements. Distinct from existing studies, we explicitly account for the heterogeneous degradation characteristics of stationary and EV batteries in the optimisation, alongside stochastic user behaviour regarding arrival time, departure time, and driving distance. The energy scheduling problem is formulated as a constrained Markov decision process (CMDP) and solved using a Lagrangian soft actor-critic (SAC) algorithm. This approach enables the agent to learn optimal control policies that enforce physical constraints, including indoor temperature bounds and the target EV state of charge upon departure, despite stochastic uncertainties. Numerical simulations over a one-year horizon demonstrate the effectiveness of the proposed framework in satisfying physical constraints while eliminating thermal oscillations and achieving significant economic benefits. Specifically, the method reduces the cumulative operating cost substantially compared to two standard rule-based baselines while simultaneously decreasing battery degradation costs by 8.44%.
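
The abstract does not spell out the update rules, so the following is only a generic sketch of the Lagrangian relaxation commonly paired with SAC for a CMDP, not the authors' implementation. It assumes a single constraint with a scalar cost signal (for example, a temperature-bound violation), a hypothetical cost limit, and a hypothetical learning rate; the names update_multiplier and penalised_reward are illustrative only.

```python
def update_multiplier(lam, avg_constraint_cost, limit, lr=0.05):
    """One projected dual-ascent step for a single Lagrange multiplier.

    The multiplier grows while the average constraint cost exceeds the
    limit and shrinks (never below zero) once the constraint is met.
    All arguments are illustrative assumptions, not values from the paper.
    """
    return max(0.0, lam + lr * (avg_constraint_cost - limit))


def penalised_reward(reward, constraint_cost, lam):
    """Reward passed to the actor-critic after Lagrangian relaxation."""
    return reward - lam * constraint_cost


if __name__ == "__main__":
    lam = 0.0
    # Hypothetical per-episode average constraint costs, made up for the demo.
    for avg_cost in [2.0, 1.5, 1.0, 0.4, 0.2]:
        lam = update_multiplier(lam, avg_cost, limit=0.5)
        print(f"avg_cost={avg_cost:.1f} -> lambda={lam:.3f}")
```

In this sketch the multiplier acts as an adaptive penalty weight: it rises while the constraint is violated on average and decays towards zero once the policy satisfies it, so the trade-off between cost and constraint satisfaction does not need to be hand-tuned.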

Paper Structure (28 sections, 44 equations, 6 figures, 6 tables)

Introduction
System Modelling
Model of Ordinary Residential Appliances
PV Generation Subsystem
HVAC System
Energy Storage System
EV Model
Power Balancing
Statistical Modelling of Uncertainty in Home Energy Management
Travel Behaviour Analysis
Battery SoC Uncertainty Analysis
Problem Formulation and Cost Consideration
User Satisfaction Cost
Battery Degradation Cost
RL-based Optimisation Formulation
...and 13 more sections

Figures (6)

Figure 1: Structure of the investigated home with a smart energy management system.
Figure 2: The distributions of leaving and arrival home time.
Figure 3: Daily travel distance distribution.
Figure 4: Learning curve of the proposed RL agent.
Figure 5: Energy scheduling performance of the proposed method over a two-day horizon. (a) SoC and electricity price, (b) Power profiles of the HVAC, ESS, EV, PV, and grid, (c) Operating schedules of household appliances. The green shaded area indicates EV at home.
...and 1 more figures
