Natural-gas storage modelling by deep reinforcement learning
Tiziano Balaconi, Aldo Glielmo, Marco Taboga
TL;DR
GasRL addresses how optimal storage policies by a monopolistic operator influence natural-gas price dynamics under regulatory mandates. It couples a calibrated Italian market with a storage-operator RL agent trained via Soft Actor-Critic to maximize a multi-objective reward that includes $g_t$, $(\Delta p_t)^2$, $m_t$, and $n_t$ under inventory bounds $0 \le I_t \le I_{\max}$. The results show endogenously realistic price seasonality and volatility, profitability, and near-absence of market failures, with SAC outperforming other methods; regulatory experiments with a minimum-threshold of 83% November refilling modestly boost resilience to supply shocks at some profitability and volatility costs. The framework offers a reproducible platform for market analysis and regulatory design, with potential extensions to GPUs, multi-agent settings, and international market linkages.
Abstract
We introduce GasRL, a simulator that couples a calibrated representation of the natural gas market with a model of storage-operator policies trained with deep reinforcement learning (RL). We use it to analyse how optimal stockpile management affects equilibrium prices and the dynamics of demand and supply. We test various RL algorithms and find that Soft Actor Critic (SAC) exhibits superior performance in the GasRL environment: multiple objectives of storage operators - including profitability, robust market clearing and price stabilisation - are successfully achieved. Moreover, the equilibrium price dynamics induced by SAC-derived optimal policies have characteristics, such as volatility and seasonality, that closely match those of real-world prices. Remarkably, this adherence to the historical distribution of prices is obtained without explicitly calibrating the model to price data. We show how the simulator can be used to assess the effects of EU-mandated minimum storage thresholds. We find that such thresholds have a positive effect on market resilience against unanticipated shifts in the distribution of supply shocks. For example, with unusually large shocks, market disruptions are averted more often if a threshold is in place.
