Modelling crypto markets by multi-agent reinforcement learning
Johann Lussange, Stefano Vrizzi, Stefano Palminteri, Boris Gutkin
TL;DR
This work develops SYMBA, a multi-agent reinforcement learning crypto market simulator calibrated to Binance data for 153 assets between 2018 and 2022. Each agent runs two RL modules (forecasting and trading) and bases asset valuations on both market prices and a cointegration-based fundamental estimate, integrated within a centralized double-auction order book. The study demonstrates that SYMBA reproduces key stylized facts of crypto markets, including non-normal returns, volatility and volume clustering, and decaying price-autocorrelations, while allowing analysis of learning dynamics and agent-level behavior. The model offers a data-driven framework for risk management, policy evaluation, and strategy optimization in rapidly evolving crypto markets, with clear avenues for incorporating intraday dynamics and regulatory changes in future work.
Abstract
Building on a previous foundation work (Lussange et al. 2020), this study introduces a multi-agent reinforcement learning (MARL) model simulating crypto markets, which is calibrated to the Binance's daily closing prices of $153$ cryptocurrencies that were continuously traded between 2018 and 2022. Unlike previous agent-based models (ABM) or multi-agent systems (MAS) which relied on zero-intelligence agents or single autonomous agent methodologies, our approach relies on endowing agents with reinforcement learning (RL) techniques in order to model crypto markets. This integration is designed to emulate, with a bottom-up approach to complexity inference, both individual and collective agents, ensuring robustness in the recent volatile conditions of such markets and during the COVID-19 era. A key feature of our model also lies in the fact that its autonomous agents perform asset price valuation based on two sources of information: the market prices themselves, and the approximation of the crypto assets fundamental values beyond what those market prices are. Our MAS calibration against real market data allows for an accurate emulation of crypto markets microstructure and probing key market behaviors, in both the bearish and bullish regimes of that particular time period.
