A Framework for Empowering Reinforcement Learning Agents with Causal Analysis: Enhancing Automated Cryptocurrency Trading
Rasoul Amirzadeh, Dhananjay Thiruvady, Asef Nazari, Mong Shan Ee
TL;DR
The paper tackles profitable automated trading in volatile cryptocurrency markets by proposing the CausalReinforceNet (CRN) framework, which combines Bayesian-network–based feature engineering with dynamic Bayesian network–driven price-direction signals to augment reinforcement learning traders. Two model-free RL agents (PPO and DDPG) are deployed within CRN, leveraging curated state representations and adaptive position sizing under risk controls, and are evaluated against Buy-and-Hold and a baseline OHLCV RL model across Binance Coin, Ethereum, Litecoin, Ripple, and Tether. The main contributions include the CRN architecture that fuses causal analysis with RL, per-coin feature selection via BN, DBN-informed observations, and a comparative empirical analysis of two RL algorithms under realistic transaction costs and risk rules. Results show CRN generally improves profitability over baselines, with performance varying by coin and algorithm, and provide insights into decision patterns arising from causal features and price-direction predictions. The work underscores the value of explainable, risk-aware RL frameworks for cryptocurrency trading and points to future extensions such as higher-frequency trading, adaptive thresholds, portfolio-level RL, and broader domain applications.
Abstract
Despite advances in artificial intelligence-enhanced trading methods, developing a profitable automated trading system remains challenging in the rapidly evolving cryptocurrency market. This research focuses on developing a reinforcement learning (RL) framework to tackle the complexities of trading five prominent altcoins: Binance Coin, Ethereum, Litecoin, Ripple, and Tether. To this end, we present the CausalReinforceNet~(CRN) framework, which integrates both Bayesian and dynamic Bayesian network techniques to empower the RL agent in trade decision-making. We develop two agents using the framework based on distinct RL algorithms to analyse performance compared to the Buy-and-Hold benchmark strategy and a baseline RL model. The results indicate that our framework surpasses both models in profitability, highlighting CRN's consistent superiority, although the level of effectiveness varies across different cryptocurrencies.
