Once Burned, Twice Shy? The Effect of Stock Market Bubbles on Traders that Learn by Experience
Haibei Zhu, Svitlana Vyetrenko, Serafin Grundl, David Byrd, Kshama Dwarakanath, Tucker Balch
TL;DR
This paper investigates how experience with stock market bubbles shapes reinforcement learning traders in a multi-agent market environment (ABIDES). By training agents with varying levels of bubble exposure and evaluating them in bubble-rich scenarios, the authors show that bubble experience pushes agents from momentum-based to value-based strategies, which in turn suppress bubble formation, magnitude, and duration. The approach combines PPO-trained RL agents, SHAP feature attribution, and controlled bubble-generation mechanics (including a mean-reverting fundamental model) to quantify the impact of experience on market dynamics. The findings imply that learning from rare, extreme market events can modulate boom-bust cycles, with memory effects that gradually fade yet offer potential stability benefits for autonomous trading systems.
Abstract
We study how experience with asset price bubbles changes the trading strategies of reinforcement learning (RL) traders and ask whether the change in trading strategies helps to prevent future bubbles. We train the RL traders in a multi-agent market simulation platform, ABIDES, and compare the strategies of traders trained with and without bubble experience. We find that RL traders without bubble experience behave like short-term momentum traders, whereas traders with bubble experience behave like value traders. Therefore, RL traders without bubble experience amplify bubbles, whereas RL traders with bubble experience tend to suppress and sometimes prevent them. This finding suggests that learning from experience is a mechanism for a boom and bust cycle where the experience of a collapsing bubble makes future bubbles less likely for a period of time until the memory fades and bubbles become more likely to form again.
