Reinforcement Learning for Monetary Policy Under Macroeconomic Uncertainty: Analyzing Tabular and Function Approximation Methods
Tony Wang, Kyle Feinstein, Sheryl Chen
TL;DR
The paper evaluates nine reinforcement learning methods for dynamic monetary policy under macroeconomic uncertainty, using a data-driven, FRED-based environment with a linear-Gaussian state transition and a discrete-action MDP. Contrary to expectations, simple tabular Q-learning often outperforms deep and Bayesian methods and traditional rules, highlighting robustness of discretized policies in this setting. The results underscore both the potential and current limitations of applying modern RL to macro policy, suggesting RL as a decision-support tool with uncertainty quantification rather than a replacement for human judgment. The work also emphasizes the importance of problem characteristics, discretization, and stability when selecting RL algorithms for economic applications.
Abstract
We study how a central bank should dynamically set short-term nominal interest rates to stabilize inflation and unemployment when macroeconomic relationships are uncertain and time-varying. We model monetary policy as a sequential decision-making problem where the central bank observes macroeconomic conditions quarterly and chooses interest rate adjustments. Using publically accessible historical Federal Reserve Economic Data (FRED), we construct a linear-Gaussian transition model and implement a discrete-action Markov Decision Process with a quadratic loss reward function. We chose to compare nine different reinforcement learning style approaches against Taylor Rule and naive baselines, including tabular Q-learning variants, SARSA, Actor-Critic, Deep Q-Networks, Bayesian Q-learning with uncertainty quantification, and POMDP formulations with partial observability. Surprisingly, standard tabular Q-learning achieved the best performance (-615.13 +- 309.58 mean return), outperforming both enhanced RL methods and traditional policy rules. Our results suggest that while sophisticated RL techniques show promise for monetary policy applications, simpler approaches may be more robust in this domain, highlighting important challenges in applying modern RL to macroeconomic policy.
