MARS: A Meta-Adaptive Reinforcement Learning Framework for Risk-Aware Multi-Agent Portfolio Management
Jiayi Chen, Jing Li, Guiling Wang
TL;DR
MARS tackles non-stationarity and risk in portfolio management by combining a heterogeneous ensemble of Safety-Critic agents with explicit risk profiles and a high level Meta-Adaptive Controller that dynamically orchestrates the ensemble. The two-tier design enables robust adaptation across market regimes while enforcing practical trading constraints via a risk overlay. Empirical results on DJI and HSI show superior risk-adjusted returns and capital preservation in bear markets, with ablations confirming the necessity of both MAC and HAE components for performance gains. The work offers a scalable, adaptive framework for risk-aware multi-agent portfolio optimization with potential for real-world deployment.
Abstract
Reinforcement Learning (RL) has shown significant promise in automated portfolio management; however, effectively balancing risk and return remains a central challenge, as many models fail to adapt to dynamically changing market conditions. We propose Meta-controlled Agents for a Risk-aware System (MARS), a novel framework addressing this through a multi-agent, risk-aware approach. MARS replaces monolithic models with a Heterogeneous Agent Ensemble, where each agent's unique risk profile is enforced by a Safety-Critic network to span behaviors from capital preservation to aggressive growth. A high-level Meta-Adaptive Controller (MAC) dynamically orchestrates this ensemble, shifting reliance between conservative and aggressive agents to minimize drawdown during downturns while seizing opportunities in bull markets. This two-tiered structure leverages behavioral diversity rather than explicit feature engineering to ensure a disciplined portfolio robust across market regimes. Experiments on major international indexes confirm that our framework significantly reduces maximum drawdown and volatility while maintaining competitive returns.
