Beyond Playtesting: A Generative Multi-Agent Simulation System for Massively Multiplayer Online Games
Ran Zhang, Kun Ouyang, Tiancheng Ma, Yida Yang, Dong Fang
TL;DR
This work tackles MMO economy design by introducing a scalable generative multi-agent simulation driven by LLMs fine-tuned on real player data, paired with a data-driven environment model. It builds high-fidelity Player Agents and a Battle Server to capture microscopic decision-making and macroscopic outcomes, while treating certain non-player systems deterministically. The authors validate their framework through extensive experiments, showing strong alignment with real-world behavior and the ability to predict intervention effects, enabling cost-efficient design optimization. The approach offers interpretable insights and a practical pathway to test-then-deploy gameplay changes before live rollout.
Abstract
Optimizing numerical systems and mechanism design is crucial for enhancing player experience in Massively Multiplayer Online (MMO) games. Traditional optimization approaches rely on large-scale online experiments or parameter tuning over predefined statistical models, which are costly, time-consuming, and may disrupt player experience. Although simplified offline simulation systems are often adopted as alternatives, their limited fidelity prevents agents from accurately mimicking real player reasoning and reactions to interventions. To address these limitations, we propose a generative agent-based MMO simulation system empowered by Large Language Models (LLMs). By applying Supervised Fine-Tuning (SFT) and Reinforcement Learning (RL) on large-scale real player behavioral data, we adapt LLMs from general priors to game-specific domains, enabling realistic and interpretable player decision-making. In parallel, a data-driven environment model trained on real gameplay logs reconstructs dynamic in-game systems. Experiments demonstrate strong consistency with real-world player behaviors and plausible causal responses under interventions, providing a reliable, interpretable, and cost-efficient framework for data-driven numerical design optimization.
