Massively Multiagent Minigames for Training Generalist Agents
Kyoung Whan Choe, Ryan Sullivan, Joseph Suárez
TL;DR
Meta MMO extends Neural MMO by providing a configurable suite of minigames with domain randomization and adaptive difficulty to study generalization in environments with many agents. It enables training a single generalist policy across multiple minigames using PPO/IPPO in a decentralized setting, and demonstrates that the generalist can match or exceed specialist performance with the same target-task data while offering up to a threefold increase in training speed. Additional contributions include new minigames, team-oriented training wrappers, and an open-source release of environment, baselines, and training code under MIT license, supporting research on curriculum learning, coordination, and cross-task transfer in large-scale MARL. Together, these advances provide a practical, scalable benchmark for probing generalization, coordination, and curriculum design in many-agent RL.
Abstract
We present Meta MMO, a collection of many-agent minigames for use as a reinforcement learning benchmark. Meta MMO is built on top of Neural MMO, a massively multiagent environment that has been the subject of two previous NeurIPS competitions. Our work expands Neural MMO with several computationally efficient minigames. We explore generalization across Meta MMO by learning to play several minigames with a single set of weights. We release the environment, baselines, and training code under the MIT license. We hope that Meta MMO will spur additional progress on Neural MMO and, more generally, will serve as a useful benchmark for many-agent generalization.
