Improving Generalization in Game Agents with Data Augmentation in Imitation Learning
Derek Yadgaroff, Alessandro Sestini, Konrad Tollmar, Ayca Ozcelikkale, Linus Gisslén
TL;DR
Imitation-learning agents in games often generalize poorly to unseen scenarios because of distribution shift between demonstrations and deployment. The authors propose applying data augmentation to feature-based state spaces within Behavioral Cloning, diversifying the training distribution without changing the actions, and they benchmark the approach across four distinct 3D test environments. They conduct a large study of 38 augmentation combinations (up to 3 augmentations per example) across 228 models and demonstrate consistent generalization improvements, with some configurations achieving up to $1.6$ times the baseline performance. The work provides practical guidance for practitioners and suggests directions for broader evaluation across tasks and environments.
Abstract
Imitation learning is an effective approach for training game-playing agents and, consequently, for efficient game production. However, generalization, the ability to perform well in related but unseen scenarios, is an essential requirement that remains an unsolved challenge for game AI. Generalization is difficult for imitation learning agents because it requires the algorithm to take meaningful actions outside of the training distribution. In this paper, we propose a solution to this challenge. Inspired by the success of data augmentation in supervised learning, we augment the training data so that the distribution of states and actions in the dataset better represents the real state-action distribution. This study evaluates methods for combining and applying data augmentations to observations in order to improve the generalization of imitation learning agents. It also provides a performance benchmark of these augmentations across several 3D environments. The results demonstrate that data augmentation is a promising framework for improving generalization in imitation learning agents.
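To make the idea concrete, the following is a minimal sketch of observation-only augmentation for a Behavioral Cloning dataset, assuming demonstrations are stored as (feature vector, action) pairs. The three augmentations shown (`gaussian_noise`, `uniform_scaling`, `feature_dropout`), their parameters, and the helper `augment_dataset` are hypothetical illustrations and are not necessarily the augmentations evaluated in this work. The sketch only shows the general pattern: up to three augmentations are composed on the state, while the action label is left unchanged.

```python
import random
from typing import Callable, List, Tuple

State = List[float]
Demo = Tuple[State, int]  # (feature-based observation, action label)
Augmentation = Callable[[State, random.Random], State]


def gaussian_noise(state: State, rng: random.Random, sigma: float = 0.05) -> State:
    # Hypothetical augmentation: add zero-mean Gaussian noise to every feature.
    return [x + rng.gauss(0.0, sigma) for x in state]


def uniform_scaling(state: State, rng: random.Random,
                    low: float = 0.9, high: float = 1.1) -> State:
    # Hypothetical augmentation: multiply all features by one random factor.
    factor = rng.uniform(low, high)
    return [x * factor for x in state]


def feature_dropout(state: State, rng: random.Random, p: float = 0.1) -> State:
    # Hypothetical augmentation: zero each feature independently with probability p.
    return [0.0 if rng.random() < p else x for x in state]


def augment_dataset(demos: List[Demo],
                    augmentations: List[Augmentation],
                    max_per_example: int = 3,
                    copies: int = 1,
                    seed: int = 0) -> List[Demo]:
    """Return the original demos plus `copies` augmented variants of each.

    Each variant composes between 1 and `max_per_example` distinct
    augmentations on the state; the action is copied unchanged.
    """
    rng = random.Random(seed)
    out: List[Demo] = list(demos)  # keep the original demonstrations
    for state, action in demos:
        for _ in range(copies):
            k = rng.randint(1, min(max_per_example, len(augmentations)))
            new_state = list(state)  # copy so the original is not mutated
            for aug in rng.sample(augmentations, k):
                new_state = aug(new_state, rng)
            out.append((new_state, action))
    return out
```

A hypothetical usage would be `augment_dataset(demos, [gaussian_noise, uniform_scaling, feature_dropout], copies=2)`, which triples the dataset size before it is passed to an ordinary supervised Behavioral Cloning training loop. Whether augmented samples are precomputed like this or generated on the fly per batch is a design choice that the sketch does not settle.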
