Auto-Encoding Adversarial Imitation Learning
Kaifeng Zhang, Rui Zhao, Ziming Zhang, Yang Gao
TL;DR
AEAIL tackles the reliance on environment rewards in reinforcement learning by replacing the discriminator-based reward in adversarial imitation learning with a reconstruction-error reward from an auto-encoder. This surrogate enables denser, more informative feedback and yields robustness to noisy expert demonstrations, while its theoretical grounding ties the objective to the Wasserstein distance between expert and learned policies. Empirically, AEAIL surpasses state-of-the-art baselines on state- and image-based tasks and maintains strong performance across diverse auto-encoder types and distance divergences. The approach offers a simple, scalable, and robust pathway for imitation learning in high-dimensional settings with imperfect demonstrations.
Abstract
Reinforcement learning (RL) provides a powerful framework for decision-making, but its application in practice often requires a carefully designed reward function. Adversarial Imitation Learning (AIL) sheds light on automatic policy acquisition without access to the reward signal from the environment. In this work, we propose Auto-Encoding Adversarial Imitation Learning (AEAIL), a robust and scalable AIL framework. To induce expert policies from demonstrations, AEAIL utilizes the reconstruction error of an auto-encoder as a reward signal, which provides more information for optimizing policies than the prior discriminator-based ones. Subsequently, we use the derived objective functions to train the auto-encoder and the agent policy. Experiments show that our AEAIL performs superior compared to state-of-the-art methods on both state and image based environments. More importantly, AEAIL shows much better robustness when the expert demonstrations are noisy.
