Domain Adversarial Reinforcement Learning
Bonnie Li, Vincent François-Lavet, Thang Doan, Joelle Pineau
TL;DR
Domain Adversarial Reinforcement Learning tackles generalization in visual RL by learning domain-invariant latent representations. It combines Soft Actor-Critic with a domain-adversarial loss implemented via a gradient reversal layer to align features across background domains. On DeepMind Control tasks, DARL yields substantial zero-shot generalization to unseen and non-stationary visual contexts, with latent spaces that remain task-relevant. This approach demonstrates that enforcing domain invariance in representations can significantly improve robustness of pixel-based RL policies for real-world-like variability.
Abstract
We consider the problem of generalization in reinforcement learning where visual aspects of the observations might differ, e.g. when there are different backgrounds or change in contrast, brightness, etc. We assume that our agent has access to only a few of the MDPs from the MDP distribution during training. The performance of the agent is then reported on new unknown test domains drawn from the distribution (e.g. unseen backgrounds). For this "zero-shot RL" task, we enforce invariance of the learned representations to visual domains via a domain adversarial optimization process. We empirically show that this approach allows achieving a significant generalization improvement to new unseen domains.
