Adversarial Diffusion for Robust Reinforcement Learning
Daniele Foffano, Alessio Russo, Alexandre Proutiere
TL;DR
This work tackles robustness in reinforcement learning by integrating diffusion-based trajectory modeling with CVaR-based adversarial risk optimization. It introduces Adversarial Diffusion for Robust Reinforcement Learning (AD-RRL), where adversarially guided diffusion samples worst-case trajectories to train policies within a Dyna-style loop, increasing resilience to modeling errors and environmental uncertainty. The approach yields stronger robustness than state-of-the-art baselines on MuJoCo tasks while maintaining competitive nominal performance, though at higher computational cost due to diffusion steps. This framework advances practical robust RL by combining trajectory-level generation with tail-risk optimization, offering a scalable path toward safer, more reliable policies in uncertain dynamics.
Abstract
Robustness to modeling errors and uncertainties remains a central challenge in reinforcement learning (RL). In this work, we address this challenge by leveraging diffusion models to train robust RL policies. Diffusion models have recently gained popularity in model-based RL due to their ability to generate full trajectories "all at once", mitigating the compounding errors typical of step-by-step transition models. Moreover, they can be conditioned to sample from specific distributions, making them highly flexible. We leverage conditional sampling to learn policies that are robust to uncertainty in environment dynamics. Building on the established connection between Conditional Value at Risk (CVaR) optimization and robust RL, we introduce Adversarial Diffusion for Robust Reinforcement Learning (AD-RRL). AD-RRL guides the diffusion process to generate worst-case trajectories during training, effectively optimizing the CVaR of the cumulative return. Empirical results across standard benchmarks show that AD-RRL achieves superior robustness and performance compared to existing robust RL methods.
