A Roadmap Towards Improving Multi-Agent Reinforcement Learning With Causal Discovery And Inference
Giovanni Briglia, Stefano Mariani, Franco Zambonelli
TL;DR
This work tackles the challenge of transferring causal reasoning to multi-agent reinforcement learning by proposing Causality-Driven Reinforcement Learning (CDRL), which learns a minimal Structural Causal Model and uses $do$-based causal inference to filter actions. The approach is algorithm-agnostic and aims to improve policy efficacy, learning efficiency, and safety across MARL tasks with partial observability and continuous spaces. Empirical results across navigation, flocking, and give-way scenarios show conditional gains and notable failures, underscoring the importance of cooperation structure and the need for sophisticated causal discovery and collaborative learning among agents. The paper further maps a roadmap for advancing causal MARL, including informative interventions, robust model assessment, soft interventions for continuous domains, and formal convergence considerations.
Abstract
Causal reasoning is increasingly used in Reinforcement Learning (RL) to improve the learning process in several dimensions: efficacy of learned policies, efficiency of convergence, generalisation capabilities, safety and interpretability of behaviour. However, applications of causal reasoning to Multi-Agent RL (MARL) are still mostly unexplored. In this paper, we take the first step in investigating the opportunities and challenges of applying causal reasoning in MARL. We measure the impact of a simple form of causal augmentation in state-of-the-art MARL scenarios increasingly requiring cooperation, and with state-of-the-art MARL algorithms exploiting various degrees of collaboration between agents. Then, we discuss the positive as well as negative results achieved, giving us the chance to outline the areas where further research may help to successfully transfer causal RL to the multi-agent setting.
