MACIE: Multi-Agent Causal Intelligence Explainer for Collective Behavior Understanding
Abraham Itzhak Weinberg
TL;DR
MACIE presents a principled framework for explainable collective intelligence in multi-agent RL by unifying structural causal models, interventional counterfactuals, and Shapley-value attribution. It introduces novel collective intelligence metrics—Synergy Index, Coordination Score, and Information Integration—to quantify emergent behavior and distinguish it from individual contributions, while generating natural language explanations for stakeholders. The approach is implemented with computational optimizations that deliver sub-second explanations on CPU and is validated across four MARL scenarios, achieving accurate attributions (mean |φ_i| ≈ 5.07) and robust emergence detection (SI up to 0.461). This work advances trustworthy MARL by enabling fair credit assignment, emergent-behavior analysis, and actionable explanations with practical efficiency for real-world deployment.
Abstract
As Multi Agent Reinforcement Learning systems are used in safety critical applications. Understanding why agents make decisions and how they achieve collective behavior is crucial. Existing explainable AI methods struggle in multi agent settings. They fail to attribute collective outcomes to individuals, quantify emergent behaviors, or capture complex interactions. We present MACIE Multi Agent Causal Intelligence Explainer, a framework combining structural causal models, interventional counterfactuals, and Shapley values to provide comprehensive explanations. MACIE addresses three questions. First, each agent's causal contribution using interventional attribution scores. Second, system level emergent intelligence through synergy metrics separating collective effects from individual contributions. Third, actionable explanations using natural language narratives synthesizing causal insights. We evaluate MACIE across four MARL scenarios: cooperative, competitive, and mixed motive. Results show accurate outcome attribution, mean phi_i equals 5.07, standard deviation less than 0.05, detection of positive emergence in cooperative tasks, synergy index up to 0.461, and efficient computation, 0.79 seconds per dataset on CPU. MACIE uniquely combines causal rigor, emergence quantification, and multi agent support while remaining practical for real time use. This represents a step toward interpretable, trustworthy, and accountable multi agent AI.
