Trust-Based Social Learning for Communication (TSLEC) Protocol Evolution in Multi-Agent Reinforcement Learning
Abraham Itzhak Weinberg
TL;DR
TSLEC addresses the problem of slow, independent emergence of communication protocols in MARL by introducing explicit, trust-modulated peer teaching. The method integrates Q-learning with emergent communication, dynamic trust networks, and mission adaptation to accelerate convergence while yielding compositional languages that remain robust under changing objectives. Key findings include a $23.9\%$ reduction in episodes-to-convergence, $\mathcal{C}=0.38$, $\Phi>0.867$, and a strong $r=0.743$ correlation between trust and teaching effectiveness, indicating effective knowledge filtering. This work demonstrates that explicit social learning can fundamentally speed up coordination in multi-agent systems and informs design principles for scalable, interpretable, and safe AI.
Abstract
Emergent communication in multi-agent systems typically occurs through independent learning, resulting in slow convergence and potentially suboptimal protocols. We introduce TSLEC (Trust-Based Social Learning with Emergent Communication), a framework where agents explicitly teach successful strategies to peers, with knowledge transfer modulated by learned trust relationships. Through experiments with 100 episodes across 30 random seeds, we demonstrate that trust-based social learning reduces episodes-to-convergence by 23.9% (p < 0.001, Cohen's d = 1.98) compared to independent emergence, while producing compositional protocols (C = 0.38) that remain robust under dynamic objectives (Phi > 0.867 decoding accuracy). Trust scores strongly correlate with teaching quality (r = 0.743, p < 0.001), enabling effective knowledge filtering. Our results establish that explicit social learning fundamentally accelerates emergent communication in multi-agent coordination.
