Rule-Based Conflict-Free Decision Framework in Swarm Confrontation
Zhaoqi Dong, Zhinan Wang, Quanqi Zheng, Bin Xu, Lei Chen, Jinhu Lv
TL;DR
The paper tackles jitter and deadlock in rule-based swarm decision-making by introducing a probabilistic finite state machine (PFSM) framework that governs transitions with a learned transition matrix $\boldsymbol{P}$ in a 2D adversarial swarm setting governed by double-integrator dynamics. It fuses a multistream deep convolutional network, comprising AgentNet, TeammateNet, and EnemyNet, to produce the PFSM transition probabilities via $\boldsymbol{P}=f_{\Lambda}(\boldsymbol{z})$, and optimizes these transitions with a PPO-based Actor-Critic under a carefully designed reward that penalizes deadlock and jitter. Key contributions include formal PFSM integration with neural architectures, a relational attention mechanism for teammate interactions, and a learning objective that stabilizes state transitions through sparsity and consistency terms in $L^{\text{CLIP}}(\theta)$. Experimental validation in both simulations and real unmanned ground vehicles demonstrates superior rewards and high win rates, with evidence of scalable performance as swarm size grows. Overall, the framework provides interpretable yet adaptive decision-making for robust swarm confrontation, with demonstrated potential for real-world deployment and transfer to larger, more complex multi-agent systems.
Abstract
Traditional rule-based decision-making methods with interpretable advantage, such as finite state machine, suffer from the jitter or deadlock(JoD) problems in extremely dynamic scenarios. To realize agent swarm confrontation, decision conflicts causing many JoD problems are a key issue to be solved. Here, we propose a novel decision-making framework that integrates probabilistic finite state machine, deep convolutional networks, and reinforcement learning to implement interpretable intelligence into agents. Our framework overcomes state machine instability and JoD problems, ensuring reliable and adaptable decisions in swarm confrontation. The proposed approach demonstrates effective performance via enhanced human-like cooperation and competitive strategies in the rigorous evaluation of real experiments, outperforming other methods.
