Decentralized Covert Routing in Heterogeneous Networks Using Reinforcement Learning
Justin Kong, Terrence J. Moore, Fikadu T. Dagefu
TL;DR
This work addresses covert routing in heterogeneous networks where multiple communication modalities are available. It introduces a decentralized Q-learning framework (Q-covert routing) in which each node selects the next hop and a modality using only local feedback, aiming to maximize the end-to-end detection-exclusion probability $P_{ ext{DEP}}$ while enforcing a throughput constraint $U_{ ext{target}}$. The method defines state, action, and cost as $S_T$, $A_T$, and $c_T(a)=\ln(1/P_{ ext{DEP},h(a)})$, and updates $Q_T(s,a)$ via $Q_T(s,a) \leftarrow (1-\alpha)Q_T(s,a) + \alpha ( c_T(a) + \gamma \hat{c}_T(a) )$ with an $\epsilon$-greedy policy. Numerical results show that the decentralized approach closely matches the centralized optimum (e.g., $P_{ ext{DEP}}$ around 0.78 with a negligible gap) and outperforms naive routing methods, demonstrating scalability and robustness to Willie’s position.
Abstract
This letter investigates covert routing communications in a heterogeneous network where a source transmits confidential data to a destination with the aid of relaying nodes where each transmitter judiciously chooses one modality among multiple communication modalities. We develop a novel reinforcement learning-based covert routing algorithm that finds a route from the source to the destination where each node identifies its next hop and modality only based on the local feedback information received from its neighboring nodes. We show based on numerical simulations that the proposed covert routing strategy has only negligible performance loss compared to the optimal centralized routing scheme.
