Table of Contents
Fetching ...

Learning a Decentralized Medium Access Control Protocol for Shared Message Transmission

Lorenzo Mario Amorosa, Zhan Gao, Roberto Verdone, Petar Popovski, Deniz Gündüz

TL;DR

This paper tackles decentralized medium access control for delivering multiple shared messages in large-scale IoT, where inter-node coordination is unavailable. It proves that the optimal solutions are deterministic and the problem is NP-hard, and introduces a decentralized unsupervised learning framework (DUSL) with per-message DNNs and an online adaptation mechanism to handle time-varying activation patterns. Through CTDE-based training, REINFORCE-style updates, and a deterministic inference path, DUSL achieves scalable, collision-tolerant MAC decisions that outperform distributed MAB baselines, especially as network size grows. The work provides theoretical performance bounds under distribution shifts and demonstrates robust, rapid adaptation to dynamic environments, making it suitable for URLLC and energy-constrained IoT deployments.

Abstract

In large-scale Internet of things networks, efficient medium access control (MAC) is critical due to the growing number of devices competing for limited communication resources. In this work, we consider a new challenge in which a set of nodes must transmit a set of shared messages to a central controller, without inter-node communication or retransmissions. Messages are distributed among random subsets of nodes, which must implicitly coordinate their transmissions over shared communication opportunities. The objective is to guarantee the delivery of all shared messages, regardless of which nodes transmit them. We first prove the optimality of deterministic strategies, and characterize the success rate degradation of a deterministic strategy under dynamic message-transmission patterns. To solve this problem, we propose a decentralized learning-based framework that enables nodes to autonomously synthesize deterministic transmission strategies aiming to maximize message delivery success, together with an online adaptation mechanism that maintains stable performance in dynamic scenarios. Extensive simulations validate the framework's effectiveness, scalability, and adaptability, demonstrating its robustness to varying network sizes and fast adaptation to dynamic changes in transmission patterns, outperforming existing multi-armed bandit approaches.

Learning a Decentralized Medium Access Control Protocol for Shared Message Transmission

TL;DR

This paper tackles decentralized medium access control for delivering multiple shared messages in large-scale IoT, where inter-node coordination is unavailable. It proves that the optimal solutions are deterministic and the problem is NP-hard, and introduces a decentralized unsupervised learning framework (DUSL) with per-message DNNs and an online adaptation mechanism to handle time-varying activation patterns. Through CTDE-based training, REINFORCE-style updates, and a deterministic inference path, DUSL achieves scalable, collision-tolerant MAC decisions that outperform distributed MAB baselines, especially as network size grows. The work provides theoretical performance bounds under distribution shifts and demonstrates robust, rapid adaptation to dynamic environments, making it suitable for URLLC and energy-constrained IoT deployments.

Abstract

In large-scale Internet of things networks, efficient medium access control (MAC) is critical due to the growing number of devices competing for limited communication resources. In this work, we consider a new challenge in which a set of nodes must transmit a set of shared messages to a central controller, without inter-node communication or retransmissions. Messages are distributed among random subsets of nodes, which must implicitly coordinate their transmissions over shared communication opportunities. The objective is to guarantee the delivery of all shared messages, regardless of which nodes transmit them. We first prove the optimality of deterministic strategies, and characterize the success rate degradation of a deterministic strategy under dynamic message-transmission patterns. To solve this problem, we propose a decentralized learning-based framework that enables nodes to autonomously synthesize deterministic transmission strategies aiming to maximize message delivery success, together with an online adaptation mechanism that maintains stable performance in dynamic scenarios. Extensive simulations validate the framework's effectiveness, scalability, and adaptability, demonstrating its robustness to varying network sizes and fast adaptation to dynamic changes in transmission patterns, outperforming existing multi-armed bandit approaches.

Paper Structure

This paper contains 21 sections, 3 theorems, 32 equations, 11 figures, 2 algorithms.

Key Result

Theorem 1

The set of optimal joint strategies for the optimization problem defined in (eq:optim) either consists of a single deterministic joint strategy or includes a finite number $D > 1$ of deterministic joint strategies along with infinitely many randomized ones, i.e.,$\forall\,\mathbf{X} \in \{0, 1\}^{L where $\oplus$ is the exclusive or, the existence of an element $x$ is denoted by $\exists x$, whil

Figures (11)

  • Figure 1: Example scenario where message repetition is crucial. Panels (a)-(d) demonstrate how a strategy with message repetition handles four possible active sets of shared messages with limited communication opportunities, ensuring that for every active set at least one copy of each shared message can be delivered without collision.
  • Figure 2: Reference scenario comprising $N$ nodes transmitting $L$ messages over $M$ communication opportunities to a central controller $S$. Nodes $n_1$ and $n_2$ transmit over the same communication opportunity $m_2$ simultaneously, resulting in a "collision".
  • Figure 3: DUSL learning framework: Each node independently determines communication strategies for messages using $L$ local DNNs, where feedback $\xi$ guides local training procedures.
  • Figure 4: Training in static scenarios with conditional activation patterns and $L = 1$.
  • Figure 5: Training in static scenarios with conditional activation patterns and multiple messages.
  • ...and 6 more figures

Theorems & Definitions (6)

  • Theorem 1
  • proof
  • Theorem 2
  • proof
  • Theorem 3
  • proof