HRL-TSCH: A Hierarchical Reinforcement Learning-based TSCH Scheduler for IIoT
F. Fernando Jurado-Lasso, Charalampos Orfanidis, J. F. Jurado, Xenofon Fafoutis
TL;DR
The paper tackles the challenge of adaptable TSCH scheduling in IIoT networks, where application requirements demand balancing throughput, delay, and power. It introduces HRL-TSCH, a hierarchical reinforcement learning framework with a higher-level policy for link management and lower-level policies for slot/channel assignment, guided by a multi-objective cost function and trained via Deep Q-Networks. A TSCH lookup component translates RL decisions into concrete schedule cells, and a comprehensive evaluation in Contiki-NG/Cooja demonstrates superior trade-offs over centralized and decentralized baselines (Orchestra, MSF, QL-TSCH) across power, latency, and throughput, validated through Pareto-front analyses. The work highlights HRL-TSCH as a scalable, adaptable solution for dynamic IIoT environments, with potential extensions to slotframe size and contention-based scheduling to further enhance robustness and performance.
Abstract
The Industrial Internet of Things (IIoT) demands adaptable Networked Embedded Systems (NES) for optimal performance. Combined with recent advances in Artificial Intelligence (AI), tailored solutions can be developed to meet specific application requirements. This study introduces HRL-TSCH, an approach rooted in Hierarchical Reinforcement Learning (HRL), to devise Time Slotted Channel Hopping (TSCH) schedules provisioning IIoT demand. HRL-TSCH employs dual policies: one at a higher level for TSCH schedule link management, and another at a lower level for timeslot and channel assignments. The proposed RL agents address a multi-objective problem, optimizing throughput, power efficiency, and network delay based on predefined application requirements. Simulation experiments demonstrate HRL-TSCH superiority over existing state-of-art approaches, effectively achieving an optimal balance between throughput, power consumption, and delay, thereby enhancing IIoT network performance.
