MDP-based Energy-aware Task Scheduling for Battery-less IoT
Shahab Jahanbazi, Mateen Ashraf, Onel L. A. López
TL;DR
This work addresses task scheduling for battery-less IoT devices under stochastic energy harvesting by formulating the problem as a Markov decision process (MDP). It derives an optimal stationary threshold-based (OSTB) policy that maximizes long-term task completion while minimizing power failures, using a two-layer transition model (micro and macro) and two reward schemes (basic and sigmoid-based). A unichain property is proven and the policy structure is shown to be threshold-based in capacitor voltage within each superstate, enabling efficient computation via linear programming. Simulation demonstrates that OSTB outperforms the ALAP baseline in task completion rate, power-failure reduction, and latency, with larger capacitors reducing the gap, indicating practical benefits for ultra-low-power, energy-harvesting IoT deployments.
Abstract
Realizing high long-term task completion rates represents a fundamental challenge in battery-less Internet of Things (IoT) devices powered by ambient energy harvesting. This difficulty is primarily due to the stochastic and time-varying characteristics of the available energy, which significantly complicate the design of optimal task scheduling policies. In this paper, we consider a battery-less IoT device that must periodically report sensing measurements to a monitoring center. We adopt the Markov decision process (MDP) framework to handle energy variability while aiming to maximize the long-term task completion rate. For this, we first identify its components and then define two appropriate reward functions. We demonstrate the inherent properties associated with the MDP formulation and the related optimal policy. Subsequently, we solve the resulting optimization problem, leading to the optimal stationary threshold-based (OSTB) scheduling. Simulation results demonstrate that OSTB outperforms the well-known ``as late as possible'' (ALAP) scheduling strategy. For instance, an $8.6\%$ increase in the task completion rate, along with a $65\%$ reduction in power failures and a $86.29\%$ decrease in execution delays during task execution are registered assuming a $4.7$ mF capacitor.
