Minimizing Functions of Age of Incorrect Information for Remote Estimation
Ismail Cosandal, Sennur Ulukus, Nail Akar
TL;DR
This work tackles remote estimation with a general AoII-based cost by modeling a push-based, threshold-driven transmission policy as a discrete-time semi-Markov decision process. It introduces dual-regime absorbing Markov chains (DR-AMC) and dual-regime phase-type (DR-PH) distributions to accurately characterize the time-to-absorption and AoII evolution under a policy, enabling closed-form or efficiently computable expressions for SMDP parameters. The authors derive explicit parameter calculations for polynomial AoII penalties, implement a policy-iteration solution, and demonstrate that the resulting multi-threshold policy significantly outperforms benchmarks and matches exhaustive-search optimality. The framework provides a scalable and flexible approach to optimizing AoII-costs with arbitrary penalties in remote estimation systems.
Abstract
The age of incorrect information (AoII) process which keeps track of the time since the source and monitor processes are in sync, has been extensively used in remote estimation problems. In this paper, we consider a push-based remote estimation system with a discrete-time Markov chain (DTMC) information source transmitting status update packets towards the monitor once the AoII process exceeds a certain estimation-based threshold. In this paper, the time average of an arbitrary function of AoII is taken as the AoII cost, as opposed to using the average AoII as the mismatch metric, whereas this function is also allowed to depend on the estimation value. In this very general setting, our goal is to minimize a weighted sum of AoII and transmission costs. For this purpose, we formulate a discrete-time semi-Markov decision process (SMDP) regarding the multi-threshold status update policy. We propose a novel tool in discrete-time called 'dual-regime absorbing Markov chain' (DR-AMC) and its corresponding absorption time distribution named as 'dual-regime phase-type' (DR-PH) distribution, to obtain the characterizing parameters of the SMDP, which allows us to obtain the distribution of the AoII process for a given policy, and hence the average of any function of AoII. The proposed method is validated with numerical results by which we compare our proposed method against other policies obtained by exhaustive-search, and also various benchmark policies.
