Pragmatic Communication for Remote Control of Finite-State Markov Processes
Pietro Talli, Edoardo David Santi, Federico Chiariotti, Touraj Soleymani, Federico Mason, Andrea Zanella, Deniz Gündüz
TL;DR
The paper develops a general theory for pragmatic, goal-oriented remote control of finite-state Markov processes over costly zero-delay channels, introducing pull- and push-based architectures and three optimization algorithms. It proves that push-based policies strictly improve over pull-based ones and provides formal optimality guarantees and complexity results for each algorithm, including an exact POMDP-based Joint Policy Optimization with exponential cost and two polynomial-time, locally optimal alternatives. The framework captures implicit information from non-transmissions and yields scalable approaches that perform well in remote control tasks, with remote estimation posing greater computational challenges. The findings offer a principled path to design efficient, cooperative encoder–decoder strategies for cyber-physical systems under tight communication resources, with potential extensions to delays, losses, and multi-node settings.
Abstract
Pragmatic or goal-oriented communication can optimize communication decisions beyond the reliable transmission of data, instead aiming at directly affecting application performance with the minimum channel utilization. In this paper, we develop a general theoretical framework for the remote control of finite-state Markov processes, using pragmatic communication over a costly zero-delay communication channel. To that end, we model a cyber-physical system composed of an encoder, which observes and transmits the states of a process in real-time, and a decoder, which receives that information and controls the behavior of the process. The encoder and the decoder should cooperatively optimize the trade-off between the control performance (i.e., reward) and the communication cost (i.e., channel use). This scenario underscores a pragmatic (i.e., goal-oriented) communication problem, where the purpose is to convey only the data that is most valuable for the underlying task, taking into account the state of the decoder (hence, the pragmatic aspect). We investigate two different decision-making architectures: in pull-based remote control, the decoder is the only decision-maker, while in push-based remote control, the encoder and the decoder constitute two independent decision-makers, leading to a multi-agent scenario. We propose three algorithms to optimize our system (i.e., design the encoder and the decoder policies), discuss the optimality guarantees ofs the algorithms, and shed light on their computational complexity and fundamental limits.
