Joint Transmission and Control in a Goal-oriented NOMA Network
Kunpeng Liu, Shaohua Wu, Aimin Li, Qinyu Zhang
TL;DR
The paper introduces the Goal-oriented Tensor (GoT) as a closed-loop utility metric for goal-oriented remote control over wireless links and formulates a joint transmission-control problem in a pull-based NOMA network as a partially observable Markov decision process (POMDP). A belief-state construction is derived to address partial observability, and a Double-Dueling Deep Q-Network (D3QN) is trained to adapt power allocation and control actions. The GoT for each source is GoT_{t,i}^{\pi}=C_1(X_{t,i},\phi_t)+\alpha C_2(P_{t,i})+\beta C_3(C_{t,i}), and the long-term objective minimizes the average GoT. Simulation results show a fundamental trade-off between transmission efficiency and control fidelity, with NOMA outperforming OMA in multi-loop remote control scenarios, highlighting GoT-based optimization as a promising framework for goal-oriented networks. GoT-based framework for goal-oriented wireless control, framed as a POMDP with belief-state augmentation and solved via D3QN, demonstrates superior utility and actionable trade-offs in multi-loop NOMA remote control systems, with $GoT_{t,i}^{\pi}$ guiding power and actuation decisions.
Abstract
Goal-oriented communication shifts the focus from merely delivering timely information to maximizing decision-making effectiveness by prioritizing the transmission of high-value information. In this context, we introduce the Goal-oriented Tensor (GoT), a novel closed-loop metric designed to directly quantify the ultimate utility in Goal-oriented systems, capturing how effectively the transmitted information meets the underlying application's objectives. Leveraging the GoT, we model a Goal-oriented Non-Orthogonal Multiple Access (NOMA) network comprising multiple transmission-control loops. Operating under a pull-based framework, we formulate the joint optimization of transmission and control as a Partially Observable Markov Decision Process (POMDP), which we solve by deriving the belief state and training a Double-Dueling Deep Q-Network (D3QN). This framework enables adaptive decision-making for power allocation and control actions. Simulation results reveal a fundamental trade-off between transmission efficiency and control fidelity. Additionally, the superior utility of NOMA over Orthogonal Multiple Access (OMA) in multi-loop remote control scenarios is demonstrated.
