Non-orthogonal Age-Optimal Information Dissemination in Vehicular Networks: A Meta Multi-Objective Reinforcement Learning Approach

A. A. Habob; H. Tabassum; O. Waqar

Non-orthogonal Age-Optimal Information Dissemination in Vehicular Networks: A Meta Multi-Objective Reinforcement Learning Approach

A. A. Habob, H. Tabassum, O. Waqar

TL;DR

This article considers minimizing the age-of-information (AoI) and transmit power consumption in a vehicular network, where a roadside unit (RSU) provides timely updates about a set of physical processes to vehicles, and presents a two-stage meta-multi-objective reinforcement learning solution to estimate the Pareto front.

Abstract

This paper considers minimizing the age-of-information (AoI) and transmit power consumption in a vehicular network, where a roadside unit (RSU) provides timely updates about a set of physical processes to vehicles. We consider non-orthogonal multi-modal information dissemination, which is based on superposed message transmission from RSU and successive interference cancellation (SIC) at vehicles. The formulated problem is a multi-objective mixed-integer nonlinear programming problem; thus, a Pareto-optimal front is very challenging to obtain. First, we leverage the weighted-sum approach to decompose the multi-objective problem into a set of multiple single-objective sub-problems corresponding to each predefined objective preference weight. Then, we develop a hybrid deep Q-network (DQN)-deep deterministic policy gradient (DDPG) model to solve each optimization sub-problem respective to predefined objective-preference weight. The DQN optimizes the decoding order, while the DDPG solves the continuous power allocation. The model needs to be retrained for each sub-problem. We then present a two-stage meta-multi-objective reinforcement learning solution to estimate the Pareto front with a few fine-tuning update steps without retraining the model for each sub-problem. Simulation results illustrate the efficacy of the proposed solutions compared to the existing benchmarks and that the meta-multi-objective reinforcement learning model estimates a high-quality Pareto frontier with reduced training time.

Non-orthogonal Age-Optimal Information Dissemination in Vehicular Networks: A Meta Multi-Objective Reinforcement Learning Approach

TL;DR

Abstract

Paper Structure (24 sections, 25 equations, 16 figures, 3 tables, 3 algorithms)

This paper contains 24 sections, 25 equations, 16 figures, 3 tables, 3 algorithms.

Introduction
Related Works
Motivation and Contributions
System Model and Performance Metrics
Network Model
Received Signal and SINR Model
Decoding Error Probability
Age of Information
A Toy Example
Multi-objective Problem Statement
Hybrid DQN-DDPG -Based DRL Solution
Theoretical Preliminaries
DRL Model
The Environment State Space
Action Space
...and 9 more sections

Figures (16)

Figure 1: System model with $V=4$ vehicles, $F=4$ processes, information demand $\mathbf{R}$ in \ref{['demR']}, and a decoding order decision $\bm{\pi}^{(t)} = [1, 2, 3, 4]$.
Figure 2: Hybrid DQN-DDPG model for age-optimum information dissemination.
Figure 3: The learning curves for the hybrid DQN-DDPG DRL model versus the number of training episodes.
Figure 4: Objective function in \ref{['Obj']} versus the relative weight $\zeta$ with $V=10$ vehicles, $F=4$ processes and $\abs{\mathcal{R}_i}=2$.
Figure 5: Average AoI and power expenditure versus the relative weight $\zeta$ with $V=10$ vehicles, processes $F=4$ processes and $\abs{\mathcal{R}_i}=2$.
...and 11 more figures

Non-orthogonal Age-Optimal Information Dissemination in Vehicular Networks: A Meta Multi-Objective Reinforcement Learning Approach

TL;DR

Abstract

Non-orthogonal Age-Optimal Information Dissemination in Vehicular Networks: A Meta Multi-Objective Reinforcement Learning Approach

Authors

TL;DR

Abstract

Table of Contents

Figures (16)