Table of Contents
Fetching ...

Adaptive Device-Edge Collaboration on DNN Inference in AIoT: A Digital Twin-Assisted Approach

Shisheng Hu, Mushu Li, Jie Gao, Conghao Zhou, Xuemin Shen

TL;DR

The paper tackles adaptive device–edge collaboration for DNN inference in AIoT under dynamic workloads and unknown task arrivals. It introduces a digital twin framework with two DTs to (i) evaluate hypothetical offloading decisions during on-device inference and (ii) emulate workload evolution, enabling data-augmented learning and reduced signaling. The authors formulate a long-run utility objective balancing delay, accuracy, and energy, transform it into per-task online decisions via an optimal-stopping–based, learning-assisted policy implemented by ContValueNet, and derive conditions to prune the decision space. Simulation on AlexNet-based wiring demonstrates that the DT-assisted approach outperforms single-shot baselines, achieves favorable latency–accuracy–energy tradeoffs, and reduces decision complexity through principled space reduction. The work advances practical AIoT deployments by enabling robust, adaptive offloading decisions with limited signaling and unknown arrival statistics.

Abstract

Device-edge collaboration on deep neural network (DNN) inference is a promising approach to efficiently utilizing network resources for supporting artificial intelligence of things (AIoT) applications. In this paper, we propose a novel digital twin (DT)-assisted approach to device-edge collaboration on DNN inference that determines whether and when to stop local inference at a device and upload the intermediate results to complete the inference on an edge server. Instead of determining the collaboration for each DNN inference task only upon its generation, multi-step decision-making is performed during the on-device inference to adapt to the dynamic computing workload status at the device and the edge server. To enhance the adaptivity, a DT is constructed to evaluate all potential offloading decisions for each DNN inference task, which provides augmented training data for a machine learning-assisted decision-making algorithm. Then, another DT is constructed to estimate the inference status at the device to avoid frequently fetching the status information from the device, thus reducing the signaling overhead. We also derive necessary conditions for optimal offloading decisions to reduce the offloading decision space. Simulation results demon-strate the outstanding performance of our DT-assisted approach in terms of balancing the tradeoff among inference accuracy, delay, and energy consumption.

Adaptive Device-Edge Collaboration on DNN Inference in AIoT: A Digital Twin-Assisted Approach

TL;DR

The paper tackles adaptive device–edge collaboration for DNN inference in AIoT under dynamic workloads and unknown task arrivals. It introduces a digital twin framework with two DTs to (i) evaluate hypothetical offloading decisions during on-device inference and (ii) emulate workload evolution, enabling data-augmented learning and reduced signaling. The authors formulate a long-run utility objective balancing delay, accuracy, and energy, transform it into per-task online decisions via an optimal-stopping–based, learning-assisted policy implemented by ContValueNet, and derive conditions to prune the decision space. Simulation on AlexNet-based wiring demonstrates that the DT-assisted approach outperforms single-shot baselines, achieves favorable latency–accuracy–energy tradeoffs, and reduces decision complexity through principled space reduction. The work advances practical AIoT deployments by enabling robust, adaptive offloading decisions with limited signaling and unknown arrival statistics.

Abstract

Device-edge collaboration on deep neural network (DNN) inference is a promising approach to efficiently utilizing network resources for supporting artificial intelligence of things (AIoT) applications. In this paper, we propose a novel digital twin (DT)-assisted approach to device-edge collaboration on DNN inference that determines whether and when to stop local inference at a device and upload the intermediate results to complete the inference on an edge server. Instead of determining the collaboration for each DNN inference task only upon its generation, multi-step decision-making is performed during the on-device inference to adapt to the dynamic computing workload status at the device and the edge server. To enhance the adaptivity, a DT is constructed to evaluate all potential offloading decisions for each DNN inference task, which provides augmented training data for a machine learning-assisted decision-making algorithm. Then, another DT is constructed to estimate the inference status at the device to avoid frequently fetching the status information from the device, thus reducing the signaling overhead. We also derive necessary conditions for optimal offloading decisions to reduce the offloading decision space. Simulation results demon-strate the outstanding performance of our DT-assisted approach in terms of balancing the tradeoff among inference accuracy, delay, and energy consumption.
Paper Structure (28 sections, 52 equations, 13 figures, 1 table, 1 algorithm)

This paper contains 28 sections, 52 equations, 13 figures, 1 table, 1 algorithm.

Figures (13)

  • Figure 1: System model.
  • Figure 2: Task queuing model.
  • Figure 3: DT-Assisted approach to adaptive device-edge collaboration on DNN inference.
  • Figure 4: Decomposition of the on-device queuing delay $T_n^{\rm lq}$.
  • Figure 5: DT and learning-assisted algorithm for DNN task offloading decision-making.
  • ...and 8 more figures

Theorems & Definitions (4)

  • proof
  • proof
  • proof
  • proof