Multi-Agent RL-Based Industrial AIGC Service Offloading over Wireless Edge Networks
Siyuan Li, Xi Lin, Hansong Xu, Kun Hua, Xiaomin Jin, Gaolei Li, Jianhua Li
TL;DR
This work addresses the challenge of deploying generative AIGC at the network edge for industrial IoT by introducing GMEL, a diffusion-model–driven edge learning framework, and AMARL, an attention-enhanced multi-agent reinforcement learning algorithm for offloading AIGC tasks. The approach jointly optimizes computation and communication resources for heterogeneous AIGC workloads, via a centralized training–decentralized execution MARL paradigm with a multi-head attention critic to handle large joint state spaces. The authors demonstrate that AMARL achieves faster convergence and higher task completion rates than baseline DRL methods, reducing overall system latency and improving scalability as the number of IEDs grows. These results suggest that diffusion-based AIGC at the network edge, coupled with attention-driven MARL, can enable practical, privacy-preserving, low-latency industrial AIGC services with few-shot learning capabilities.
Abstract
Currently, the generative model has garnered considerable attention due to its application in addressing the challenge of scarcity of abnormal samples in the industrial Internet of Things (IoT). However, challenges persist regarding the edge deployment of generative models and the optimization of joint edge AI-generated content (AIGC) tasks. In this paper, we focus on the edge optimization of AIGC task execution and propose GMEL, a generative model-driven industrial AIGC collaborative edge learning framework. This framework aims to facilitate efficient few-shot learning by leveraging realistic sample synthesis and edge-based optimization capabilities. First, a multi-task AIGC computational offloading model is presented to ensure the efficient execution of heterogeneous AIGC tasks on edge servers. Then, we propose an attention-enhanced multi-agent reinforcement learning (AMARL) algorithm aimed at refining offloading policies within the IoT system, thereby supporting generative model-driven edge learning. Finally, our experimental results demonstrate the effectiveness of the proposed algorithm in optimizing the total system latency of the edge-based AIGC task completion.
