Secure Resource Allocation via Constrained Deep Reinforcement Learning
Jianfei Sun, Qiang Gao, Cong Wu, Yuxian Li, Jiacheng Wang, Dusit Niyato
TL;DR
The paper tackles secure, efficient resource allocation in serverless multi-cloud edge environments under IoT/6G demands. It introduces SARMTO, a framework built on action-constrained DRL (AC-DQN) with an MDP formulation, adaptive security, and robust optimization techniques to jointly optimize task offloading, resources, and security. Key contributions include embedding feasibility constraints within the learning process, modeling security overhead, and demonstrating substantial improvements in system cost and energy efficiency across diverse scenarios. The findings suggest SARMTO enables scalable, secure resource management for heterogeneous edge-cloud ecosystems, with practical implications for next-generation IoT and edge applications. $C(\pi)=\alpha_1 T(\pi)+\alpha_2 E(\pi)$ and related security overheads underpin the optimization, highlighting the framework's balance between latency, energy, and protection.
Abstract
The proliferation of Internet of Things (IoT) devices and the advent of 6G technologies have introduced computationally intensive tasks that often surpass the processing capabilities of user devices. Efficient and secure resource allocation in serverless multi-cloud edge computing environments is essential for supporting these demands and advancing distributed computing. However, existing solutions frequently struggle with the complexity of multi-cloud infrastructures, robust security integration, and effective application of traditional deep reinforcement learning (DRL) techniques under system constraints. To address these challenges, we present SARMTO, a novel framework that integrates an action-constrained DRL model. SARMTO dynamically balances resource allocation, task offloading, security, and performance by utilizing a Markov decision process formulation, an adaptive security mechanism, and sophisticated optimization techniques. Extensive simulations across varying scenarios, including different task loads, data sizes, and MEC capacities, show that SARMTO consistently outperforms five baseline approaches, achieving up to a 40% reduction in system costs and a 41.5% improvement in energy efficiency over state-of-the-art methods. These enhancements highlight SARMTO's potential to revolutionize resource management in intricate distributed computing environments, opening the door to more efficient and secure IoT and edge computing applications.
