Resource Allocation and Secure Wireless Communication in the Large Model-based Mobile Edge Computing System
Zefan Wang, Yitong Wang, Jun Zhao
TL;DR
This work tackles privacy-preserving offsite-tuning of large models in a mobile-edge computing (MEC) setting, where data owners receive a lightweight adapter $\mathcal{A}_n$ and a compressed emulator $\mathcal{E}^*_n$ and return a fine-tuned adapter through a confidential uplink. It formulates a joint resource-allocation problem to maximize the utility-consumption ratio $\mathcal{U}/\mathcal{S}$ under physical-layer security, solved via a hybrid algorithm that combines the Dinkelbach transform, fractional programming, successive convex approximation, and alternating optimization. The solution decomposes into Subproblem 1 (optimizing adapter-extraction rate $\boldsymbol{\phi}$ and the auxiliary parameter $y$) and Subproblem 2 (optimizing bandwidth, power, GPU frequencies, and the time variable), with closed-form and convex reformulations derived through KKT and FP/SCA techniques. Experimental results show the proposed method outperforms baselines in energy, delay, and the overall utility-cost ratio, demonstrating practical viability for privacy-preserving LM fine-tuning at the network edge.
Abstract
With the rapid advancement of large models and mobile edge computing, transfer learning, particularly through fine-tuning, has become crucial for adapting models to downstream tasks. Traditionally, this requires users to share their data with model owners for fine-tuning, which is not only costly but also raises significant privacy concerns. Furthermore, fine-tuning large-scale models is computationally intensive and often impractical for many users. To tackle these challenges, we introduce a system that combines offsite-tuning with physical-layer security, which provides local data owners with a lightweight adapter and a compressed emulator. Data owners then fine-tune the adapter locally and securely send it back to the model owners through a confidential channel for integration, ensuring privacy and resource conservation. Our paper focuses on optimizing computational resource allocation among data owners and the large model owner deployed on edge, and on the compression ratio of adapters. We incorporate a secrecy uplink channel to maximize the utility that we defined while minimizing system costs like energy consumption and delay. The optimization uses the Dinkelbach algorithm, fractional programming, successive convex approximation and alternating optimization. Experiments demonstrate our algorithm's superiority over existing methods.
