A Middle Path for On-Premises LLM Deployment: Preserving Privacy Without Sacrificing Model Confidentiality
Hanbo Huang, Yihan Li, Bowen Jiang, Bo Jiang, Lin Liu, Ruoyu Sun, Zhuotao Liu, Shiyu Liang
TL;DR
The paper addresses the challenge of privacy-preserving on-premises LLM deployment by showing that protecting only the output layer leaves models vulnerable to query-based distillation. It introduces SOLID, a semi-open deployment framework that secures a minimal set of bottom decoder layers guided by a distillation-difficulty metric, and proves a transition-layer phenomenon that favors bottom-layer protection. Through extensive experiments on open-source decoders from 1.3B to 70B parameters, SOLID achieves security levels comparable to fully secured deployments while preserving customization capabilities close to full fine-tuning, and demonstrates robustness to attack dataset size. The study provides a practical, tuning-free approach to balance security and customization in on-prem LLMs, with implications for privacy-sensitive domains and future work to extend defenses beyond distillation alone.
Abstract
Privacy-sensitive users require deploying large language models (LLMs) within their own infrastructure (on-premises) to safeguard private data and enable customization. However, vulnerabilities in local environments can lead to unauthorized access and potential model theft. To address this, prior research on small models has explored securing only the output layer within hardware-secured devices to balance model confidentiality and customization. Yet this approach fails to protect LLMs effectively. In this paper, we discover that (1) query-based distillation attacks targeting the secured top layer can produce a functionally equivalent replica of the victim model; (2) securing the same number of layers, bottom layers before a transition layer provide stronger protection against distillation attacks than top layers, with comparable effects on customization performance; and (3) the number of secured layers creates a trade-off between protection and customization flexibility. Based on these insights, we propose SOLID, a novel deployment framework that secures a few bottom layers in a secure environment and introduces an efficient metric to optimize the trade-off by determining the ideal number of hidden layers. Extensive experiments on five models (1.3B to 70B parameters) demonstrate that SOLID outperforms baselines, achieving a better balance between protection and downstream customization.
