SLIP: Securing LLMs IP Using Weights Decomposition
Yehonathan Refael, Adam Hakim, Lev Greenberg, Satya Lokam, Tal Aviv, Ben Fishman, Shachar Seidman, Racchit Jain, Jay Tenenbaum
TL;DR
SLIP addresses the risk of IP theft when deploying large language models on edge devices by partitioning model weights between a secure resource and a cost-efficient, insecure device using a SVD-based decomposition. The secure side retains top-k singular components to protect IP, while the remainder is offloaded, with a provably secure masking protocol that preserves accuracy and introduces minimal latency. The framework defines four core guarantees—Usefulness, Safety, Security, and Efficiency—and demonstrates applicability to MLPs, CNNs, and transformers, with experimental validation on GPT-2, Phi-2, and LLaMA2-7B showing robust IP protection even under fine-tuning attempts. This work provides a practical path for edge-enabled, IP-safe LLM deployment, balancing cost, latency, and security without sacrificing model quality.
Abstract
Large language models (LLMs) have recently seen widespread adoption in both academia and industry. As these models grow, they become valuable intellectual property (IP), reflecting substantial investments by their owners. The high cost of cloud-based deployment has spurred interest in running models on edge devices, but this risks exposing parameters to theft and unauthorized use. Existing approaches to protect model IP on the edge trade off practicality, accuracy, or deployment requirements. We introduce SLIP, a hybrid inference algorithm designed to protect edge-deployed models from theft. SLIP is, to our knowledge, the first hybrid protocol that is both practical for real-world applications and provably secure, while incurring zero accuracy degradation and minimal latency overhead. It partitions the model across two computing resources: one secure but expensive, and one cost-effective but vulnerable. Using matrix decomposition, the secure resource retains the most sensitive portion of the model's IP while performing only a small fraction of the computation; the vulnerable resource executes the remainder. The protocol includes security guarantees that prevent attackers from using the partition to infer the protected information. Finally, we present experimental results that demonstrate the robustness and effectiveness of our method, positioning it as a compelling solution for protecting LLMs.
